1. What is sound

    1. A sequence of vibrations: pressure that varies in strength over time as the wave travels through a medium such as air
  2. What is the Fourier transform

  3. What is a spectrogram

    1. visualizing sound

      1. waveplot

        https://s3-us-west-2.amazonaws.com/secure.notion-static.com/eb3fd66d-d800-46ce-beb6-10f864ef3400/Untitled.png

      2. fourier transform (frequency domain)

        The Fourier transform is a function that takes a signal in the time domain as input and outputs its decomposition into frequencies, i.e. how strongly each frequency is present in the signal.

        https://s3-us-west-2.amazonaws.com/secure.notion-static.com/3680baa7-1998-49cf-90e6-f0a2a2374464/Untitled.png

        https://s3-us-west-2.amazonaws.com/secure.notion-static.com/c10d1fa7-cba8-4df1-9816-d9a3612db076/Untitled.png

      3. A spectrogram is a visual representation of the spectrum of frequencies of a signal as it varies with time.

      https://s3-us-west-2.amazonaws.com/secure.notion-static.com/4c31f9b0-08f6-41f1-b03b-8424b9420a81/Untitled.png
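To make the difference between the frequency-domain view and the spectrogram concrete, here is a minimal sketch using only the Python standard library. It assumes a made-up test signal (a 100 Hz tone followed by a 200 Hz tone) and hypothetical values for `SAMPLE_RATE`, `FRAME`, and `HOP`; none of these come from the notes. The naive DFT below is for illustration only; in practice one would more likely reach for an FFT routine from a numerical library.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Magnitudes of the non-negative frequency bins of a naive DFT."""
    n = len(samples)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def spectrogram(samples, frame_size, hop):
    """Cut the signal into overlapping frames, apply a Hann window, DFT each frame."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_size) for i in range(frame_size)]
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = [samples[start + i] * window[i] for i in range(frame_size)]
        frames.append(dft_magnitudes(frame))
    return frames


SAMPLE_RATE = 800  # Hz (hypothetical, chosen to keep the arithmetic simple)
FRAME = 64         # samples per frame (hypothetical)
HOP = 32           # samples between frame starts (hypothetical)

# Test signal: 0.5 s at 100 Hz, then 0.5 s at 200 Hz.
signal = [
    math.sin(2 * math.pi * (100 if t < 400 else 200) * t / SAMPLE_RATE)
    for t in range(800)
]

# Frequency domain of the whole signal: shows WHICH frequencies occur, not WHEN.
spectrum = dft_magnitudes(signal)
hz_per_bin = SAMPLE_RATE / len(signal)
top_two_bins = sorted(sorted(range(len(spectrum)), key=lambda k: spectrum[k])[-2:])
top_two_hz = [k * hz_per_bin for k in top_two_bins]

# Spectrogram: one spectrum per time frame, so the change over time is visible.
spec = spectrogram(signal, FRAME, HOP)
frame_hz_per_bin = SAMPLE_RATE / FRAME
peak_first = max(range(len(spec[0])), key=lambda k: spec[0][k]) * frame_hz_per_bin
peak_last = max(range(len(spec[-1])), key=lambda k: spec[-1][k]) * frame_hz_per_bin

print("strongest frequencies overall:", top_two_hz)
print("peak in first frame:", peak_first, "Hz; peak in last frame:", peak_last, "Hz")
```

The whole-signal spectrum contains both tones at once, while the per-frame peaks move from the first tone to the second. That time ordering is the extra information a spectrogram adds.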

  4. What is the mel scale

    1. The result of a non-linear transformation of the frequency scale. The mel scale is constructed so that sounds that are an equal distance apart on the mel scale also sound equally far apart to human listeners.

      It partitions the Hz scale into bins and maps each bin to a corresponding bin on the mel scale, using overlapping triangular filters.

      https://s3-us-west-2.amazonaws.com/secure.notion-static.com/4895ba3c-2d6c-44a8-aad0-c64e0074034b/Untitled.png

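      The mel scale is a scale-conversion function that reflects the threshold at which humans perceive differences in pitch. As shown above, the [Hz -> musical note] relationship is exponential, so the idea is not to treat frequency linearly as it is, but to pass it through a log function to convert it to the mel scale and then treat that linearly.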

      https://s3-us-west-2.amazonaws.com/secure.notion-static.com/19ddc277-5fbc-4450-ab60-86d47ab6e325/Untitled.png
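The log-based conversion and the overlapping triangular filters can be sketched as follows. The notes do not say which formula is meant, so this assumes one commonly used variant, m = 2595 * log10(1 + f / 700). The function names, the 0 to 8000 Hz range, and the choice of 10 filters are all hypothetical and only for illustration.

```python
import math


def hz_to_mel(hz):
    """Hz -> mel, using the assumed formula m = 2595 * log10(1 + f / 700)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(f_min, f_max, n_filters):
    """Return n_filters + 2 edge frequencies in Hz, equally spaced on the mel scale."""
    m_min, m_max = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (m_max - m_min) / (n_filters + 1)
    return [mel_to_hz(m_min + i * step) for i in range(n_filters + 2)]


def triangular_weight(freq, left, center, right):
    """Weight of a triangular filter that rises from left to center, then falls to right."""
    if left < freq <= center:
        return (freq - left) / (center - left)
    if center < freq < right:
        return (right - freq) / (right - center)
    return 0.0


N_FILTERS = 10  # hypothetical filter count
edges = mel_band_edges(0.0, 8000.0, N_FILTERS)

# Filter i spans edges[i] .. edges[i + 2] and peaks at edges[i + 1],
# so each filter overlaps its neighbours by half.
widths_hz = [edges[i + 2] - edges[i] for i in range(N_FILTERS)]

for i, width in enumerate(widths_hz):
    print(f"filter {i}: center {edges[i + 1]:.1f} Hz, width {width:.1f} Hz")
```

The edges are evenly spaced in mel but the filter widths in Hz grow with frequency, which matches the idea that equal steps on the mel scale correspond to progressively larger steps in Hz.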