I manually collected 10 minutes of dog barking audio from YouTube and cleaned the data in Adobe Premiere Pro by removing silence and non-barking sounds. Using a Python script, I sliced the audio into 1-second segments and converted them into Mel-spectrograms to serve as input features .

I trained a simple neural network on this dataset; however, due to the limited data size, the model’s accuracy is currently below 50%.