Extracting Audio datasets for machine learning