Using Python (pandas) and HTML to process CSV files