TIL my simple chatbot idea needed way more training data than I thought
I wanted to make a bot that could answer questions about my town's bus schedule, figured it would take a weekend. I mean, the info is all online, right? I spent two whole weeks just cleaning and formatting the timetable data from the city's PDFs before the model could even start learning the routes. Has anyone else had a project balloon in time because of data prep?