Demystifying Data Preparation for Large Language Models (LLMs)