Optimizing LLMs: Enhancing Data Preprocessing Techniques