UFO Sightings: Machine Learning Pipeline
Analyzed 80K+ UFO reports to uncover structure in noisy, witness-reported data using a full cleaning, clustering, and classification workflow.
what I built
- Built a preprocessing pipeline for cleaning text-heavy reports and engineering usable features.
- Applied K-means and DBSCAN to explore hidden structure before moving into supervised modeling.
- Trained an MLPClassifier and iterated on the pipeline to improve identification performance.
result / signal
- Worked on 80K+ records
- Improved performance from ~1.2% to 7.6%