About Me

I’m a Data Science student at UCF’s Burnett Honors College (GPA 3.83), starting the M.S. in Statistics & Data Science in Fall 2026 as a Graduate Teaching Assistant. I work at the intersection of statistical theory and production ML — building things that actually ship.

At Insurity, Inc., I’ve built RAG/LLM prototypes, migrated 175 enterprise reports to Power BI ($700K cost avoidance at 17x industry speed), and architected HIPAA-compliant audit infrastructure covering 23 PHI tables. My honors thesis, advised by Dr. Chudamani Poudyal (SDMSS, UCF), proved Lipschitz stability bounds (L = 24.72) for CNN-based EEG seizure detection and achieved 97% accuracy on biological time-series data. I also fine-tuned Qwen2.5-Coder-14B via QLoRA on 10,795 curated data science examples and published the result on HuggingFace.

Outside work: Eagle Scout, Epilepsy Foundation Ambassador, drummer, and Arch Linux enthusiast (Hyprland).


Projects

DataSci-Coder: Fine-Tuned LLM for Data Science
Fine-tuned Qwen2.5-Coder-14B via QLoRA on 10,795 curated examples spanning statistics, ML, and deep learning. Achieves 93.3% instruction compliance vs. 91.4% for the base model (+1.9 percentage points), with 10/10 code-only outputs vs. 6/10 for the base model.
HuggingFace  ·  GitHub

EEG Seizure Detection — Honors Thesis
1D CNN with a Butterworth filtering pipeline, achieving 97% accuracy and 0.99 AUC on EEG time series. Formally proved Lipschitz stability bounds (L = 24.72), connecting the deep learning application to mathematical theory.
Published Thesis  ·  GitHub
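
For readers unfamiliar with Lipschitz bounds, here is a minimal sketch of one standard way to upper-bound a feed-forward network's Lipschitz constant: multiply the spectral norms of its layers, which is valid when the activations (such as ReLU) are 1-Lipschitz. This is an illustration only, not the thesis's proof or code, and it does not reproduce L = 24.72. The toy dense matrices and the helper names spectral_norm and lipschitz_upper_bound are hypothetical; a real 1D CNN would need the operator norm of each convolution instead.

```python
import math
import random


def spectral_norm(W, iters=200, seed=0):
    """Estimate the largest singular value of W (a list of rows)
    by power iteration on W^T W."""
    rng = random.Random(seed)
    n_rows, n_cols = len(W), len(W[0])
    # Strictly positive start vector (illustrative choice).
    v = [rng.random() + 0.1 for _ in range(n_cols)]
    for _ in range(iters):
        u = [sum(w * x for w, x in zip(row, v)) for row in W]            # u = W v
        v = [sum(W[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]  # v = W^T u
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in v]
    u = [sum(w * x for w, x in zip(row, v)) for row in W]
    return math.sqrt(sum(x * x for x in u))


def lipschitz_upper_bound(layers):
    """Product of per-layer spectral norms: an upper bound on the
    Lipschitz constant when every activation is 1-Lipschitz."""
    bound = 1.0
    for W in layers:
        bound *= spectral_norm(W)
    return bound


# Toy two-layer example (hypothetical weights, not from the thesis).
layers = [
    [[3.0, 0.0], [0.0, 1.0]],  # largest singular value 3
    [[0.0, 2.0], [0.5, 0.0]],  # largest singular value 2
]
bound = lipschitz_upper_bound(layers)
print(round(bound, 6))  # 6.0
```

A bound of this kind says that an input perturbation of size d can change the output by at most bound * d, which is the sense in which a Lipschitz constant describes stability.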

AI Driver Risk Scoring Platform
End-to-end telematics pipeline with 47 engineered behavioral features, 0.98 ROC-AUC, and a real-time Streamlit dashboard with SHAP explainability and gamification.
GitHub

Active Research Project
AI research at UCF — details coming soon.


Skills

Languages: Python · R · SQL · Julia · Bash · JavaScript · C
ML / Deep Learning: PyTorch · Scikit-learn · TensorFlow · QLoRA/PEFT · HuggingFace Transformers · PyTorch Geometric · XGBoost
LLM & NLP: RAG systems · LLM fine-tuning · prompt engineering · Anthropic/OpenAI/Azure APIs
Data & Visualization: Pandas · NumPy · Plotly · Streamlit · Power BI · SHAP
Statistics: Time Series · Survival Analysis · Stochastic Processes · Bayesian Methods · Signal Processing · A/B Testing
Infrastructure: Azure · AWS · Docker · SLURM/HPC · Git


Contact

jacksonSmall@ucf.edu  ·  GitHub  ·  LinkedIn  ·  HuggingFace