CV

Jackson Small

jacksonSmall@ucf.edu
407-907-5072
Orlando, FL, US

Summary

Data Science student at UCF Burnett Honors College (GPA 3.83). Admitted to the UCF M.S. in Statistics and Data Science program (Fall 2026) with a Graduate Teaching Assistantship. First Data Science undergraduate to publish in the UCF STARS Honors Undergraduate Thesis repository.

Education

  • M.S. in Statistics and Data Science
    2028-05-01
    University of Central Florida
  • B.S. in Data Science, Minor in Statistics
    2026-05-01
    University of Central Florida, Burnett Honors College
    GPA: 3.83
    Courses: Applied Time Series (STA 4852), Statistical Learning (STA 4364), Survival Analysis (STA 4365), Stochastic Processes (STA 4322), Biostatistics (STA 4173), Regression (STA 4164), Unsupervised Learning (STA 4724), Multivariate Analysis (ISC 4241), Mathematical Modeling (MAP 4191), Big Data Analytics (STA 4163)

Work Experience

  • AI Associate Developer
    2025-10-01 - Present
    Insurity, Inc.
    Full-stack enterprise development: RAG/LLM development, big data migrations, HIPAA compliance infrastructure, and SDK integrations.
    • Built and deployed RAG/LLM prototypes using Anthropic, OpenAI, and Replit APIs for core insurance product features
    • Migrated 175 SSRS reports to Power BI in 3 weeks ($700,000 cost avoidance, 17x industry speed, 1.1M+ lines XML, 187 DB connections, 100% pass rate)
    • Architected a dual-layer HIPAA audit system for 23 PHI tables (9,196 lines of C#/SQL), achieving 85-96% storage reduction with zero audit gaps
    • Delivered enterprise SDK integration (10,723 LOC, 334 automated test checks) in 22 days vs. 3-6 month industry standard
  • Research Assistant (Data Science) Intern
    2024-05-01 - 2024-08-01
    Design Interactive, Inc.
    Human systems research: VR/AR simulations and behavioral data analysis.
    • Built automated data validation pipelines in Python and SQL for large human-systems research datasets
    • Led QA on VR/AR Unity simulations; validated eye-tracking software for participant gaze capture
    • Conducted statistical analysis on behavioral data across experimental conditions; led pilot studies (~300 participants)

Skills

Languages

  • Python
  • R
  • SQL
  • Julia
  • Bash
  • JavaScript
  • C
  • PowerShell

ML & Deep Learning

  • PyTorch
  • TensorFlow
  • Scikit-learn
  • XGBoost
  • PyTorch Geometric
  • QLoRA/PEFT
  • Hugging Face Transformers
  • SMOTE

LLM & NLP

  • RAG systems
  • LLM fine-tuning
  • prompt engineering
  • Anthropic API
  • OpenAI API
  • Azure OpenAI
  • Llama
  • Qwen
  • BioMistral

Data Science

  • Pandas
  • NumPy
  • SciPy
  • Statsmodels
  • Matplotlib
  • Seaborn
  • Plotly
  • SHAP

Statistics & Methods

  • Time Series
  • Survival Analysis
  • Stochastic Processes
  • Bayesian Methods
  • Multivariate Analysis
  • Signal Processing
  • A/B Testing
  • Cross-Validation

Tools & Cloud

  • Azure
  • AWS
  • Docker
  • Git/GitHub
  • FastAPI
  • Streamlit
  • Power BI
  • SLURM/HPC
  • Jupyter
  • VS Code

Publications

  • Theoretical Analysis of CNNs for Automatic Seizure Detection in EEG Signals
    2025
    UCF STARS Honors Undergraduate Theses
    First Data Science undergraduate to publish in the UCF STARS Honors Undergraduate Thesis repository. Built a 1D CNN with a Butterworth filtering pipeline, achieving 97% accuracy and 0.99 AUC on EEG seizure detection. Formally proved Lipschitz stability bounds (L = 24.72). Advised by Dr. Chudamani Poudyal (SDMSS, UCF).

Presentations

  • Theoretical Analysis of CNNs for Automatic Seizure Detection in EEG Signals
    2026
    UCF Student Scholar Symposium
    Orlando, FL, USA
    Poster presentation of Honors Undergraduate Thesis research
  • Theoretical Analysis of CNNs for Automatic Seizure Detection in EEG Signals
    2025
    Burnett Honors College Family Weekend
    Orlando, FL, USA
    Poster presentation of Honors Undergraduate Thesis research

Portfolio

  • DataSci-Coder: Fine-Tuned LLM for Data Science
    2026
    Portfolio
    Fine-tuned Qwen2.5-Coder-14B via QLoRA on 10,795 curated data science examples. Achieved 93.3% instruction compliance versus 91.4% for the base model. Published on Hugging Face.
  • Research Project (Coming Soon)
    2026
    Portfolio
    Active research project; details to be published upon completion.
  • Honors Thesis: CNN EEG Seizure Detection
    2025
    Portfolio
    Published UCF Honors thesis. 97% accuracy, 0.99 AUC, Lipschitz stability bounds (L=24.72).
  • AI Driver Risk Scoring Platform
    2025
    Portfolio
    End-to-end telematics platform with 0.98 ROC-AUC, 47 engineered features, real-time Streamlit dashboard.