Data Scientist & Developer

William
Lonon.

I turn data into decisions — machine learning, research, and the tools that put models to work. I study how systems behave, measure what they actually do, and build the pipelines that make the answers usable.

Python

Machine Learning

Statistics

SQL

Pandas

scikit-learn

Data Viz

LLM APIs

See the work → GitHub ↗

About

I'm a data scientist and developer based in Arkansas. I hold a B.S. in Data Science from the University of Arkansas, where I completed honors research on the behavior and biases of generative AI.

My work sits where analysis meets engineering: framing the question, building the model, measuring it honestly, and shipping the tool that delivers the result. I'm as comfortable in a Jupyter notebook as I am wiring up an API or deploying a site.

I care about rigor and clarity in equal measure — research you can trust, and results you can act on. I'm currently open to data science and analytics roles.

Beyond the numbers, I run an independent music label and build software for the things I care about. The full builder portfolio lives at williamlonon.dev.

William Lonon · Data Scientist · Fayetteville, AR

Research

Published, peer-mentored research at the intersection of data science and AI safety.

Honors Thesis · 2026

Recursion, Regurgitation, and Regeneration

B.S. Honors Thesis · Data Science, University of Arkansas · Advised by Dr. Karl Schubert

Testing the limits and revealing the biases of generative AI through multimodal feedback loops. I ran GPT-4o and DALL·E in a "telephone game" — chaining image captioning and text-to-image generation over many iterations — and measured how bias compounds across cycles using CLIP similarity and facial-recognition metrics to track semantic drift, identity preservation, and information loss.

Models hedged away from naming a controversial public figure's real-world associations up to ~1800% more often than on standalone images
Documented systematic identity drift across regenerations (e.g., Martin Luther King Jr. morphing toward Adolf Hitler)
Exposed measurable gaps between stated content policies and actual model behavior

Python GPT-4o DALL·E CLIP Async Pipelines

Read thesis ↗ Source ↗

Selected Work

Applied data science and machine learning — from audio ML to LLM-driven automation.

Audio ML

Samplebot-3000

A desktop tool that turns any audio file into a playable sampler instrument. It slices a recording at every onset, fingerprints each slice by timbre, and clusters acoustically similar sounds onto the same key — an unsupervised ML pipeline you can play with your keyboard.

Onset slicing + timbral feature extraction with librosa
Three clustering modes: KMeans, Agglomerative, and HDBSCAN
Real-time playback engine with ADSR, filtering, and velocity crossfade

Python librosa scikit-learn HDBSCAN

Source ↗

AI Automation

Promo Bot

An end-to-end, LLM-powered outreach pipeline built to promote music releases at scale — targeting 9,000+ industry contacts with personalized, genre-aware pitches. From data extraction to automated, customized delivery.

Playwright scraper handling virtual scrolling and dynamic DOM
Claude API for personalized generation by genre & contact type
Full pipeline from extraction to automated delivery

Python Anthropic Claude Playwright

Private repo

Looking for the sites, bots & tools? → My full software & web-development portfolio — live sites, automation, and projects built end-to-end — lives at williamlonon.dev.

Visit williamlonon.dev ↗

Toolkit

The stack I reach for, from first question to shipped result.

Languages

Python · SQL · R · JavaScript

ML & Stats

scikit-learn · Clustering · Regression · Hypothesis Testing

Data

Pandas · NumPy · Feature Engineering · Pipelines

Viz & Reporting

Matplotlib · Seaborn · Jupyter · Dashboards

AI & LLMs

GPT-4o · Claude API · CLIP · Prompt Pipelines

Ship

Git · GitHub · Vercel · APIs

Get in touch

Let's work
with data.

Open to data science and analytics roles, research collaborations, and consulting. Have a problem worth measuring? Reach out.

✉ Click to reveal email

☏ Click to reveal phone

◈ LinkedIn ⌥ GitHub ↓ Download Resume

William Lonon.

About

Research

Selected Work

Toolkit

William
Lonon.