Case study AMIRA
R / Shiny / LLM
Enterprise analytics platform for plasma fractionation: 59% deviation reduction and 70% token optimization via hybrid RAG.
Alexis Roldan · Lancaster, CA
Full-Stack Data Scientist & Sr. Data Engineer
I build manufacturing analytics and GenAI systems end to end for Takeda's Los Angeles plasma fractionation plant — owning the ETL, the modeling, the app, the RAG layer, and the deployment. Currently at Takeda Pharmaceutical.

I'm a Full-Stack Data Scientist and Software Developer focused on advanced analytics and statistical modeling that drive real decisions on the plant floor. My biggest deliverables are full-stack: I own the ETL, the modeling, the app, the LLM/RAG layer, the deployment, and the executive communications.
English & Spanish (Native) · Japanese (Conversational)
Race4Value — FTE Optimization Winner
2023
Star of the Quarter, Q3 FY22
2022
Employee of the Quarter
2017
Mobile-first PWA for married-filing-jointly tax tracking — GPT-4o spending chat, receipt capture, no database (all state in S3 JSON).

QC lab sample tracking with TV/kiosk mode and a forecast model — 3-phase SLA logic across 25 sample types, live state in S3.
Photograph your wardrobe once; GPT-4o vision catalogs each piece and a deterministic combination engine pairs outfits — the AI only ranks and explains, never invents.
Generates short-form reel concepts (hook → scene-by-scene → caption → CTA), streamed live, field by field, from a song, vibe, or lyric.
Drafts yearly goals and self-assessments through conversation — provider-agnostic across 20+ LLMs with a mock fallback, exporting polished DOCX/PDF.

Modified COCOMO II model for Shiny/data-science repos — 3 analysis modes, scenario comparison, AI-assisted planning.
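For readers unfamiliar with COCOMO II, the sketch below shows the standard published post-architecture effort equation with its COCOMO II.2000 constants, used here to compare two scenarios. It is not the modified model this project implements: the function name, the scenario sizes, and the scale-factor and multiplier values are all illustrative assumptions.

```python
import math

# COCOMO II.2000 calibration constants for the published effort equation.
A = 2.94  # productivity constant
B = 0.91  # base exponent


def cocomo2_effort(ksloc, scale_factors, effort_multipliers):
    """Estimated effort in person-months.

    effort = A * size^E * product(effort multipliers)
    E      = B + 0.01 * sum(scale factors)
    """
    exponent = B + 0.01 * sum(scale_factors)
    return A * ksloc ** exponent * math.prod(effort_multipliers)


# Hypothetical scenario comparison: same repo size, different team maturity.
# These scale-factor and multiplier values are placeholders, not calibrated data.
baseline = cocomo2_effort(12.0, [3.0, 3.0, 4.0, 3.0, 4.0], [1.0, 1.0])
experienced = cocomo2_effort(12.0, [2.0, 2.0, 3.0, 2.0, 3.0], [0.9, 1.0])

print(f"baseline:    {baseline:.1f} person-months")
print(f"experienced: {experienced:.1f} person-months")
```

Lower scale factors shrink the exponent, so the second scenario always comes out cheaper for any size above 1 KSLOC; a modified model would typically recalibrate the constants or swap the size measure.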

Interactive R/Shiny app extracting text from images via AWS Textract OCR for downstream analytics.

Finds high-signal YouTube videos using view-to-subscriber ratio analysis — discovery without the algorithm.
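A minimal sketch of what a view-to-subscriber ranking could look like, assuming video and channel statistics have already been fetched. The `Video` class, the function names, the subscriber floor, and the sample numbers are all hypothetical and not taken from the project itself.

```python
from dataclasses import dataclass


@dataclass
class Video:
    title: str
    views: int
    channel_subscribers: int


def view_to_subscriber_ratio(video, min_subscribers=100):
    """Views per subscriber, with a floor so tiny channels do not dominate."""
    subscribers = max(video.channel_subscribers, min_subscribers)
    return video.views / subscribers


def rank_by_signal(videos, top_n=3):
    """Return the top_n videos with the highest view-to-subscriber ratio."""
    return sorted(videos, key=view_to_subscriber_ratio, reverse=True)[:top_n]


# Made-up sample data for illustration only.
videos = [
    Video("Small channel breakout", 50_000, 1_000),
    Video("Big channel average upload", 200_000, 2_000_000),
    Video("Mid channel steady", 30_000, 20_000),
]

ranked = rank_by_signal(videos)
for video in ranked:
    print(f"{view_to_subscriber_ratio(video):8.2f}  {video.title}")
```

A video that far outperforms its channel's subscriber count ranks first even when its raw view count is lower, which is the sense in which the ratio surfaces videos independently of channel size.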

Personal dashboard surfacing latest videos from favorite channels — AWS-hosted data pipeline, deployed on Posit Connect.

Python voice assistant with speech recognition, real-time financial data and weather integration.

Reusable analytics functions and custom visualization themes, installable via GitHub.
Takeda Pharmaceutical
Support the Los Angeles Digital Roadmap, AI/ML development, and advanced analytics to inform data-driven decisions.
Takeda Pharmaceutical
Drove the AGILE 4.0 Digital Workstream and cutting-edge analytics to accelerate Takeda's digital capabilities across GMS sites.
Takeda Pharmaceutical
Built AI/ML and advanced analytics programs to gain process insight and accelerate digital capabilities.
Oxford Machine Learning Summer School — AI for Global Goals
ODSC AI Bootcamp — Open Data Science Conference
Data Science Professional — HarvardX
Statistical Learning, Inference & Modeling — HarvardX / Stanford Online
Computer Science — Cal State Northridge
I'm always open to conversations about data engineering, applied ML, and GenAI in regulated/manufacturing environments. Drop a note and I'll get back to you.
Location
Lancaster, CA