Skip to content

Alexis Roldan · Lancaster, CA

20 Years Building Production AI/ML for Biopharma

Full-Stack Data Scientist & Sr. Data Engineer

I build manufacturing analytics and GenAI systems end to end for Takeda's Los Angeles plasma fractionation plant — owning the ETL, the modeling, the app, the RAG layer, and the deployment. Currently at Takeda Pharmaceutical.

Lancaster, CA

01who I am

About

Alexis Roldan

I turn biopharma manufacturing data into production systems.

I'm a seasoned Full-Stack Data Scientist and Software Developer with a passion for advanced analytics and statistical modeling that drives real decisions on the plant floor. My biggest deliverables are full-stack: I own the ETL, the modeling, the app, the LLM/RAG layer, the deployment, and the executive comms.

  • End-to-end Data Science & ML
  • Data Engineering & Pipelines
  • Business Intelligence & Visualization
  • Process Optimization & Digital Twins
  • Generative AI & LLM Applications
Lean Six Sigma Green BeltAGILE ChampionDatabricks ChampionMulesoft Champion

English & Spanish (Native) · Japanese (Conversational)

20
Years in biopharma
30+
Solutions in production
~150
Users per flagship app
93%
Faster deviation search

Recognition

Race4Value — FTE Optimization Winner

2023

Star of the Quarter, Q3 FY22

2022

Employee of the Quarter

2017

02what I've built

Selected Work

0Databases

PennyTrail

Next.js / TypeScript

Mobile-first PWA for married-filing-jointly tax tracking — GPT-4o spending chat, receipt capture, no database (all state in S3 JSON).

Next.js 16TypeScriptTailwind v4shadcn/ui
MIA multimodal intelligent assistant interface

MIA

Next.js / GenAI

Bilingual learning assistant across 10 personas — streaming OpenAI chat, voice in/out, web search via function calling, document extraction.

Next.js 16OpenAIGoogle Custom SearchWeb Speech API
3-phaseSLA logic

LabCast

R / Shiny

QC lab sample tracking with TV/kiosk mode and a forecast model — 3-phase SLA logic across 25 sample types, live state in S3.

R Shinybs4DashForecast modelEDB / LIMS
Internal enterprise app
GPT-4oVision tagging

Aether Wardrobe

Next.js / GenAI

Photograph your wardrobe once; GPT-4o vision catalogs each piece and a deterministic combination engine pairs outfits — the AI only ranks and explains, never invents.

Next.js 16TypeScriptTailwind v4Supabase
Private project — walkthrough on request
StreamingLive AI output

Musical Reel Studio

Next.js / GenAI

Generates short-form reel concepts — hook → scene-by-scene → caption → CTA — streamed live, field-by-field from a song, vibe or lyric.

Next.js 15TypeScriptPrismaClerk
Private project — walkthrough on request
20+LLM providers

ReflectAI

React / Express / GenAI

Drafts yearly goals and self-assessments through conversation — provider-agnostic across 20+ LLMs with a mock fallback, exporting polished DOCX/PDF.

React + ViteExpressPluggable LLMsOpenAI / Anthropic / Ollama
Private project — walkthrough on request
Shiny App Valuation Toolkit dashboard

Shiny App Valuation Toolkit

R / Shiny

Modified COCOMO II model for Shiny/data-science repos — 3 analysis modes, scenario comparison, AI-assisted planning.

RShinyCOCOMO IIPlotly
RIOT OCR application interface

RIOT — Image OCR App

R / Shiny / AWS

Interactive R/Shiny app extracting text from images via AWS Textract OCR for downstream analytics.

RShinyAWS TextractOCR
TubeScout video discovery dashboard

TubeScout

Python

Finds high-signal YouTube videos using view-to-subscriber ratio analysis — discovery without the algorithm.

PythonStreamlitYouTube API
YouTube feed dashboard

YouTube Feed Dashboard

R / Shiny / AWS

Personal dashboard surfacing latest videos from favorite channels — AWS-hosted data pipeline, deployed on Posit Connect.

RShinyAWSYouTube API
Voice Assistant AI Streamlit interface

Voice Assistant AI

Python

Python voice assistant with speech recognition, real-time financial data and weather integration.

PythonStreamlitSpeech Recognition
roldanpack R package

roldanpack

R

Reusable analytics functions and custom visualization themes, installable via GitHub.

RPackage
0320 years in biopharma

Experience

Baxter 2005–2015Baxalta 2015–2016Shire 2016–2019Takeda 2019–Present
  1. Sr. Data Engineer & Data Scientist (Sr. Manager)

    Apr 2025 – Present

    Takeda Pharmaceutical

    Support the Los Angeles Digital Roadmap, AI/ML development and advanced analytics to drive data-driven decisions.

    • Leading the LA Digital Roadmap and AI/ML development
    • 30+ solutions deployed to production
    • Designing, building and maintaining local data pipelines for critical business data
    • Architected hybrid-RAG systems achieving 70% token reduction and 59% deviation reduction
    • Built agentic BI dashboards and AI-powered text-to-SQL apps for quality management
  2. Data Scientist III (Sr. Manager)

    Nov 2022 – Mar 2024

    Takeda Pharmaceutical

    Drove the AGILE 4.0 Digital Workstream and cutting-edge analytics to accelerate Takeda's digital capabilities across GMS sites.

    • Drove the AGILE 4.0 Digital Workstream across all GMS sites
    • Executed local and global GMS/GQ big-data and analytics strategy
    • Built and maintained data pipelines and BI applications across all GMS plasma sites
  3. Data Scientist II (Manager)

    Apr 2021 – Nov 2022

    Takeda Pharmaceutical

    Built AI/ML and advanced analytics programs to gain process insight and accelerate digital capabilities.

    • Achieved ~50% reduction in process variation via digital-twin models (PIMS project)
    • Designed A/B tests and developed end-to-end ML applications
    • Los Angeles lead data scientist for global analytics initiatives
04the toolkit

Skills & Education

Languages

RPythonSQL.NETHTML / CSSJava

Data & ML

ShinyDashMinitabJMPSIMCA / OPLSApache Spark

Databases

SQL ServerPostgreSQLOracle

Cloud & DevOps

AWSDatabricksDockerGitPosit Connect

BI Tools

Qlik SensePower BITableau

AI & GenAI

LLM IntegrationRAG ArchitecturePrompt EngineeringText-to-SQLAgentic AIClaude / GPT APIsVoice AI (TTS/STT)

Methodologies

Six SigmaSCRUM / AgileDMAICMLOpsProject Management

Education & Certifications

  • 2023

    Oxford Machine Learning Summer SchoolAI for Global Goals

  • 2023

    ODSC AI BootcampOpen Data Science Conference

  • 2020

    Data Science ProfessionalHarvardX

  • 2020

    Statistical Learning, Inference & ModelingHarvardX / Stanford Online

  • 2012

    Computer ScienceCal State Northridge

05let's talk

Get in touch

Have a project or role in mind?

I'm always open to conversations about data engineering, applied ML, and GenAI in regulated/manufacturing environments. Drop a note and I'll get back to you.