Data Scientist (LLM, MLOps) | $150K - $175K | San Francisco, CA
About the job
Data Scientist – LLMs, Python, MLOps
Remote | Full-Time
A 2024-founded startup based in San Francisco is building structured data tools to improve the accuracy and reliability of large language models. Their platform powers agentic, RAG-native systems through modular knowledge graphs and developer-friendly APIs, turning unstructured data into useful, trusted knowledge.
What you will do
Turn raw JSON, CSV, or HTML into clean insights. Profile, visualize, and identify patterns or outliers—before anyone asks.
Train and tune models for classification, ranking, and RAG with LLMs to move recall and precision metrics forward every week.
API Integrator
Wrap models using FastAPI, validate inputs with Pydantic, and deploy clean, testable endpoints using CI pipelines.
MLOps Wrangler
Monitor data and model drift, run batch jobs, add simple tests, and ensure long-term system reliability.
Insight Storyteller
Communicate findings through Jupyter notebooks, dashboards, and Loom videos. Make insights clear and accessible to legal and non-technical stakeholders.
Startup Swiss-Army Knife
Take initiative to fix data issues, infra gaps, and edge cases—without waiting for formal tasks or assignments.
You might be a fit if you have
- 3–5 years of experience with Python and tools like pandas, Polars, PyTorch, or TensorFlow
- Experience building and deploying APIs with FastAPI and Pydantic
- Practical use of LLMs for data augmentation or cleaning tasks
- Proficiency in SQL, Postgres/DuckDB, and object storage like S3
- Familiarity with CI/CD pipelines (e.g., GitHub Actions)
- A habit of documenting clearly and sharing proactively
Bonus if you have
- Experience with web scraping using Scrapy or Playwright, or working with PACER, NHTSA, or FDA datasets
- Familiarity with vector stores like Qdrant or pgvector, and with prompt engineering
- Exposure to regulated environments or compliance frameworks such as SOC 2 or HIPAA
Why this role
You'll work at the core of production-grade AI systems, from structured LLM pipelines to real-time API deployment. Perfect for someone who thrives in fast-moving, high-ownership environments and wants to build meaningful, technical systems that make LLMs safer and more reliable.
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Software Development