Tag: production

Product Evals in Three Simple Steps

Label some data, align LLM-evaluators, and run the eval harness with each change.

23 Nov 2025 · 9 min · eval engineering production

AI Engineer 2025 - Improving RecSys & Search with LLM techniques

Recsys & search are converging with LLMs via semantic IDs, data augmentation, and unified foundation models.

04 Jun 2025 · 1 min · recsys llm engineering production

NVIDIA GTC 2025 - Building LLM-Powered Applications

Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025.

18 Mar 2025 · 1 min · llm engineering production

39 Lessons on Building ML Systems, Scaling, Execution, and More

ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.

03 Nov 2024 · 10 min · machinelearning engineering production leadership

Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)

Use cases, techniques, alignment, finetuning, and critiques against LLM-evaluators.

18 Aug 2024 · 49 min · llm eval production survey 🔥

AI Engineer 2024 Keynote - What We Learned from a Year of LLMs

Special double-feature closing keynote from the 6 authors of the hit O'Reilly article on Applied LLMs.

27 Jun 2024 · 2 min · llm ai engineering production

Netflix PRS 2024 - Applying LLMs to Recommendation Experiences

Challenges and lessons from deploying LLM experiences: evals, scalability, guardrails.

31 May 2024 · 2 min · llm engineering production leadership

Prompting Fundamentals and How to Apply them Effectively

Structured input/output, prefilling, n-shots prompting, chain-of-thought, reducing hallucinations, etc.

26 May 2024 · 17 min · llm production 🔥

What We've Learned From A Year of Building with LLMs

From the tactical nuts & bolts to the operational day-to-day to the long-term business strategy.

12 May 2024 · 1 min · llm engineering production leadership 🔥

Push Notifications: What to Push, What Not to Push, and How Often

Sending helpful & engaging pushes, filtering annoying pushes, and finding the frequency sweet spot.

24 Dec 2023 · 18 min · teardown recsys machinelearning production

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

Evals, retrieval-augmented generation, guardrails, and collecting feedback; all that good stuff.

09 Oct 2023 · 17 min · llm ai engineering production

How to Match LLM Patterns to Problems

Distinguishing problems with external vs. internal LLMs, and data vs non-data patterns

13 Aug 2023 · 6 min · llm production

Patterns for Building LLM-based Systems & Products

Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.

30 Jul 2023 · 66 min · llm engineering production 🔥

More Design Patterns For Machine Learning Systems

9 patterns including HITL, hard mining, reframing, cascade, data flywheel, business rules layer, and more.

23 Apr 2023 · 20 min · machinelearning engineering production recsys

Content Moderation & Fraud Detection - Patterns in Industry

Collecting ground truth, data augmentation, cascading heuristics and models, and more.

26 Feb 2023 · 16 min · teardown machinelearning production

RecSys 2022: Recap, Favorite Papers, and Lessons

My three favorite papers, 17 paper summaries, and ML and non-ML lessons.

02 Oct 2022 · 14 min · recsys engineering production

RecSys 2022 Keynote - Is the Juice Worth the Squeeze?

Invited keynote at the Workshop for Online Recommender Systems and User Modeling (ORSUM)

23 Sep 2022 · 2 min · recsys engineering production

Writing Robust Tests for Data & Machine Learning Pipelines

Or why I should write fewer integration tests.

04 Sep 2022 · 19 min · engineering machinelearning production 🩷

Simplicity is An Advantage but Sadly Complexity Sells Better

Pushing back on the cult of complexity.

14 Aug 2022 · 10 min · machinelearning engineering production 🔥

RecSys 2021 - Papers and Talks to Chew on

Simple baselines, ideas, tech stacks, and packages to try.

03 Oct 2021 · 5 min · recsys deeplearning production survey

MLOps Community - System Design for RecSys & Search

An overview of system design, candidate retrieval, and ranking, with industry examples.

15 Sep 2021 · 1 min · recsys machinelearning production

SF Big Analytics - System Design for RecSys & Search

Why real-time RecSys? What does the system design look like in industry? How to build an MVP?

13 Jul 2021 · 1 min · recsys machinelearning production

System Design for Recommendations and Search

Breaking it into offline vs. online environments, and candidate retrieval vs. ranking steps.

27 Jun 2021 · 13 min · teardown production engineering recsys 🔥

Search: Query Matching via Lexical, Graph, and Embedding Methods

An overview and comparison of the various approaches, with examples from industry search systems.

25 Apr 2021 · 21 min · teardown machinelearning production 🔥

DataTalksClub - Building an ML System; Behind the Scenes

Design and architecture, tech stack, methodology, results, and lessons learned.

07 Feb 2021 · 5 min · machinelearning production

Real-time Machine Learning For Recommendations

Why real-time? How have China & US companies built them? How to design & build an MVP?

10 Jan 2021 · 21 min · teardown machinelearning recsys production 🔥

RecSys 2020: Takeaways and Notable Papers

Emphasis on bias, more sequential models & bandits, robust offline evaluation, and recsys in the wild.

27 Sep 2020 · 16 min · recsys deeplearning production survey

My Notes From Spark+AI Summit 2020 (Application-Specific Talks)

Part II of the previous write-up, this time on applications and frameworks of Spark in production

05 Jul 2020 · 15 min · machinelearning deeplearning production survey

How to Set Up a Python Project For Automation and Collaboration

After this article, we'll have a workflow of tests and checks that run automatically with each git push.

21 Jun 2020 · 20 min · engineering production python productivity

Why Are My Airflow Jobs Running “One Day Late”?

A curious discussion made me realize my expert blind spot. And no, Airflow is not late.

17 Jun 2020 · 3 min · engineering production til

A Practical Guide to Maintaining Machine Learning in Production

Can maintaining machine learning in production be easier? I go through some practical tips.

25 May 2020 · 16 min · machinelearning engineering production

6 Little-Known Challenges After Deploying Machine Learning

I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.

18 May 2020 · 14 min · machinelearning engineering production

DataScience SG x ODSC Meetup - Applying ML to Healthcare

In-depth sharing on how to put machine learning systems into production.

09 Oct 2019 · 4 min · machinelearning production

DATAx - A Production ML system for SEA's Biggest Hospital Group

How we built an ML system to predict hospitalization costs at admission; sharing at DATAx Conference.

06 Mar 2019 · 4 min · production machinelearning

Product Categorization API Part 3: Creating an API

Or how to put machine learning models into production.

13 Feb 2017 · 8 min · machinelearning production python 🛠

Image search is now live!

A web app to find similar products based on image.

14 Jan 2017 · 4 min · deeplearning python production 🛠

Strata x Hadoop 2016 - How Lazada Ranks Products

How Lazada ranks products to improve customer experience and conversion at Strata 2016.

09 Dec 2016 · 1 min · machinelearning production

Image classification API is now live!

A simple web app to classify fashion images into Amazon categories.

27 Nov 2016 · 2 min · deeplearning python production 🛠

eugeneyan

Tag:
production
(38)