production
(36)Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025.
18 Mar 2025  Ā·  1 min  Ā·  llm engineering production
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
03 Nov 2024  Ā·  10 min  Ā·  machinelearning engineering production leadership
Use cases, techniques, alignment, finetuning, and critiques against LLM-evaluators.
18 Aug 2024  Ā·  49 min  Ā·  llm eval production š„
Special double-feature closing keynote from the 6 authors of the hit O'Reilly article on Applied LLMs.
27 Jun 2024  Ā·  2 min  Ā·  llm ai engineering production
Challenges and lessons from deploying LLM experiences: evals, scalability, guardrails.
31 May 2024  Ā·  2 min  Ā·  llm engineering production leadership
Structured input/output, prefilling, n-shots prompting, chain-of-thought, reducing hallucinations, etc.
26 May 2024  Ā·  17 min  Ā·  llm production š„
From the tactical nuts & bolts to the operational day-to-day to the long-term business strategy.
12 May 2024  Ā·  1 min  Ā·  llm engineering production leadership š„
Sending helpful & engaging pushes, filtering annoying pushes, and finding the frequency sweet spot.
24 Dec 2023  Ā·  18 min  Ā·  teardown recsys machinelearning production
Evals, retrieval-augmented generation, guardrails, and collecting feedback; all that good stuff.
09 Oct 2023  Ā·  17 min  Ā·  llm ai engineering production
Distinguishing problems with external vs. internal LLMs, and data vs non-data patterns
13 Aug 2023  Ā·  6 min  Ā·  llm production
Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.
30 Jul 2023  Ā·  66 min  Ā·  llm engineering production š„
9 patterns including HITL, hard mining, reframing, cascade, data flywheel, business rules layer, and more.
23 Apr 2023  Ā·  20 min  Ā·  machinelearning engineering production recsys
Collecting ground truth, data augmentation, cascading heuristics and models, and more.
26 Feb 2023  Ā·  16 min  Ā·  teardown machinelearning production
My three favorite papers, 17 paper summaries, and ML and non-ML lessons.
02 Oct 2022  Ā·  14 min  Ā·  recsys engineering production
Invited keynote at the Workshop for Online Recommender Systems and User Modeling (ORSUM)
23 Sep 2022  Ā·  2 min  Ā·  recsys engineering production
Or why I should write fewer integration tests.
04 Sep 2022  Ā·  19 min  Ā·  engineering machinelearning production š©·
Pushing back on the cult of complexity.
14 Aug 2022  Ā·  10 min  Ā·  machinelearning engineering production š„
Simple baselines, ideas, tech stacks, and packages to try.
03 Oct 2021  Ā·  5 min  Ā·  recsys deeplearning production survey
An overview of system design, candidate retrieval, and ranking, with industry examples.
15 Sep 2021  Ā·  1 min  Ā·  recsys machinelearning production
Why real-time RecSys? What does the system design look like in industry? How to build an MVP?
13 Jul 2021  Ā·  1 min  Ā·  recsys machinelearning production
Breaking it into offline vs. online environments, and candidate retrieval vs. ranking steps.
27 Jun 2021  Ā·  13 min  Ā·  teardown production engineering recsys š„
An overview and comparison of the various approaches, with examples from industry search systems.
25 Apr 2021  Ā·  21 min  Ā·  teardown machinelearning production š„
Design and architecture, tech stack, methodology, results, and lessons learned.
07 Feb 2021  Ā·  5 min  Ā·  machinelearning production
Why real-time? How have China & US companies built them? How to design & build an MVP?
10 Jan 2021  Ā·  21 min  Ā·  teardown machinelearning recsys production š„
Emphasis on bias, more sequential models & bandits, robust offline evaluation, and recsys in the wild.
27 Sep 2020  Ā·  16 min  Ā·  recsys deeplearning production survey
Part II of the previous write-up, this time on applications and frameworks of Spark in production
05 Jul 2020  Ā·  15 min  Ā·  machinelearning deeplearning production survey
After this article, we'll have a workflow of tests and checks that run automatically with each git push.
21 Jun 2020  Ā·  20 min  Ā·  engineering production python productivity š„
A curious discussion made me realize my expert blind spot. And no, Airflow is not late.
17 Jun 2020  Ā·  3 min  Ā·  engineering production til
Can maintaining machine learning in production be easier? I go through some practical tips.
25 May 2020  Ā·  16 min  Ā·  machinelearning engineering production
I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.
18 May 2020  Ā·  14 min  Ā·  machinelearning engineering production
In-depth sharing on how to put machine learning systems into production.
09 Oct 2019  Ā·  4 min  Ā·  machinelearning production
How we built an ML system to predict hospitalization costs at admission; sharing at DATAx Conference.
06 Mar 2019  Ā·  4 min  Ā·  production machinelearning
Or how to put machine learning models into production.
13 Feb 2017  Ā·  8 min  Ā·  machinelearning production python š
A web app to find similar products based on image.
14 Jan 2017  Ā·  4 min  Ā·  deeplearning python production š
How Lazada ranks products to improve customer experience and conversion at Strata 2016.
09 Dec 2016  Ā·  1 min  Ā·  machinelearning lazada production
A simple web app to classify fashion images into Amazon categories.
27 Nov 2016  Ā·  2 min  Ā·  deeplearning python production š
Join 10,300+ readers getting updates on machine learning, RecSys, LLMs, and engineering.