production (34)
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
03 Nov 2024  ·  10 min  ·  machinelearning engineering production leadership
Special double-feature closing keynote from the 6 authors of the hit O'Reilly article on Applied LLMs.
27 Jun 2024  ·  2 min  ·  llm engineering production
Challenges and lessons from deploying LLM experiences: evals, scalability, guardrails.
31 May 2024  ·  2 min  ·  llm engineering production leadership
Structured input/output, prefilling, n-shot prompting, chain-of-thought, reducing hallucinations, etc.
26 May 2024  ·  17 min  ·  llm production
From the tactical nuts & bolts to the operational day-to-day to the long-term business strategy.
12 May 2024  ·  1 min  ·  llm engineering production leadership
Sending helpful & engaging pushes, filtering annoying pushes, and finding the frequency sweet spot.
24 Dec 2023  ·  18 min  ·  teardown recsys machinelearning production
Evals, retrieval-augmented generation, guardrails, and collecting feedback; all that good stuff.
09 Oct 2023  ·  17 min  ·  llm engineering production
Distinguishing problems with external vs. internal LLMs, and data vs. non-data patterns.
13 Aug 2023  ·  6 min  ·  llm production
Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.
30 Jul 2023  ·  66 min  ·  llm engineering production 🔥
9 patterns including HITL, hard mining, reframing, cascade, data flywheel, business rules layer, and more.
23 Apr 2023  ·  20 min  ·  machinelearning engineering production recsys
Collecting ground truth, data augmentation, cascading heuristics and models, and more.
26 Feb 2023  ·  16 min  ·  teardown machinelearning production
My three favorite papers, 17 paper summaries, and ML and non-ML lessons.
02 Oct 2022  ·  14 min  ·  recsys engineering production
Invited keynote at the Workshop for Online Recommender Systems and User Modeling (ORSUM).
23 Sep 2022  ·  2 min  ·  recsys engineering production
Or why I should write fewer integration tests.
04 Sep 2022  ·  19 min  ·  engineering machinelearning production 🩷
Pushing back on the cult of complexity.
14 Aug 2022  ·  10 min  ·  machinelearning engineering production 🔥
Simple baselines, ideas, tech stacks, and packages to try.
03 Oct 2021  ·  5 min  ·  recsys deeplearning production survey
An overview of system design, candidate retrieval, and ranking, with industry examples.
15 Sep 2021  ·  1 min  ·  recsys machinelearning production
Why real-time RecSys? What does the system design look like in industry? How to build an MVP?
13 Jul 2021  ·  1 min  ·  recsys machinelearning production
Breaking it into offline vs. online environments, and candidate retrieval vs. ranking steps.
27 Jun 2021  ·  13 min  ·  teardown production engineering recsys 🔥
An overview and comparison of the various approaches, with examples from industry search systems.
25 Apr 2021  ·  21 min  ·  teardown machinelearning production 🔥
Design and architecture, tech stack, methodology, results, and lessons learned.
07 Feb 2021  ·  5 min  ·  machinelearning production
Why real-time? How have China & US companies built them? How to design & build an MVP?
10 Jan 2021  ·  21 min  ·  teardown machinelearning recsys production 🔥
Emphasis on bias, more sequential models & bandits, robust offline evaluation, and recsys in the wild.
27 Sep 2020  ·  16 min  ·  recsys deeplearning production survey
Part II of the previous write-up, this time on applications and frameworks of Spark in production.
05 Jul 2020  ·  15 min  ·  machinelearning deeplearning spark production survey
After this article, we'll have a workflow of tests and checks that run automatically with each git push.
21 Jun 2020  ·  20 min  ·  engineering production python productivity 🔥
A curious discussion made me realize my expert blind spot. And no, Airflow is not late.
17 Jun 2020  ·  3 min  ·  engineering production til
Can maintaining machine learning in production be easier? I go through some practical tips.
25 May 2020  ·  16 min  ·  machinelearning engineering production
I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.
18 May 2020  ·  14 min  ·  machinelearning engineering production
In-depth sharing on how to put machine learning systems into production.
09 Oct 2019  ·  4 min  ·  machinelearning production
How we built an ML system to predict hospitalization costs at admission; sharing at DATAx Conference.
06 Mar 2019  ·  4 min  ·  production machinelearning
Or how to put machine learning models into production.
13 Feb 2017  ·  8 min  ·  machinelearning production python 🛠
A web app to find similar products based on image.
14 Jan 2017  ·  4 min  ·  deeplearning python production 🛠
How Lazada ranks products to improve customer experience and conversion at Strata 2016.
09 Dec 2016  ·  1 min  ·  machinelearning lazada production
A simple web app to classify fashion images into Amazon categories.
27 Nov 2016  ·  2 min  ·  deeplearning python production 🛠