engineering
(38)Chip Huyen and I share what we've learned, best practices, and insights at NVIDIA GTC 2025.
18 Mar 2025  Ā·  1 min  Ā·  llm engineering production
Setting up my new MacBook Pro from scratch
17 Nov 2024  Ā·  5 min  Ā·  engineering misc
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
03 Nov 2024  Ā·  10 min  Ā·  machinelearning engineering production leadership
FastAPI, FastHTML, Next.js, SvelteKit, and thoughts on how coding assistants influence builders' choices.
08 Sep 2024  Ā·  8 min  Ā·  learning engineering python š š„
Special double-feature closing keynote from the 6 authors of the hit O'Reilly article on Applied LLMs.
27 Jun 2024  Ā·  2 min  Ā·  llm ai engineering production
Challenges and lessons from deploying LLM experiences: evals, scalability, guardrails.
31 May 2024  Ā·  2 min  Ā·  llm engineering production leadership
From the tactical nuts & bolts to the operational day-to-day to the long-term business strategy.
12 May 2024  Ā·  1 min  Ā·  llm engineering production leadership š„
How unit testing machine learning code differs from typical software practices
25 Feb 2024  Ā·  6 min  Ā·  machinelearning engineering python
Evals, retrieval-augmented generation, guardrails, and collecting feedback; all that good stuff.
09 Oct 2023  Ā·  17 min  Ā·  llm ai engineering production
Evals, RAG, fine-tuning, caching, guardrails, defensive UX, and collecting user feedback.
30 Jul 2023  Ā·  66 min  Ā·  llm engineering production š„
Writing drafts via retrieval-augmented generation. Also reflecting on the week's journal entries.
11 Jun 2023  Ā·  6 min  Ā·  llm engineering š
9 patterns including HITL, hard mining, reframing, cascade, data flywheel, business rules layer, and more.
23 Apr 2023  Ā·  20 min  Ā·  machinelearning engineering production recsys
My three favorite papers, 17 paper summaries, and ML and non-ML lessons.
02 Oct 2022  Ā·  14 min  Ā·  recsys engineering production
Invited keynote at the Workshop for Online Recommender Systems and User Modeling (ORSUM)
23 Sep 2022  Ā·  2 min  Ā·  recsys engineering production
Or why I should write fewer integration tests.
04 Sep 2022  Ā·  19 min  Ā·  engineering machinelearning production š©·
Pushing back on the cult of complexity.
14 Aug 2022  Ā·  10 min  Ā·  machinelearning engineering production š„
Some off-the-beaten uses of Python learned from reading libraries.
31 Jul 2022  Ā·  10 min  Ā·  python engineering š„
Understanding and spotting patterns to use code and components as intended.
12 Jun 2022  Ā·  13 min  Ā·  machinelearning engineering python š„
How they differ and why they work better in different situations.
20 Mar 2022  Ā·  7 min  Ā·  engineering productivity misc
Hard-won lessons on how to start data science projects effectively.
06 Mar 2022  Ā·  7 min  Ā·  datascience engineering productivity
Breaking it into offline vs. online environments, and candidate retrieval vs. ranking steps.
27 Jun 2021  Ā·  13 min  Ā·  teardown production engineering recsys š„
Pointers to think through your methodology and implementation, and the review process.
07 Mar 2021  Ā·  15 min  Ā·  writing machinelearning engineering
Three documents I write (one-pager, design doc, after-action review) and how I structure them.
28 Feb 2021  Ā·  10 min  Ā·  writing engineering productivity š©· š„
Access, serving, integrity, convenience, autopilot; use what you need.
21 Feb 2021  Ā·  19 min  Ā·  teardown machinelearning engineering š„
What the top teams did to win the 36-hour data hackathon. No, not machine learning.
14 Feb 2021  Ā·  6 min  Ā·  datascience engineering misc
A personal take on their deliverables and skills, and what it means for the industry and your team.
08 Nov 2020  Ā·  11 min  Ā·  datascience machinelearning engineering career
What questions do they answer? How do they compare? What open-source solutions are available?
25 Oct 2020  Ā·  16 min  Ā·  teardown datascience engineering š„
Checking for correct implementation, expected learned behaviour, and satisfactory performance.
06 Sep 2020  Ā·  14 min  Ā·  machinelearning engineering python
Updating our FastAPI app to let users select options and download results.
05 Aug 2020  Ā·  3 min  Ā·  engineering python til
I couldn't find any guides on serving HTML with FastAPI, thus I wrote this to plug the hole on the internet.
23 Jul 2020  Ā·  3 min  Ā·  engineering python til š„
I wanted to add my recent writing to my GitHub Profile README but was too lazy to do manual updates.
11 Jul 2020  Ā·  3 min  Ā·  engineering python til
After this article, we'll have a workflow of tests and checks that run automatically with each git push.
21 Jun 2020  Ā·  20 min  Ā·  engineering production python productivity š„
A curious discussion made me realize my expert blind spot. And no, Airflow is not late.
17 Jun 2020  Ā·  3 min  Ā·  engineering production til
Can maintaining machine learning in production be easier? I go through some practical tips.
25 May 2020  Ā·  16 min  Ā·  machinelearning engineering production
I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.
18 May 2020  Ā·  14 min  Ā·  machinelearning engineering production
OMSCS CS6200 (Introduction to OS) - Moving data from one process to another, multi-threaded.
15 Dec 2019  Ā·  7 min  Ā·  omscs learning engineering
Keynote on how Asia's tech giants scale and their SuperApp strategy.
03 Oct 2019  Ā·  3 min  Ā·  machinelearning engineering leadership
OMSCS CS6300 (Software Development Process) - Java and collaboratively developing an Android app.
13 Aug 2017  Ā·  5 min  Ā·  omscs learning engineering
Join 10,300+ readers getting updates on machine learning, RecSys, LLMs, and engineering.