machinelearning (56)
ML systems, production & scaling, execution & collaboration, building for users, conference etiquette.
03 Nov 2024  ·  10 min  ·  machinelearning engineering production leadership
What to interview for, how to structure the phone screen, interview loop, and debrief, and a few tips.
07 Jul 2024  ·  21 min  ·  machinelearning career leadership 🔥
Evals for classification, summarization, translation, copyright regurgitation, and toxicity.
31 Mar 2024  ·  33 min  ·  llm eval machinelearning
How unit testing machine learning code differs from typical software practices
25 Feb 2024  ·  6 min  ·  machinelearning engineering python
Sending helpful & engaging pushes, filtering annoying pushes, and finding the frequency sweet spot.
24 Dec 2023  ·  18 min  ·  teardown recsys machinelearning production
How to use open-source, permissive-use data and collect fewer labeled samples for our tasks.
05 Nov 2023  ·  12 min  ·  llm eval machinelearning python
9 patterns including HITL, hard mining, reframing, cascade, data flywheel, business rules layer, and more.
23 Apr 2023  ·  20 min  ·  machinelearning engineering production recsys
Writing good instructions to achieve high precision and throughput.
12 Mar 2023  ·  6 min  ·  machinelearning mechanism
Collecting ground truth, data augmentation, cascading heuristics and models, and more.
26 Feb 2023  ·  16 min  ·  teardown machinelearning production
Pilot & copilot, literature review, methodology review, and timeboxing.
22 Jan 2023  ·  7 min  ·  mechanism machinelearning productivity
Or why I should write fewer integration tests.
04 Sep 2022  ·  19 min  ·  engineering machinelearning production 🩷
Pushing back on the cult of complexity.
14 Aug 2022  ·  10 min  ·  machinelearning engineering production 🔥
Understanding and spotting patterns to use code and components as intended.
12 Jun 2022  ·  13 min  ·  machinelearning engineering python 🔥
Industry examples, exploration strategies, warm-starting, off-policy evaluation, and more.
08 May 2022  ·  14 min  ·  teardown recsys machinelearning
Thinking about recsys as interventional vs. observational, and inverse propensity scoring.
10 Apr 2022  ·  8 min  ·  recsys eval machinelearning
What to consider in terms of data, roadmap, role, manager, tooling, etc.
13 Feb 2022  ·  8 min  ·  datascience machinelearning career 🔥
Beyond getting that starting role, how does one continue growing in the field?
19 Jan 2022  ·  6 min  ·  learning career machinelearning
Daliana and I had a 2hr chat on all things data science and machine learning.
02 Dec 2021  ·  1 min  ·  datascience machinelearning career
More than two dozen interviews with ML Practitioners sharing their stories and advice
25 Nov 2021  ·  1 min  ·  machinelearning career 🛠
Why this is the first rule, some baseline heuristics, and when to move on to machine learning.
19 Sep 2021  ·  8 min  ·  machinelearning career 🔥
An overview of system design, candidate retrieval, and ranking, with industry examples.
15 Sep 2021  ·  1 min  ·  recsys machinelearning production
How to generate labels from scratch with semi, active, and weakly supervised learning.
01 Aug 2021  ·  12 min  ·  teardown machinelearning
Building semantic search; how to calculate recall when relevant documents are unknown.
20 Jul 2021  ·  1 min  ·  machinelearning 📬
Why real-time RecSys? What does the system design look like in industry? How to build an MVP?
13 Jul 2021  ·  1 min  ·  recsys machinelearning production
A whirlwind tour of bandits, embedding+MLP, sequences, graph, and user embeddings.
13 Jun 2021  ·  25 min  ·  teardown recsys machinelearning deeplearning
How to go from knowing machine learning to applying it at work to drive impact.
02 May 2021  ·  12 min  ·  machinelearning career 🩷
An overview and comparison of the various approaches, with examples from industry search systems.
25 Apr 2021  ·  21 min  ·  teardown machinelearning production 🔥
Mike and I take a philosophical detour on Talk Python and discuss life lessons from machine learning.
26 Mar 2021  ·  1 min  ·  machinelearning life
Short vs. long-term gain, incremental vs. disruptive innovation, and resume-driven development.
21 Mar 2021  ·  12 min  ·  datascience machinelearning leadership
Pointers to think through your methodology and implementation, and the review process.
07 Mar 2021  ·  15 min  ·  writing machinelearning engineering
Access, serving, integrity, convenience, autopilot; use what you need.
21 Feb 2021  ·  19 min  ·  teardown machinelearning engineering 🔥
Design and architecture, tech stack, methodology, results, and lessons learned.
07 Feb 2021  ·  5 min  ·  machinelearning production
Why real-time? How have China & US companies built them? How to design & build an MVP?
10 Jan 2021  ·  21 min  ·  teardown machinelearning recsys production 🔥
Data cleaning, transfer learning, overfitting, ensembling, and more.
22 Nov 2020  ·  11 min  ·  machinelearning career life
A personal take on their deliverables and skills, and what it means for the industry and your team.
08 Nov 2020  ·  11 min  ·  datascience machinelearning engineering career
Setbacks she faced, overcoming them, and how writing changed her life.
01 Nov 2020  ·  11 min  ·  career machinelearning writing
Step-by-step walkthrough on the environment, compilers, and installation for ScaNN.
14 Oct 2020  ·  3 min  ·  python machinelearning til
Checking for correct implementation, expected learned behaviour, and satisfactory performance.
06 Sep 2020  ·  14 min  ·  machinelearning engineering python
Should I switch from a regex-based to ML-based solution on my application?
04 Sep 2020  ·  4 min  ·  machinelearning 📬
Why (and why not) be more end-to-end, how to, and Stitch Fix and Netflix's experience
09 Aug 2020  ·  17 min  ·  datascience machinelearning leadership 🔥
Part II of the previous write-up, this time on applications and frameworks of Spark in production
05 Jul 2020  ·  15 min  ·  machinelearning deeplearning spark production survey
Sharing my notes & practical knowledge from the conference for people who don't have the time.
28 Jun 2020  ·  11 min  ·  machinelearning deeplearning spark survey
Can maintaining machine learning in production be easier? I go through some practical tips.
25 May 2020  ·  16 min  ·  machinelearning engineering production
I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.
18 May 2020  ·  14 min  ·  machinelearning engineering production
Comparing baselines (matrix factorization) against novel approaches using graphs & NLP.
14 Jan 2020  ·  2 min  ·  recsys machinelearning
In-depth sharing on how to put machine learning systems into production.
09 Oct 2019  ·  4 min  ·  machinelearning production
Keynote on how Asia's tech giants scale and their SuperApp strategy.
03 Oct 2019  ·  3 min  ·  machinelearning engineering leadership
OMSCS CS7646 (Machine Learning for Trading) - Don't sell your house to trade algorithmically.
11 May 2019  ·  9 min  ·  omscs learning machinelearning python
How we built an ML system to predict hospitalization costs at admission; sharing at DATAx Conference.
06 Mar 2019  ·  4 min  ·  production machinelearning
OMSCS CS6601 (Artificial Intelligence) - First, start with the simplest solution, and then add intelligence.
20 Dec 2018  ·  8 min  ·  omscs learning machinelearning python
OMSCS CS7641 (Machine Learning) - Revisiting the fundamentals and learning new techniques.
27 Dec 2017  ·  4 min  ·  omscs learning machinelearning python
Or how to put machine learning models into production.
13 Feb 2017  ·  8 min  ·  machinelearning production python 🛠
Cleaning up text and messing with ASCII (urgh!)
11 Dec 2016  ·  8 min  ·  machinelearning python 🛠
How Lazada ranks products to improve customer experience and conversion at Strata 2016.
09 Dec 2016  ·  1 min  ·  machinelearning lazada production
Parsing JSON and formatting product titles and categories.
11 Oct 2016  ·  9 min  ·  machinelearning python 🛠
Sharing about my first data science competition at DataScience SG.
20 Jun 2015  ·  1 min  ·  machinelearning