98 posts, 174,385 words, innumerable hours. Opinions & bad jokes my own. What should I write?
Why real-time? How have China and US companies built them? How to design and build an MVP?
10 Jan 2021  ·  20 min  ·  teardown machinelearning recsys production
A public roadmap to track and share my progress; nothing mission or work-related.
03 Jan 2021  ·  4 min  ·  productivity life
Wrapping up 2020 with writing and site statistics, graphs, and a word cloud.
20 Dec 2020  ·  8 min  ·  productivity life
"When can I get my own daggers?", he asked. "Catch the daggers I throw and they are yours", she replied.
Time to clear the cache, evaluate existing processes, and start new threads.
06 Dec 2020  ·  4 min  ·  productivity life
Learn how he switched from engineering to data science, what "senior" means, and how writing helps him.
29 Nov 2020  ·  22 min  ·  informalmentors career datascience writing
Data cleaning, transfer learning, overfitting, ensembling, and more.
22 Nov 2020  ·  11 min  ·  machinelearning career life
Interview questions you should ask and how to evolve your job scope.
15 Nov 2020  ·  8 min  ·  datascience career
A personal take on their deliverables and skills, and what it means for the industry and your team.
08 Nov 2020  ·  10 min  ·  datascience machinelearning career
Chip shares openly about the setbacks she faced, overcoming them, and how writing changed her life.
01 Nov 2020  ·  11 min  ·  informalmentors career machinelearning writing
What questions do they answer? How do they compare? What open-source solutions are available?
25 Oct 2020  ·  16 min  ·  teardown datascience engineering
DNS server snafus leading to missing email and security issues. Also, limited free build minutes monthly.
21 Oct 2020  ·  3 min  ·  misc
Instead of "How to build a data science portfolio", we'll discuss the "Whys" and "Whats" around a portfolio.
18 Oct 2020  ·  15 min  ·  datascience learning career
Step-by-step walkthrough on the environment, compilers, and installation for ScaNN.
14 Oct 2020  ·  3 min  ·  python machinelearning til
Building prototypes helped get buy-in on data science efforts when roadmaps & design docs failed.
11 Oct 2020  ·  6 min  ·  datascience productivity
As our careers grow, how does the balance between writing & coding change? Hear from 4 tech leaders.
04 Oct 2020  ·  12 min  ·  writing career leadership
Emphasis on bias, more sequential models & bandits, robust offline evaluation, and recsys in the wild.
27 Sep 2020  ·  15 min  ·  machinelearning deeplearning recsys production survey
What if the alternative was nothingness?
26 Sep 2020  ·  1 min  ·  life
For years I've refined my routines and found tools to manage my time. Here I share it with readers.
20 Sep 2020  ·  16 min  ·  productivity learning career
My tools for organization and creation, autopilot routines, and Maker's schedule
13 Sep 2020  ·  11 min  ·  productivity learning career
A step-by-step of how to migrate from json comments to Utterances.
Checking for correct implementation, expected learned behaviour, and satisfactory performance.
06 Sep 2020  ·  13 min  ·  machinelearning engineering python
Why read papers, what papers to read, and how to read them.
30 Aug 2020  ·  6 min  ·  learning career productivity
How not to become an expert beginner and to progress through beginner, intermediate, and so on.
Examining the broad strokes of NLP progress and comparing between models
16 Aug 2020  ·  22 min  ·  machinelearning deeplearning nlp survey
Why (and why not) be more end-to-end, how to, and Stitch Fix and Netflix's experience
09 Aug 2020  ·  15 min  ·  datascience machinelearning productivity career
Updating our FastAPI app to let users select options and download results.
05 Aug 2020  ·  3 min  ·  engineering python til
Surprising lessons I picked up from the best books, essays, and videos on writing non-fiction.
Why OMSCS? How can I get accepted? How much time needed? Did it help your career? And more...
I couldn't find any guides on serving HTML with FastAPI, thus I wrote this to plug the hole on the internet.
23 Jul 2020  ·  3 min  ·  engineering python til
Ever revisit a project & replicate the results the first time round? Me neither. Thus I adopted these habits.
19 Jul 2020  ·  12 min  ·  datascience productivity
It's not enough to have a good strategy and plan. Execution is just as important.
12 Jul 2020  ·  7 min  ·  datascience productivity
I wanted to add my recent writing to my GitHub Profile README but was too lazy to do manual updates.
11 Jul 2020  ·  3 min  ·  engineering python til
I thought giving it my all led to maximum outcomes; then I learnt about the 85% rule.
09 Jul 2020  ·  3 min  ·  productivity life
Part II of the previous write-up, this time on applications and frameworks of Spark in production
05 Jul 2020  ·  14 min  ·  machinelearning deeplearning spark production survey
Sharing my notes & practical knowledge from the conference for people who don't have the time.
28 Jun 2020  ·  11 min  ·  machinelearning deeplearning spark survey
After this article, we'll have a workflow of tests and checks that run automatically with each git push.
21 Jun 2020  ·  18 min  ·  engineering productivity production python
A curious discussion made me realize my expert blind spot. And no, Airflow is not late.
17 Jun 2020  ·  3 min  ·  datascience engineering production til
Haste makes waste. Diving into a data science problem may not be the fastest route to getting it done.
15 Jun 2020  ·  11 min  ·  datascience productivity
Initially, I didn't like it. But over time, it grew on me. Here's why.
07 Jun 2020  ·  10 min  ·  agile leadership datascience productivity
Crocker's Law, cognitive dissonance, and how to receive (uncomfortable) feedback better.
Can maintaining machine learning in production be easier? I go through some practical tips.
25 May 2020  ·  15 min  ·  machinelearning engineering production
I thought deploying machine learning was hard. Then I had to maintain multiple systems in prod.
18 May 2020  ·  13 min  ·  machinelearning engineering production
An expansion of my Twitter thread that went viral.
09 May 2020  ·  4 min  ·  writing
What I Learnt about evaluating ideas from first-hand participation in a hackathon.
03 May 2020  ·  7 min  ·  datascience lazada
What I learned about measuring diversity, novelty, surprise, and serendipity from 10+ papers.
26 Apr 2020  ·  11 min  ·  recsys machinelearning survey
Why you should give a talk and some tips from five years of speaking and hosting meet-ups.
18 Apr 2020  ·  7 min  ·  datascience career writing
Should I join a start-up? Which offer should I accept? A simple metaphor to guide your decisions.
12 Apr 2020  ·  6 min  ·  career
Using a Zettelkasten helps you make connections between notes, improving learning and memory.
05 Apr 2020  ·  6 min  ·  learning writing productivity
Writing begins before actually writing; it's a cycle of reading -> note-taking -> writing.
28 Mar 2020  ·  7 min  ·  learning writing productivity
Automate your experimentation workflow to minimize effort and iterate faster.
15 Mar 2020  ·  6 min  ·  datascience agile python productivity
How hard work, many failures, and a bit of luck got me into the field and up the ladder.
27 Feb 2020  ·  12 min  ·  career datascience lazada
Beating the baseline using Graph & NLP techniques on PyTorch, AUC improvement of ~21% (Part 2 of 2).
13 Jan 2020  ·  16 min  ·  recsys machinelearning python learning nlp
Building a baseline recsys based on data scraped off Amazon. Warning - Lots of charts! (Part 1 of 2).
06 Jan 2020  ·  12 min  ·  recsys machinelearning python learning
OMSCS CS6200 (Introduction to OS) - Moving data from one process to another, multi-threaded.
15 Dec 2019  ·  7 min  ·  omscs learning engineering
OMSCS CS6750 (Human Computer Interaction) - You are not your user! Or how to build great products.
Moving off wordpress and hosting for free on GitHub. And gaining full customization!
25 Aug 2019  ·  1 min  ·  misc
OMSCS CS6440 (Intro to Health Informatics) - A primer on key tech and standards in healthtech.
OMSCS CS7646 (Machine Learning for Trading) - Don't sell your house to trade algorithmically.
11 May 2019  ·  9 min  ·  omscs learning machinelearning
No, you don't need a PhD or 10+ years of experience.
30 Apr 2019  ·  8 min  ·  career datascience
Taking the best from agile and modifying it to fit the data science process (Part 2 of 2).
02 Feb 2019  ·  13 min  ·  agile datascience productivity
A deeper look into the strengths and weaknesses of Agile in Data Science projects (Part 1 of 2).
26 Jan 2019  ·  13 min  ·  agile datascience productivity
OMSCS CS6601 (Artificial Intelligence) - First, start with the simplest solution, and then add intelligence.
20 Dec 2018  ·  8 min  ·  omscs learning machinelearning python
OMSCS CS6460 (Education Technology) - How to scale education widely through technology.
OMSCS CS7642 (Reinforcement Learning) - Landing rockets (fun!) via deep Q-Learning (and its variants).
30 Jul 2018  ·  6 min  ·  omscs learning machinelearning python
Culture >> Hierarchy, Process, Bureaucracy.
12 May 2018  ·  5 min  ·  leadership datascience lazada
OMSCS CS7641 (Machine Learning) - Revisiting the fundamentals and learning new techniques.
27 Dec 2017  ·  4 min  ·  omscs learning machinelearning python
How being a Lead / Manager is different from being an individual contributor.
25 Sep 2017  ·  5 min  ·  leadership datascience lazada
OMSCS CS6300 (Software Development Process) - Java and collaboratively developing an Android app.
13 Aug 2017  ·  5 min  ·  omscs learning engineering
Tools and skills to pick up, and how to practice them.
25 Jun 2017  ·  8 min  ·  datascience learning career
OMSCS CS6476 Computer Vision - Performing computer vision tasks with ONLY numpy.
15 May 2017  ·  5 min  ·  omscs learning machinelearning python
If things are not failing, you're not innovating enough. - Elon Musk
19 Feb 2017  ·  3 min  ·  leadership datascience lazada
Or how to put machine learning models into production.
13 Feb 2017  ·  8 min  ·  machinelearning production python
A web app to find similar products based on image.
14 Jan 2017  ·  4 min  ·  machinelearning deeplearning python production
Cleaning up text and messing with ascii (urgh!)
11 Dec 2016  ·  8 min  ·  datascience machinelearning python
A simple web app to classify fashion images into Amazon categories.
27 Nov 2016  ·  2 min  ·  machinelearning deeplearning python production
Got accepted into Georgia Tech's Computer Science Masters!
A card sorting game to discover youl passion by identifying skills you like and dislike.
23 Oct 2016  ·  4 min  ·  misc
Parsing json and formatting product titles and categories.
11 Oct 2016  ·  8 min  ·  datascience python
Learning Scala from Martin Odersky, father of Scala.
31 Jul 2016  ·  4 min  ·  learning
Guest post of how DataKind SG worked with NGOs to frame their problems and suggests solutions
17 Sep 2015  ·  8 min  ·  datascience
Hey there. Didn’t expect anyone back here; this is where I started writing. What do I write about? Why do I write? Answers to these and more in the FAQ.
I write about data science, machine learning, and career. Join 700+ readers. Weekly updates.
Welcome gift: 5-day email course on How to be an Effective Data Scientist 🚀