You Are Optimizing for the Wrong Metric

We love to optimize Technical people love to optimize every little thing. For example, I am always thinking about how to use my time, money, and knowledge in the most effective way to the point of getting upset when I do things suboptimally. However, everyone optimizes for different things. When I had my first software engineering job, I was learning to use vim and I downloaded a bunch of different plugins to make the app prettier and have more functionality. Then, when I finally had my setup perfect with 10 different plugins downloaded, I saw that my boss was using stock vim! How can you get anything done without neotree?? He viewed vim as simply a means to an end. How much do plugins really help productivity? Was I really optimizing for the right thing by using my time to learn these plugins? ...

May 22, 2026 · 5 min · Lucas Pauker

Your Prod Code Should Have Bugs

Debugging prod after shipping on Friday Your job as a software engineer Every line of production code is a bet on speed, safety, and correctness. When you ship software, you are balancing: Getting it done fast -> shipping faster means the feature appears earlier for customers and ultimately the business makes more money Not introducing bugs -> bugs can cause outages or other errors that lead to losses The problem is that anyone that has worked with software knows that no matter what, you cannot guarantee that your shipped software will not have bugs. In fact, your software will always have bugs. No matter how many “lgtms” you get on your pull request, bugs always find a way of creeping in. ...

January 20, 2026 · 7 min · Lucas Pauker

Home Run Modeling

Why home runs? Some of the best moments in baseball games are home runs. Something about hitting the ball out of the park is satisfying. Since baseball season just started, I wanted to model a part of the game. I decided to model home runs since they are pretty rare events but should still be able to be accurately predicted. When I say accurately predicted, I mean that we can accurately predict the probability of a player hitting a home run. At the end of the day, we can only compare to the true data…but we have many years of baseball data (baseball is nice since there are very detailed statistics and metrics for each player for 100+ years). ...

February 28, 2024 · 10 min · Lucas Pauker

OpenAI Model Timing

Introduction The goal of this article is to explore the latency of different OpenAI models. When using AI models in production, latency is an important factor to consider. Comparing Model Architectures First, I test the latency for different OpenAI models. I test the following models: gpt-4, gpt-4-0613, gpt-3.5-turbo, gpt-3.5-turbo-0613, gpt-3.5-turbo-16k, gpt-3.5-turbo-16k-0613, text-davinci-003, text-davinci-002, text-davinci-001, text-curie-001, text-babbage-001, text-ada-001, davinci-002, babbage-002, davinci, curie, babbage, and ada. These are all the OpenAI models that are available for inference through the chat and completions endpoints. The models can be divided into chat models, instruct models, and base models. Chat models are gpt-4 and gpt-3.5 and are LLMs that are optimized for chat. Instruct models are models that are trained with reinforcement learning through human feedback to follow instructions [1]. ...

October 15, 2023 · 4 min · Lucas Pauker

LLMs Unleashed: The Power of Fine-Tuning

Disclaimer: This article mentions https://terra-cotta.ai/, an LLM experimentation platform I am building Introduction ChatGPT, Bard, and other large language models (LLMs) are very useful for a wide variety of tasks from writing code to answering complex questions to aiding with education. However, these models are ultimately limited by the data that they are trained on. Also, these models are trained to be able to answer a wide variety of questions which may not be sufficient for domain-specific questions. Fine-tuning is essential in order to make these models accurately answer domain-specific questions and be useful for difficult tasks. Furthermore, fine-tuning may be cheaper for inference. ...

July 23, 2023 · 5 min · Lucas Pauker

50 AI Applications

Advancements in artificial intelligence and language models have made significant impacts in various fields from healthcare to finance to entertainment. Here are 50 practical applications of AI that are currently in use or have the potential to be implemented in various industries. Let me know if any of these ideas inspire you or if you build any of them! Text Analysis Automatically generate outlines or summaries of news articles. Find fake news and provide a citation with the real source. Determine the political affiliation and bias of a news article. Generate better captions/subtitles for shows translated from foreign languages. Generate lyrics for songs based on a topic and song structure. Generate summary of court opinions or court transcripts to understand legal proceedings better. Annotate legal documents for specific clauses (https://github.com/lucaspauker/extract-contract). Find similar academic papers to an input prompt. Generate lyric annotations (like Genius) automatically by using information on the internet. Better Google search suggestions with language models. Generate personalized poems for someone. AI Assistance Generate code from comments or pseudocode. Translate code from one coding language to another. Generate automatic email reply suggestions based on your writing style. Generate a cover letter based on your resume and the company website you are applying for. Suggest edits to a paper like Grammarly. Automatically generate simple math questions (for example arithmetic word problems). Automatically grade essays or automatically grade the ACT/SAT writing section. Suggest next workout based on previous workouts and goals. For doctors, given a patient, find similar patients and summarize their diagnoses in order to help diagnose the patient. Generate slogan based on product description. Generate Latex notes from a transcript of a lecture. Generate presentation slides based on a paper. Write an abstract for an academic paper given the rest of the paper. Do a command on a browser given text input for example: “buy a plane ticket on United from SFO to ORD.” Autocomplete sentences in Google docs using AI. Travel and Leisure Generate travel itinerary based on user preferences. Generate meal plans based on caloric goals and food preferences. Suggest outfits to wear based on a user’s fashion preferences. Suggest meals based on ingredients someone has in their kitchen. Generate sports game commentary based on context of what’s happening in the game. Generate a text-based game with AI that has infinite possibilities. Automatically generate responses for video game characters using AI. Image Grade a student’s work with a picture of the student’s assignment and a picture of the answer key. Improve accessibility on the web by automatically generating captions for images. Convert handwritten text from an image to digital text. Diagnose medical conditions by analyzing medical images. Generate images for advertising campaigns based on existing images. Create design assets in Figma based on text ideas and pictures. Count the number of cars or buildings in satellite images. Video Generate a transcript by lip-reading a video. Create annotations for key events in a video. Detect and label important objects in video footage. Generate a Tinder profile based on someone’s Instagram profile. Increase the frame rate of animation by generating extra frames. Audio Generate lyrics from audio. Given audio and the lyrics, output the timestamp of each lyric. Transcribe speech from audio. Generate sheet music or MIDI from audio. Critique a song by analyzing its musical elements and style given audio.

April 4, 2023 · 3 min · Lucas Pauker

Blackjack Reinforcement Learning

Introduction I recently read Ed Thorpe’s Beat the Dealer, a book about how Thorpe, a mathematician, found a way to gain an edge in blackjack. In the book, Thorpe uses computer simulations to calculate the best blackjack strategy as well as card-counting strategies. Since I took a reinforcement learning class last quarter, I wanted to apply one of the most common algorithms, Q-learning, to to find the best strategy for blackjack. ...

April 5, 2021 · 11 min · Lucas Pauker

Achieving Quantum Supremacy, Qubit by Qubit

Faster than a Supercomputer? In the 1980s, American physicist Richard Feynman proposed the idea of quantum computers to model complex quantum systems. In October 2019, around 40 years later, Google AI and NASA scientists unveiled a quantum computer which ran an experiment in a few minutes that would take the fastest supercomputer 10,000 years. The quantum computer sped up the computation by a factor of 1 billion! This was one of the first major successes in the nascent field of quantum computing. ...

March 11, 2021 · 8 min · Lucas Pauker

Simple Stock Market Models with Python

Introduction In this blog post, I will implement a few simple time series models of a stock price over time. I will also see how they do if we trade using them. We will look at moving averages (MA) and exponential moving averages (EMA). Data First, we need to download the price data. For this article, we will use SPY historical open price data. We can download this from Yahoo Finance. Now, let’s process the data into a dataframe and split the data into test and train datasets. ...

December 20, 2020 · 4 min · Lucas Pauker

Solar Flare Time Series Research

Introduction I spent the summer of 2019 as a physics research intern at the Stanford University Solar Lab. I was very fortunate to have a wonderful advisor and had a great summer overall. I created machine learning models to characterize time series data for solar flare prediction. In this article, I will first provide some physics background about solar flares, then dive into my research. For a more in-depth analysis, check out the source code and my poster. ...

September 15, 2019 · 10 min · Lucas Pauker