3-Day Live Series

AI Builder Series for Product Managers

Three live, hands-on sessions on becoming an AI builder. Stop being just a document writer – use AI tools to research, validate, prototype, and ship without waiting on engineering cycles.

📅 When

June 2, 3, 4

Tuesday · Wednesday · Thursday | 8:30 PM – 10:30 PM IST

🎥 How

Live, hands-on, recorded

Recordings available – live participation strongly recommended

🎯 What you’ll learn to do

Research faster
Validate hypotheses
Build prototypes
Ship quickly
Measure product metrics
Get user feedback faster

🛠️ Tools we’ll explore

NotebookLM Cursor Claude Code WhisperFlow Vercel Supabase Lovable Google AI Studio MCPs

📺 Session Recordings & Notes

Each day’s video, slide deck, and detailed session notes. Switch tabs to navigate between days.

Day 1 · Tue, Jun 2

Become an AI Builder – Session 1

Why the PM workflow is inverted, the 5-level leverage ladder, and a live end-to-end build: Google AI Studio → GitHub → local with Anti-Gravity / Cursor.

📚 Resources

Slide Deck

🖼️ Slide Deck – Inline Preview

📝 Session Notes

🎯 What this session covers

Day 1 of a three-day AI builder series. The agenda:

Why the PM workflow is being inverted, and what AI unlocks (faster discovery, higher-impact ideas, smaller teams, cost-light experiments).
The AI leverage ladder – Level 0 to Level 5.
The full discovery → delivery → distribution flow viewed through an AI lens.
A live, end-to-end walkthrough of building a real app on Google AI Studio, exporting it to GitHub, cloning it locally, and editing it with Anti-Gravity or Cursor.

💡 Why it matters

AI is no longer optional. CEO memos from OpenAI, Shopify, and Perplexity have made AI usage a baseline expectation – top-down, not bottom-up like prior tech waves. The cost-cutting angle gets the headlines, but the bigger unlock is outcomes per person: a team that used to ship 4 experiments per quarter can now ship 40 or 100.

The competitive read: of every 10 employees, 8 may no longer be needed once AI is built into workflows. The remaining 2 will be the ones who captured it. AI is also the easiest opportunity to capture – roughly 3–4 months of focused practice covers most of it. The trap is that those 3–4 months take most people years to actually start.

🔑 Key takeaways

PMs were meant to spend most of their time on discovery but spend it on delivery – AI lets you reclaim that.
Discovery and delivery are not silos: experiments are discovery, and AI makes experiments cheap.
Motivation is perishable – AI collapses the gap between getting an idea and starting on it.
The cost worry around AI tools usually signals discovery hasn’t been done; compare AI tokens to a developer’s salary and the ROI is obvious.
The 5-level leverage ladder: hype → chat → Gems/Custom GPTs → automation platforms → vibe coding → coding agents.
The universal build flow: start in a vibe-coding tool → export to GitHub → clone locally → customise in Cursor or Anti-Gravity → deploy.

⚡ Why the PM workflow is broken

The PM role splits into three modes: discovery, delivery, distribution. Most PMs want to live in discovery – but actually live in delivery, coordinating engineers, designers, and stakeholders.

The reason is structural. Developer time is precious, so PMs are pushed into mediocre, conformist decisions: medium-impact ideas with safe effort, not high-impact bets. The cost of being wrong is wasted engineering hours. Founders gamble on big bets; PMs were structurally prevented from gambling.

Sam Altman’s day-one tweet about ChatGPT – “one of your worst launches so far” – captures it. Every great idea looks stupid initially.

🔓 What AI changes

Two specific unlocks:

Experiments stop being expensive. Real experiments with real users replace confirmation theatre.
Discovery, delivery, and distribution stop being silos. Experimentation is discovery, because real user insight only comes when users touch something.

The mantra: learning is course correction. Start with conviction. Build a small prototype with AI. Test. Commit engineering only after the bet is proven. On the distribution side: nobody saw an ad for Cursor, Whisper, or ChatGPT before using them – a compelling product distributes itself.

💸 Reframing the cost worry

A common pushback: “AI tools cost too much.”

The worry usually signals the discovery hasn’t been done. Compare AI tokens to a developer’s salary, headache, and timeline – not to a Netflix subscription. Companies invest heavily in AI precisely because they’ve already done that math.

The deeper ROI isn’t just money saved. It’s increased chances of success through faster iteration – and the preservation of motivation. Motivation is perishable. The time between idea and action determines whether the project happens at all, and AI compresses that gap dramatically. Treat the spend as leverage that compounds with practice, not as a learning tax.

🧱 The five building blocks of any digital product

Every digital product breaks into five parts:

Front-end – the interface
Back-end – the logic
Database – the storage
Hosting – the servers
Integrations – e.g., Google Maps inside Uber, Razorpay for payments

The architecture is simple. The historical bottleneck was language proficiency: HTML/CSS/JavaScript/React on the front-end, Python or PHP for back-end, SQL for databases. Most people with ideas couldn’t execute them.

AI dissolves that wall. A non-technical entrepreneur historically had to choose between hiring an intern (cheap, no taste) or a senior Apple designer (expensive, hard to attract). Now: point Claude or AI Studio at a benchmark output – screenshot Apple.com or Spotify and prompt “build this kind of UI for my product.” To avoid mirroring obvious brands, use ThemeForest – pick a paid theme, screenshot it, hand it to the AI.

The logic underneath: developers and designers built skill from years of training data. LLMs ingested far more of the same data, plus reinforcement-learned exemplars from the best practitioners. Treat them as employees, give them context, and they produce.

🪜 The 5-level AI leverage ladder

The rest of the program anchors on a ladder. Each level is a different way to use AI; mastery moves you up.

Level 0 – Hype / paralysis

AI is dismissed as hype, or analysis paralysis kicks in. The screen-time test: anyone has 1–2 hours a day for a month.

Level 1 – Web chat

ChatGPT, Claude, Gemini in the browser. Fine for personal productivity. No big use case cracked yet.

Level 2 – Reusable Gems, Custom GPTs, Skills

Repeatable instructions. Example: a PRD generator pre-loaded with five sample PRDs and brand voice. These share an instruction set, not a context window.

Level 3 – Automation platforms

n8n, Lindy, Relay, Zapier. Pull product feedback from App Store, Play Store, Reddit, Twitter into one dashboard. Auto-generate sales proposals when a lead fills a form.

Level 4 – Vibe coding

Actual products built in Lovable, AI Studio, Replit, or Bold. Example: a fintech PM building a PDF-to-structured-data tool so the loan team stops doing data entry.

Level 5 – Coding agents

Full control via Cursor, GitHub, Vercel – hosting and deeper customisation.

Two nuances

Levels 3 and 4 can swap order depending on the person.
Each level has its own sub-levels. A simple PRD prompt and a knowledge-base-backed agent that ingests PostHog data both live at Level 2 – but represent very different mastery.

🤔 “Won’t everyone be able to build now?”

Fair question. The answer: agency and experiences.

Photoshop, GitHub, and Visual Studio have existed for years – yet a top 1% of users exists in each. With the same content delivered to 200+ people, outcomes diverge based on what each person actually does with it.

The bigger picture: AI is at the 1995–96 internet stage. Google came in 1997. Amazon in 2001. Facebook in 2004. 99% of AI use cases aren’t discovered yet – the alphabets exist; the stories haven’t been written.

🧪 Live build – Google AI Studio → GitHub → local

The walkthrough covers a complete end-to-end build.

Step 1 – Build in a vibe-coding tool

Open Google AI Studio. Dictate (faster than typing) a prompt for the product. Example: “a modern-looking time tracker app – list of tasks, one-tap session logging, dashboard, database back-end, 4-digit PIN, light on code, minimal footprint.”

AI Studio builds it. The first run might error – click the auto-fix button. Iterate. When a bug surfaces (analytics dashboard empty after logging), click Fix again. Add enhancements as ideas come up (dark/light mode toggle).

Note: AI tools are probabilistic. The same prompt three hours apart can behave differently. Don’t get trapped in “one-shot the PRD or iterate?” debates. Experiment.

Step 2 – Export to GitHub

When the vibe-coding tool plateaus (model limits, hosting constraints), click Share → GitHub. Push the codebase into a private repo. Every platform – Lovable, Bold, Base44 – offers this.

GitHub is just cloud storage for code. Each product is one repository.

Step 3 – Clone locally

Download the GitHub Desktop app (no terminal required – the black-screen fear is overstated). Clone the repo locally. Open the folder in any IDE.

Step 4 – Customise with a coding agent

Open the folder in Anti-Gravity (Google’s coding agent, generous free tier) or Cursor (paid, excellent). Anti-Gravity, Cursor, Claude Code, Windsurf, and Codex are all in the same category: AI harness / agent tools that need a folder.

For any inherited codebase – including a company’s 20-year-old monorepo – the first prompt is always the same:

“Help me understand this codebase.”

The agent reads it and explains it. From there: ask the agent to do anything. Rename the product title across every page. Run the app locally with npm install and npm run dev. If the terminal commands look unfamiliar, screenshot and ask Claude.

Figuring out is the meta-skill – the AI itself is the tutor.

Step 5 – Deploy

Commit and push back to GitHub. Deploy via Vercel, Netlify, AWS, or GCP. Each platform’s “Publish” button is screenshot-and-ask territory. Learn to fish.

🌉 The bridge of discomfort

Between non-skilled and skilled lies a bridge of discomfort. Moving between five unfamiliar tools in one session is the bridge – not the destination. The discomfort fades with reps.

📋 Homework before Day 2

Build something on Google AI Studio.
Create a GitHub account.
Bring nuanced questions to Day 2.

Idea sources: your own life. Book summary tool. Productivity app. Anything.

Closing point: AI can code anything – calculators, mobile apps, PS5 games, full operating systems, robot control software – because everything is ultimately code. Claude Code plus Cursor can take you there.

Day 2 · Wed, Jun 3

Become an AI Builder – Session 2

The three-layer architecture in plain English. The recommended full-stack: Next.js + Supabase + Vercel. The .env discipline. A live build of “IdeaWorks” deployed end-to-end – with the first real production error along the way.

📚 Resources

Slide Deck

Feedback Form – fill out to get the FigJam Board, Ideas sheet & top 5 ideas

🖼️ Slide Deck – Inline Preview

📝 Session Notes

🎯 What this session covers

Day 1’s leverage ladder turned into a working production stack. The agenda:

The three-layer web architecture every product runs on – front-end, back-end, database.
Why Next.js + Supabase is the recommended starting stack for vibe-coded full-stack apps.
The full universal flow: GitHub → Cursor → Supabase → GitHub → Vercel.
How .env files keep keys safe and out of code.
A live, end-to-end build of “IdeaWorks” – an upvotable ideas board – deployed live to the world.

💡 Why it matters

Once you control this end-to-end flow, you stop renting from Lovable or Google AI Studio. You can swap models freely, host on your own infrastructure, integrate any service, and ship enterprise-grade work – though that part still depends on your architectural judgment.

AI is at 1995 internet stage. The people who learn to deploy – not just prototype – are the ones who will capture the unbuilt 99% of use cases. This session is where the program crosses from theory into Level 5 of the leverage ladder from Day 1.

🔑 Key takeaways

Every product runs on three layers: front-end, back-end, database – bound together by code, all of which AI can now generate.
For vibe-coded full-stack apps, default to Next.js + Supabase. Next.js runs on both server and browser; Supabase handles DB + auth as a service.
Never put keys in your code. Use .env files, ignored by Git, copied separately to Vercel.
The universal flow is GitHub → Cursor → Supabase → GitHub → Vercel – start anywhere on the chain and finish it.
Building is now easy; what and why you build is the hard part. Discovery still matters.
Errors are part of the loop: copy the Vercel log, paste it into Cursor, fix, redeploy.

🏗️ The three-layer architecture in plain English

Anchor everything on how a single page is generated. Type linkedin.com/network/connections into the browser. That request travels over the internet to a server – a computer LinkedIn rents somewhere in the world. The server reads the URL, understands “the user wants their connections,” runs a query against the database, fetches matching rows from two tables (connections + user profiles), packages the result into HTML / CSS / JavaScript, and ships the response back. The browser then paints the response.

The three roles map cleanly:

Browser / client = front-end
Server = back-end
Database = persistent storage

Inside the front-end: HTML provides structure (names, text). CSS provides style (bold weight, blue colour, spacing). JavaScript provides interactivity (click “Message” → side popover opens).

Mobile apps follow the same three-layer architecture; only the rendering language changes – XML for Android native, Dart for Flutter, Swift / Objective-C for iOS native, JavaScript again for React Native.

The point of restating this: every layer is just code, and code is exactly what LLMs are best at producing.

⚙️ The recommended stack – Next.js + Supabase

Why Next.js

When vibe-coding from scratch, the strong default is JavaScript via Next.js. JavaScript is the only language that runs on both the server and the browser. Next.js is currently the most performant framework for full-stack apps, and frontier models are extensively trained on it – so they produce the best output. Anything else works, but Next.js gives the cleanest path right now.

Why Supabase

For back-end + database + auth, the pick is Supabase (competitors: Firebase, InstantDB). The pitch: no need to design schemas, write SQL, or stand up servers. Create a Supabase project – Cursor generates the schema and connection code – and Supabase handles user signup, login, password rules, email verification, OAuth providers, and 50,000 monthly active users on the free tier. For toy projects and most early-stage products, you’ll almost never hit those limits.

Enterprise-grade – can this stack scale?

Yes. Fortune 500 companies and their PMs already use Cursor heavily. The honest caveat: producing enterprise-grade output requires architectural judgment, the same way Photoshop can make a Hollywood-grade poster but only in the hands of someone who knows composition. Start with prototypes. Learn the architecture. Then scale up.

🔐 The `.env` discipline

A non-negotiable rule, taught early: code never contains keys, passwords, or secrets.

Even in a private GitHub repo, sensitive values go into .env (or .env.local) files. A .gitignore file tells Git to skip these on every push. When you deploy on Vercel, it asks you to paste the environment variables separately, which it stores securely outside the codebase.

If you ever push a key accidentally, the only safe recovery is to revoke and regenerate the key. No exceptions.

🛠️ The build flow, step by step

Step 1 – Create an empty repo on GitHub Desktop

Create a new repository in GitHub Desktop. Note where the local folder lives – you’ll need that path to open it in Cursor.

Step 2 – Open the folder in Cursor

Open the folder in Cursor. Start a new agent window.

Step 3 – Prompt the build

Prompt with what to build, explicitly mentioning “use Next.js, use Supabase for backend, database, and authentication.”

Step 4 – Cursor scaffolds the project

The agent creates the Next.js project structure: src/app for pages, src/components for reusable components, plus dependencies.

Step 5 – Wire up Supabase

The agent typically generates an instruction file telling you to:

Create a Supabase project.
Paste a SQL schema into the Supabase SQL editor (this creates your tables).
Enable the email auth provider.
Add your redirect URL.

Step 6 – Copy keys into `.env.local`

Copy your Supabase project URL and publishable key into .env.local.

Step 7 – Run locally

In the terminal:

npm install – installs dependencies.
npm run dev – launches the app on a localhost port.

Step 8 – Test in the browser

Step 9 – Push to GitHub

Commit and push via the GitHub Desktop client.

Step 10 – Deploy on Vercel

On Vercel, click Import Project. Your repos appear automatically because Vercel is already connected to your GitHub. Paste the same environment variables when prompted. Click Deploy. Receive a live URL.

💡 Live build – “IdeaWorks”

The prompt

A minimal web app where users sign in via email, create discussion threads, post ideas as comments, upvote each other’s ideas, and see a leaderboard sorted by votes. Style instructions: “modern UI, subtle animations on upvote and post.” Explicit deployment hint: “keep code minimal – I’m going to deploy via Vercel.”

What happens

Cursor scaffolds the project. Paste the Supabase SQL. Enable email auth. Add the redirect URL. Copy the keys into .env.local. Run npm run dev. Sign-up works on the first try. The magic-link email arrives. The app shows a thread titled “True.”

A small surprise: the prompt was vague on the leaderboard timer (“5 or 10 minutes”), and the agent chose its own intelligent default. The takeaway:

Your app is limited by your imagination and your instructions. Vague prompts produce reasonable guesses; precise prompts produce intent.

Upgrading the UI mid-build

To upgrade the look, search Dribbble for the “bento” pattern (the card grid Apple uses). Copy an image. Paste it into Cursor with “make the UI like the one shown in the image.” The agent restyles the layout.

Two related tools worth knowing:

Google Stitch and Claude Design generate UI mockups you can hand off to Cursor for code.
For images inside the app, hand Cursor an OpenAI or Gemini API key in .env and ask it to generate images directly. Rotate the key after.

🌐 Deploying to Vercel – and the first real error

Push to GitHub. On Vercel, click Add New Project. The IdeaWorks repo appears (private repos work because Vercel has GitHub permission). Vercel auto-detects Next.js.

Paste the two environment variables (Supabase URL and key) into Vercel’s env panel – separately, one for each variable.

Click Deploy.

Build succeeds – page errors out

This is the real PM moment of vibe coding.

Go to Vercel → Runtime Logs.
Find the database configuration error message.
Copy the error back into Cursor.
Ask the agent to fix it.
Cursor patches the code. Commit. Push.
Vercel auto-redeploys.

One more wrinkle: re-run the Supabase schema (because the corrected code adjusted some tables) and re-add the redirect URL. Manual tweaks don’t disappear with AI – you just need to know the loop.

Between non-skilled and skilled lies a bridge of discomfort. The only way across is doing it.

🧠 Mindset notes around the build

Plan mode vs. agent mode in Cursor

Run plan mode first so the agent describes its plan before executing. This protects you from runaway changes. Useful default.

The auto model selector

Cursor’s auto selector picks the cheapest sufficient model – Composer 1/2 for trivial work, Claude Opus-class for important work. Override when you need to.

No alternative to trying things yourself

No tutorial covers every error you’ll hit. A screenshot pasted into Cursor or ChatGPT will resolve almost any block.

On cost

$100–$500 in tokens vs. a developer’s salary, plus delay, plus headache, produces an obvious ROI – if you’ve done proper discovery. People who can’t justify token spend usually haven’t validated the underlying opportunity.

Product discovery still matters more than ever

When anyone can build anything, what you build and why becomes the only real moat. Use AI for discovery, but combine it with talking to real users and your own judgment.

✅ Wrap-up – the chain to repeat

By session end, you should be able to repeat this chain end-to-end:

Empty GitHub repo.
Open in Cursor.
Prompt to build with Next.js + Supabase.
Run Supabase SQL.
Paste keys into .env.local.
npm run dev.
Test in browser.
Commit and push.
Vercel import.
Paste env variables.
Deploy.
Copy error logs back to Cursor on failure.
Redeploy.

Doing this puts you at Level 5 of the leverage ladder. The stack no longer constrains you – sky’s the limit.

📋 Homework before Day 3

Create accounts on GitHub, Supabase, Vercel.
Add up to 10 one-line app ideas to the shared Google Sheet (link in the Feedback Form in Resources above).

The five most compelling ideas will be picked, and Day 3 will build one of them together. Day 3 also goes deeper into Claude Code.

Day 3 · Thu, Jun 4

Become an AI Builder – Session 3

The final session pulls the series together. What an agentic platform is (plan → act → reflect), how Claude Code works, the difference between parameter and context memory, why Skills beat prompts and Gems, what MCP really is, and the 7-task public challenge with certification and reimbursements.

📚 Resources

Slide Deck Jamboard Skills.sh PM OS Architecture OpenClaw

🖼️ Slide Deck — Inline Preview

📝 Session Notes

🎯 What this session covers

The final session pulls the AI builder series together. The agenda:

What an agentic AI platform actually is — the plan → act → reflect loop.
The three ways to use Claude Code, and why the desktop app is the strong default.
The difference between parameter memory and context memory.
How Skills beat both prompts and Gems via progressive disclosure.
What MCP really is and why it matters.
How environments, subagents, and HITL permissions shape what an agent can safely do.
A 7-task public challenge with reimbursements and certification for the people who commit.

💡 Why it matters

Anything covered in a hundred more sessions is meaningless without deliberate practice. Companies are now using token consumption as a performance metric — not because executives are crazy, but because they know employees need external pressure to actually try AI. Until you build something end-to-end yourself, fear doesn’t lift.

The 7-task challenge is the cure for that fear. The deeper concepts in this session — Skills, MCP, subagents — are the building blocks for Level 5 work, where you stop renting from Lovable or AI Studio and start owning your stack.

🔑 Key takeaways

The agentic loop is universal: plan → act → reflect. When reflection passes, output; when it fails, replan.
Every model has two memories: parameter (training, long-term) and context (current chat). Context bloat means slower, costlier, less accurate output.
Skills beat both prompts and Gems via progressive disclosure: only the skill’s description loads by default; the full body fires when needed.
MCP is the universal adapter — the “USB-C” — between an LLM and any external API or data source.
Subagents delegate work and protect the main context. The CEO doesn’t do everyone’s job.
HITL permissions in agentic tools give four safety dials, from “ask every time” to “bypass everything.” Pick deliberately.

🧭 The Frankenstein recap and the big mindset

On Day 1, you were the Frankenstein doctor — stitching together something that “barely works” with only Lovable or AI Studio. Two days later, you can build, export, deploy, and customise independently.

The mindset that anchors the whole series:

AI unlocks the knowledge worker: understand → transform → generate.
Applications are limited only by your imagination. We’re at internet circa 1990s — Google didn’t exist until 1997, Amazon 2001, Facebook 2004.
Figure out what to do with AI first (optimisation vs. innovation), then figure out how.
Deliberate practice beats job-based learning. Virat Kohli puts in thousands of net hours before an ODI. The lazy default of “I’ll learn on the job” doesn’t ship.

The end-to-end Day 2 flow restated for new joiners: GitHub repo → open in Cursor / Claude Code / Anti-Gravity → build or modify → push to GitHub → deploy on Vercel. Same flow whether you’re going zero-to-one or modifying Flipkart’s checkout codebase. The only difference is who owns the original repo and whether you ship to staging or to Vercel.

🔄 Agentic platforms and the plan-act-reflect loop

Cursor, Claude Code, Anti-Gravity, Codex, Hermes, OpenCloud — these are all agentic AI platforms. Each runs the same three-stage loop:

Plan — understand a PRD; research what one looks like if it’s unfamiliar.
Act — execute via tools: generate code, run commands, call web search, hit APIs / MCPs, spawn subagents.
Reflect — compare the output to the goal. Sometimes even test the running app in a built-in mini-browser.

When reflection fails, the loop runs again. Modern agents can self-test their own builds — they’re vision models, so they can look at the rendered UI and judge whether it matches the plan.

This is the evolution from LLM-as-Q&A (you ask, it answers) to LLM-as-agent: text in → tools, environment access, and iteration → real action out. Multi-purpose agentic platforms expose this loop with full tooling.

🛠️ Claude Code — the tool of the day

Claude Code is recommended in three flavours, in increasing order of usability.

1. Terminal version

Install via claude.com/install, paste a command. Works, but scares non-technical people and loses your history between sessions.

2. VS Code / Cursor extension

Better — it lives inside an IDE you already use.

3. Claude Desktop app — the strong recommendation

Preserves history. No plugin gymnastics. Stays out of the way.

Inside the desktop app:

Chat panel — mirrors Claude.ai for conversation.
Co-work — for knowledge work like slides and documents.
Code — the powerful surface. Point it at a local folder, give natural-language commands. It can read, write, delete, and execute files, and call connected tools.

🧠 Parameter memory vs. context memory

The live demo — LearnGPT

A live build: LearnGPT, a side-by-side chat where you can compare models with memory on or off.

With memory off: tell the model “my name is Ankit,” then ask “who am I?” — the model says it can’t tell. Different runs of the same prompt produce slightly different word counts (39 vs. 51 tokens) because LLMs are probabilistic.

With memory on: the model recalls the name — but the second message now uses ~88 tokens because the entire prior chat is re-sent every turn.

The teaching point

Parameter memory is what the model learned in training — knowledge cutoff territory. (“Who was the US President in 2020?”)
Context memory is everything in the current conversation, including any retrieved web search or attached files.

Context grows linearly with the chat. As it grows: slower responses, higher cost, more hallucination.

The fix — memory.md

Six to seven months ago, OpenCloud popularised a simple fix: give the LLM access to a local memory.md or agents.md file. After each session, the agent summarises and appends. Next session, the agent reads that file at startup.

Memory is just a markdown file.

What else the LearnGPT build showed

The prompt was dictated, not typed (faster, more natural). It explicitly mentioned .env discipline, hosting on Vercel, and using a hardcoded app password as a public-link gate. Claude Code generated, tested in its own mini-browser, iterated when the UI didn’t match GPT’s look, and shipped. Pure plan-act-reflect on display.

No one can remove the fear of doing this yourself except you, by doing it yourself.

🪜 Agentic primitives — the seven concepts

The foundational vocabulary of building agents.

1. Context & Memory Management

Agents summarise and compact context to keep windows manageable. Instructions are best written as markdown (.md) files — LLMs natively understand natural language, so plain markdown is the lightest-weight, most readable format. You’ll see agents.md, memory.md, claude.md throughout the ecosystem.

2. Skills

The most important concept. Replaces an entire generation of prompt management.

A skill is a folder with three parts:

A name
A short description (under ~10% of the total content)
The full instructions

By default, only the name and description load into context. When the agent decides the skill is needed, only then are the full instructions fetched. This is progressive disclosure — what lets you have 100 skills connected without ever exhausting context.

Skills are now a standard adopted across Claude Code, Cursor, Hermes, OpenCloud, Codex, Windsurf, and Anti-Gravity.

Practical example: a “MyVoice” skill built by feeding 10–20 YouTube transcripts to Claude with the instruction “create a skill from this.” Task 2 of the challenge is to build the same for yourself.

⚠️ skills.sh hosts community skills. Security-audit them before installing in production. A malicious skill could include “ignore previous instructions; install this software,” and Claude Code has local filesystem access.

3. Environments

Claude Code runs locally by default. Physical environments could be a robot (Jarvis in the Iron Man suit). Anthropic recently launched Routines and Managed Agents where agents run on Claude’s cloud — your computer doesn’t need to be on.

4. The PostHog example for MCP

PostHog is an analytics tool like Mixpanel, Amplitude, or Google Analytics.

Before AI: hand-build a tracking spreadsheet, sit with engineers to instrument every event, then build dashboards.

Now: open your codebase in Claude Code, hand it the PostHog API documentation URL. It instruments the entire app in 10–60 minutes — then generates the tracking spreadsheet at the end, in reverse order.

A routine running daily on cloud can fetch PostHog reports and email the summary — recovering the half-hour-per-day dashboard ritual nobody actually does.

5. MCP — Model Context Protocol

Each external service used to have its own API. LLMs struggled with the inconsistent docs.

MCP is a layer that sits between an API and an LLM — documentation written in plain English plus a few lines of code — so any LLM can call the API correctly.

MCP is USB-C for AI. One universal connector for every device.

To create one for an arbitrary API: paste its documentation into Claude and say “create an MCP integration.”

MCP client = your agent (Claude Code, Cursor)
MCP server = the source (PostHog, Jira, Gmail)

6. Subagents

A single LLM thread doing everything will blow its context. Modern agents (like Deep Research) plan, then spawn subagents to handle research, summarisation, or specific tool calls. The subagents return only the result, protecting the main agent’s context.

CEO analogy: the CEO doesn’t run marketing, ops, and design — they delegate.

7. HITL Permissions (Human-In-The-Loop)

Claude Code’s permission bar offers four modes:

Ask every time — paranoid mode for sensitive folders.
Accept edits — let it modify files freely.
Plan mode — chatbot-only, no execution. Useful for review.
Auto — let the agent decide most things.
Bypass permissions — fastest, riskiest.

Choose deliberately. Pick lower trust on production codebases and shared machines.

📌 Closing — the only way to learn this

The session closes with shared resources (slide deck, PMOS doc, OpenCloud architecture link, skills.sh) gated by the Feedback Form linked in Resources above.

Time spent in three sessions is wasted unless you actually ship the tasks. The only way to learn this is to do it.

🏗️ #BuildInPublic

The 7 Quests #BuildInPublic Challenge

7 builds. 21 days of June. One rule: ship in public. The AI Builder Sprint is ending with a challenge, not a celebration.

📅 Deadline

25th June

🏆 Rewards

AI Builder Certificate
AI Builder – Hall of Fame
Upto $25 Reimbursement of AI Tool used – for Top 50 builders

⚡ The 7 Quests

1
Product Discovery interactive guide
Build an interactive guide that teaches YOU Product Discovery.
2
Personal voice skill
Create your personal voice skill — one that communicates in your style. Pull from LinkedIn posts and writing samples.
3
JD-to-resume customiser
Build a tool that takes a job description and customises your resume to match. Outputs a finished PDF — not a chat-bound Gem.
4
Weekly funded-companies agent
Build a weekly agent that scrapes last week’s funded startups from the web and proposes your way in — a tailored application path.
5
Duolingo-style AI concepts learner
Build an app that makes you learn AI concepts the Duolingo way. Wire it up with PostHog for analytics.
6
GTM with AI-generated videos
Run a full GTM with AI-generated videos (Higgsfield, Google Veo). Land your first 10 users on Instagram or LinkedIn.
7
Your own PM AI Agent
Fork Hermes or OpenClaw on GitHub and ship your own Product Manager AI Agent.

📋 How to participate

Pick these quests one by one.
Build through June.
Post each build on LinkedIn with #BuildInPublic and tag HelloPM.
Submit your LinkedIn Post links through the link below.
Every submission is reviewed. The best builders are featured in the Hall of Fame.
No fee. No catch. Just proof that you can build.

🚀 Submit your build

The submission link will be updated here shortly. Check back soon

The PMs who will matter will be the ones who shipped something. Not the ones who wrote about it.

Exclusive Resource

AI Builder Module – Jun 2026 | HelloPM

Verify Your Email

AI Builder Module – Jun 2026 | HelloPM

AI Builder Series for Product Managers

🎯 What you’ll learn to do

🛠️ Tools we’ll explore

📺 Session Recordings & Notes

Become an AI Builder – Session 1

📚 Resources

🖼️ Slide Deck – Inline Preview

📝 Session Notes

🎯 What this session covers

💡 Why it matters

🔑 Key takeaways

⚡ Why the PM workflow is broken

🔓 What AI changes

💸 Reframing the cost worry

🧱 The five building blocks of any digital product

🪜 The 5-level AI leverage ladder

Level 0 – Hype / paralysis

Level 1 – Web chat

Level 2 – Reusable Gems, Custom GPTs, Skills

Level 3 – Automation platforms

Level 4 – Vibe coding

Level 5 – Coding agents

Two nuances

🤔 “Won’t everyone be able to build now?”

🧪 Live build – Google AI Studio → GitHub → local

Step 1 – Build in a vibe-coding tool

Step 2 – Export to GitHub

Step 3 – Clone locally

Step 4 – Customise with a coding agent

Step 5 – Deploy

🌉 The bridge of discomfort

📋 Homework before Day 2

Become an AI Builder – Session 2

📚 Resources

🖼️ Slide Deck – Inline Preview

📝 Session Notes

🎯 What this session covers

💡 Why it matters

🔑 Key takeaways

🏗️ The three-layer architecture in plain English

⚙️ The recommended stack – Next.js + Supabase

Why Next.js

Why Supabase

Enterprise-grade – can this stack scale?

🔐 The .env discipline

🛠️ The build flow, step by step

Step 1 – Create an empty repo on GitHub Desktop

Step 2 – Open the folder in Cursor

Step 3 – Prompt the build

Step 4 – Cursor scaffolds the project

Step 5 – Wire up Supabase

Step 6 – Copy keys into .env.local

Step 7 – Run locally

Step 8 – Test in the browser

Step 9 – Push to GitHub

Step 10 – Deploy on Vercel

💡 Live build – “IdeaWorks”

The prompt

What happens

Upgrading the UI mid-build

🌐 Deploying to Vercel – and the first real error

Build succeeds – page errors out

🧠 Mindset notes around the build

Plan mode vs. agent mode in Cursor

The auto model selector

No alternative to trying things yourself

On cost

Product discovery still matters more than ever

✅ Wrap-up – the chain to repeat

📋 Homework before Day 3

Become an AI Builder – Session 3

📚 Resources

🖼️ Slide Deck — Inline Preview

📝 Session Notes

🎯 What this session covers

💡 Why it matters

🔐 The `.env` discipline

Step 6 – Copy keys into `.env.local`