What Are LLMs and Why Should You Care?
Listen First
Start with the podcast. Two hosts walk you through everything you need to know about LLMs - what they are, how they work, and why they sometimes get things wrong.
What You'll Learn
- The ChatGPT moment
- How LLMs are trained
- Prediction vs thinking
- LLM ≠ computer
- Tokens & costs
- Context window
- Knowledge cutoff
- Hallucination
Study the Visual
See how LLM prediction actually works - from training data to next-token output.
- An LLM is a prediction engine, not a thinking machine
- It's trained on text data - that data IS its worldview
- It's probabilistic (not a computer) - optimised for fluency, not truth
- Everything runs on tokens, with a hard context window limit
- On its own: no internet, no memory, no action
The ChatGPT Moment
In November 2022, OpenAI released ChatGPT. The technology wasn't new - Google researchers invented the underlying transformer architecture in 2017 - but the simple chat interface made it accessible to everyone, the way the web browser did for the internet.
How LLMs Are Built
An LLM is trained on enormous amounts of text - books, websites, papers, code. This training data IS its entire worldview. It has read about the world but never experienced it.
Predicts, Doesn't Think
An LLM predicts the most likely next word based on patterns. It's the world's most sophisticated autocomplete. It does not reason or understand.
Not a Computer
Computers are deterministic - 2+2 always = 4. LLMs are probabilistic - they deal in likelihood, not certainty. That's why they can't reliably count the R's in "strawberry" (they see tokens, not individual letters).
Tokens & Context Window
A token ≈ ¾ of a word. Everything costs tokens. The context window is the AI's desk - everything must fit. Some models handle up to 1 million tokens (~1,500 pages), but instructions, conversation, and answers all share that space.
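The "everything shares the desk" idea can be sketched as simple arithmetic. The ≈¾-word rule comes from the text above; the window size and numbers are illustrative, not any specific model's real figures.

```python
# Rough token budgeting. Rule of thumb from above: 1 token ≈ 3/4 of a word,
# so token count ≈ words × 4/3. Real tokenisers differ; this is an estimate.

def estimate_tokens(text: str) -> int:
    """Approximate token count: words × 4/3, rounded up."""
    words = len(text.split())
    return -(-words * 4 // 3)  # integer ceiling of words * 4/3

def fits_in_context(instructions: str, conversation: str, reply_budget: int,
                    context_window: int = 200_000) -> bool:
    """Instructions, conversation, and the answer all share one window."""
    used = estimate_tokens(instructions) + estimate_tokens(conversation)
    return used + reply_budget <= context_window

print(estimate_tokens("The context window is the AI's desk"))  # 7 words → ~10 tokens
```

The key takeaway is the last function: a long conversation or a long set of instructions quietly shrinks the space left for the answer.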
Knowledge Cutoff
LLMs are frozen in time after training. They don't learn from new events. Ask about last week's news and they'll either say "I don't know" or confidently make something up.
Hallucination
When AI generates confident but completely made-up information. It's optimised for fluency, not truth. It has no fact-checker and no humility - it can't say "I'm not sure."
What LLMs Cannot Do (On Their Own)
- Browse the internet - frozen at training cutoff
- Remember past conversations - every chat starts from zero
- Take action - can write an email but can't send it
- Have you used ChatGPT or another AI tool? What was your experience?
- Can you think of a time you trusted information that turned out to be wrong? How does that relate to hallucination?
- What surprised you most about how LLMs actually work?
- What's one thing you'd want to verify before trusting an AI answer?
Day 1 Complete!
You've mastered the fundamentals. Before moving on - see how fast this all happened:
🎬 Bonus: Ryan Serhant on AI
You know him from Owning Manhattan - hear his take on AI and trust.
The Landscape: Who's Who and What to Use
AI is not one thing from one company. Different models for different jobs. Smart companies match the tool to the task.
Day 2 is locked
Complete Day 1 flashcards to unlock.
Listen First
Today we zoom out from the technology to the landscape: who builds these AI models, why they all feel different, and how NYMG navigates the choices.
What You'll Learn
- The major AI companies
- What "open source" means
- Why models feel different
- Model versions & upgrades
- What model "size" means
- Thinking / Doing / Checking
- NYMG's AI journey
- Passive vs Active AI
Study the Visual
The wider AI landscape at a glance - and the simple setup NYMG uses in practice.
- AI is not one company - it's a global ecosystem of providers
- Different models feel different because they're optimised for different goals
- Models come in tiers: thinking (expensive), doing (workhorse), checking (cheap)
- NYMG uses multiple tools - Claude for heavy lifting, ChatGPT for helpdesk
- Everything so far is passive AI - you ask, it answers
The Major Players
| Company | Model | Known For |
|---|---|---|
| OpenAI | GPT / ChatGPT | Started the revolution. Biggest consumer AI. |
| Anthropic | Claude | Safety-focused. Massive context window. NYMG's primary AI. |
| Google | Gemini | Invented the underlying technology (2017). Integrated with Workspace. |
| Meta | Llama | Open source - free for anyone to download and use. |
| xAI | Grok | Elon Musk's company. Growing quickly. |
| DeepSeek | DeepSeek | Chinese. Open source. Competitive quality. Free. |
Why Models Feel Different
Same question, three different answers. ChatGPT is designed to make you feel good (warm, conversational). Claude is designed to make you think (precise, careful). Each model has its own personality, shaped by how it was trained.
Models Change
AI models get version updates (claude-sonnet-4-6 replacing earlier Sonnet generations). Behaviour, formatting, and tone can shift. Companies track versions and test after updates. Benchmarks compare models objectively, but don't tell the full story.
Model Tiers - Thinking / Doing / Checking
| Tier | Example | Use For | Cost |
|---|---|---|---|
| Thinking | claude-opus-4-6 | Complex reasoning, strategy | $$$ |
| Doing | claude-sonnet-4-6 | Everyday tasks (90% of work) | $$ |
| Checking | claude-haiku-4-5 | Quick grammar, formatting | $ |
The same task can cost 10× more on a thinking model than on a checking model.
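The tier table above can be turned into a simple router. This is a sketch: the keyword rules are naive placeholders and the per-token prices are invented to mirror the $/$$/$$$ column, not real pricing.

```python
# Illustrative tier router. Prices are placeholders that mirror the
# $ / $$ / $$$ column above, not real rates.
TIERS = {
    "thinking": {"model": "claude-opus-4-6",   "price_per_1k_tokens": 0.050},
    "doing":    {"model": "claude-sonnet-4-6", "price_per_1k_tokens": 0.010},
    "checking": {"model": "claude-haiku-4-5",  "price_per_1k_tokens": 0.005},
}

def pick_tier(task: str) -> str:
    """Naive keyword routing: strategy work up, quick checks down, default 'doing'."""
    task = task.lower()
    if any(w in task for w in ("strategy", "analyse", "analyze")):
        return "thinking"
    if any(w in task for w in ("grammar", "spellcheck", "format")):
        return "checking"
    return "doing"

def estimate_cost(task: str, tokens: int) -> float:
    tier = TIERS[pick_tier(task)]
    return tokens / 1000 * tier["price_per_1k_tokens"]

# The same 10,000-token job, 10× apart in cost:
print(estimate_cost("draft a market entry strategy", 10_000))  # 0.5 (thinking)
print(estimate_cost("fix grammar in this email", 10_000))      # 0.05 (checking)
```

The design point: you choose the tier per task, and the default ("doing") should cover most of your work.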
How NYMG Uses AI
- Started on helpdesk → wasn't reliable
- Moved into technical operations and AI-assisted coding workflows
- A jump to newer Claude generations was the turning point for coding productivity
- Claude Code let Charlotte do tech tasks without coding knowledge
- Dev team shifted: writing code → reviewing AI-written code
- Bart uses ChatGPT for helpdesk; Charlotte & Daniëlle use Claude Code
Passive AI vs Active AI
Passive: You go to it. Ask a question, get an answer. (ChatGPT, Claude)
Active: It can take initiative - use tools, remember past conversations, work through steps independently.
- Have you noticed different results when using different AI tools? What was different?
- Which of the AI tools mentioned (ChatGPT, Claude) are you most curious to try?
- Can you think of tasks in your daily work that could use a 'checking model' vs a 'thinking model'?
- What does 'passive AI' vs 'active AI' mean to you based on what you heard?
Day 2 Complete!
You now know who builds AI models, why different tools exist for different jobs, and how to spot agent washing. Tomorrow we cross the line from passive to active AI.
From Chatbot to Agent
Day 3 is locked
Complete Day 2 flashcards to unlock.
Listen First
Today we cover the most important concept of the week: the difference between a chatbot and an agent. This is where passive AI becomes active AI - and where everything clicks.
What You'll Learn
- Chatbot vs agent
- Tools (the agent's hands)
- Memory & autonomy
- System prompts explained
- Guardrails & safety
- The agent loop
- Workflow automation
- Orchestrators
Bonus: What is an Agent?
A short explainer. The anatomy of an agent in 6 building blocks: model, memory, skills, tools, guardrails, identity.
Study the Visual
The key shift: from a chatbot that answers, to an agent that acts.
- Agentic = able to take action. A chatbot talks, an agent acts. Watch out for agent washing.
- Three pillars: tools (capabilities), memory (short-term + long-term), guided autonomy (freedom within a pipeline)
- Tools are individual actions. Skills are structured instructions that orchestrate multiple tools for complex tasks.
- Guardrails keep agents safe - rules + pipeline checkpoints. Joey's duplicate story proves why.
- 2026 = the agentic shift. Every major company is building agent capabilities. NYMG is part of this.
Chatbot vs Agent
A chatbot is passive AI - you ask, it answers, but it can't take action. An agent is active AI that can use tools, remember past interactions, and work autonomously toward goals.
The Three Pillars of an Agent
| Pillar | What It Means | Example |
|---|---|---|
| Tools | External capabilities to interact with the world | Browse web, edit files, send messages |
| Memory | Stores and recalls past conversations | Remembers your preferences from last week |
| Autonomy | Plans its own steps without instruction | Figures out how to complete a goal independently |
System Prompts
Hidden instructions that define an AI's persona, rules, and boundaries. The user never sees them, but they shape everything. The same AI becomes a completely different tool with different system prompts - that's how Claude becomes Claude Code, or a customer support bot.
Guardrails
Predefined rules and limits that prevent an agent from taking dangerous actions. Without guardrails, autonomy is a liability. With guardrails, it's a superpower.
The Agent Loop
- Observe - take in the current situation
- Think - decide what to do next
- Act - use a tool or take a step
- Check - evaluate the result
- Repeat until the job is done
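The loop above can be sketched in code. This is a toy: real agents use an LLM for the "think" step and real tools for the "act" step; here both are hard-coded so the loop shape itself is easy to see.

```python
# Minimal sketch of the observe-think-act-check loop. The goal ("empty the
# task queue") and the "tool" are hypothetical stand-ins.

def run_agent(tasks: list[str], max_steps: int = 10) -> list[str]:
    log = []
    for _ in range(max_steps):
        # Observe: take in the current situation.
        if not tasks:                  # Check doubles as the exit condition.
            log.append("done")
            break
        # Think: decide what to do next (here: always take the first task).
        task = tasks[0]
        # Act: use a tool or take a step.
        log.append(f"handled {task}")
        # Check: the step worked, so remove the task and repeat.
        tasks.pop(0)
    return log

print(run_agent(["translate post", "update page"]))
# → ['handled translate post', 'handled update page', 'done']
```

Note the `max_steps` cap: even in a toy, you bound the loop so an agent that never finishes can't run forever.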
Workflow Automation & Orchestrators
When multiple specialist agents work together under an orchestrator, you get workflow automation - entire processes running from start to finish. At NYMG, Jørgen's mystery tool from Day 2 is an agent, and the company is moving toward specialised agents coordinated by an orchestrator.
- Think about your daily work: which tasks do you do repeatedly that follow the same steps each time?
- If you could give an AI assistant three tools to help with your job, what would they be?
- What's one task where you'd want to keep human approval (guardrails) and one where you'd trust the AI to just do it?
- How does the chatbot vs agent distinction change how you think about AI in your work?
Day 3 Complete!
You now understand the shift from chatbot to agent. Day 4 is now unlocked - time to look under the hood.
Setup
Listen First
The most NYMG-specific episode. See how Atlas, Joey, and the other agents work behind the scenes, hear real war stories, and understand how everything connects.
What You'll Learn
- Why OpenClaw (local, open-source)
- SOUL file and more - how agent identity works
- The glossary - 15 years of institutional knowledge, applied by agents automatically
- Meet Atlas - your primary day-to-day agent
- How Atlas uses skills, pipelines, and approval checkpoints
- How glossary knowledge is applied across markets
- Memory, heartbeats, and cron jobs (at a practical level)
- War stories - what went wrong and why
- Anthropomorphism - agents have names, not feelings
Study the Visual
Three visuals: how an agent is built, the overall system, and Atlas's content pipeline.
How is the "system prompt" configured in OpenClaw?
What happens when Atlas needs to translate a post into 17 languages?
What's the key difference between a guardrail-as-instruction and a guardrail-as-code?
📌 Key Takeaways
- Local = control. OpenClaw runs on Charlotte's Mac Mini. Your data stays in the building.
- System prompt = files. SOUL.md (personality), AGENTS.md (rules), USER.md (comms), MEMORY.md (context), TOOLS.md (tool guidance). Not a single text box.
- Atlas is your main agent. It handles content updates, translations, and quality checks for your market. Joey handles social media in a separate workflow.
- Skills auto-load. The right skill activates when the task matches - the agent doesn't need to be told which skill to use.
- Memory persists. pgmemory auto-captures and auto-recalls. Compaction summarises long conversations.
- Heartbeats = proactive. Agents check in without being asked. They work while you sleep.
- Guardrails: instruction vs code. Instructions can be ignored. Code gates physically block bad actions. We're upgrading from instructions to code.
- Things go wrong. Duplicate posts, ignored stops, crashes. That's normal. That's why agents need humans.
OpenClaw - A local, open-source platform running on Charlotte's Mac Mini as a separate user. Multi-agent by design. Data stays on our hardware.
System Prompt in Practice - Not a text box. Workspace files: SOUL.md defines personality, AGENTS.md sets rules, USER.md teaches communication style, MEMORY.md provides bootstrap context, and TOOLS.md explains tool usage.
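The "files, not a text box" idea can be sketched as a small loader. The file names match the convention listed above; the loading order and the function itself are assumptions for illustration, not OpenClaw's actual implementation.

```python
from pathlib import Path

# Sketch: assembling an agent's system prompt from workspace files.
# The file list mirrors the convention above; order is an assumption.
PROMPT_FILES = ["SOUL.md", "AGENTS.md", "USER.md", "MEMORY.md", "TOOLS.md"]

def build_system_prompt(workspace: Path) -> str:
    """Concatenate whichever prompt files exist, each under its own header."""
    parts = []
    for name in PROMPT_FILES:
        f = workspace / name
        if f.exists():
            parts.append(f"# {name}\n{f.read_text()}")
    return "\n\n".join(parts)
```

Because the prompt is plain files, you can version it, review changes, and give two agents different personalities by giving them different workspaces.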
The Agents - Atlas 🗺️ is the one country managers work with most (content ops, 17 languages, sub-agents per language). Joey 🗽 runs social workflows with Jørgen. Other technical agents exist in the background, but your day-to-day interaction is mainly Atlas.
Skills - Structured instructions that auto-load when tasks match. Atlas has 18 skills. Shared library means consistent processes across agents.
Memory - Short-term = current conversation (context window). Compaction = summarising when it gets long. Long-term = pgmemory (PostgreSQL, auto-capture, auto-recall).
Heartbeat - Proactive scheduled check-ins. Health checks, inbox monitoring. Joey: weekends only. Powered by cron jobs (scheduled tasks).
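Joey's weekends-only heartbeat is just a schedule rule. In practice this lives in a cron entry (for example `0 * * * 6,0` means "at minute 0 of every hour on Saturday and Sunday"); the Python below expresses the same rule so the logic is readable. The function name is a hypothetical stand-in.

```python
import datetime

def heartbeat_due(now: datetime.datetime) -> bool:
    """Weekends only, mirroring Joey's setup. weekday(): Mon=0 ... Sun=6."""
    return now.weekday() >= 5

print(heartbeat_due(datetime.datetime(2025, 1, 4)))  # a Saturday → True
```

The cron daemon handles the "check in every hour" part; the rule decides whether this agent should act when the check fires.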
Pipelines (LangGraph) - Designed step sequences with checkpoints. Joey's posting pipeline. Atlas's translation pipeline. Hard gates physically block bad actions.
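The difference between a guardrail-as-instruction and a guardrail-as-code comes down to this: an instruction can be ignored, but a code check physically blocks the action. A minimal sketch, with a hypothetical publish tool echoing the duplicate-post war story:

```python
# Sketch of a "hard gate": the duplicate check runs in code, before the
# action, so the agent cannot talk its way past it. Names are hypothetical.

class GuardrailViolation(Exception):
    """Raised when a hard gate blocks an action."""

def publish_post(content: str, already_published: set[str]) -> None:
    # Hard gate: a duplicate post raises before anything is sent.
    if content in already_published:
        raise GuardrailViolation("duplicate post blocked")
    already_published.add(content)
    # ... the actual publish call would go here ...
```

A system-prompt rule saying "never post twice" is the instruction version of the same guardrail - useful, but skippable. The gate is not.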
Fallback Chains - Automatic model switching when the primary AI provider is down. Ensures agents keep working.
War Stories - Duplicate reel (health monitor killed Joey mid-upload). Ignored STOP command (instruction vs code guardrail). These examples show why process and human review matter.
Glossary - Years of institutional knowledge, now stored in a pgmemory glossary table with 7,283 entries. Agents query what they need instead of loading giant per-language dumps.
- Which agent will you be working with most directly? What does it do?
- What's the difference between how Joey handles social media and how you'd do it manually?
- Why does it matter that NYMG's agents run locally instead of in the cloud?
- If Atlas makes a mistake in your language, what should you do?
🗺️ Meet Atlas - Your Content Agent
Atlas is the agent you'll be working with most. It lives in your Slack channel and handles content updates, translations, and quality checks for your market.
e.g., #atlas-nl for Dutch, #atlas-fi for Finnish, #atlas-de for German
Your First Task
🚧 Your first guided task will be provided in the meeting. Charlotte will walk you through it step by step.
⚠️ What to Do When Things Go Wrong
- Wrong translation? Tell Atlas directly: "That's incorrect. It should be [correct term]."
- Atlas isn't responding? Wait 30 seconds, then try again. If still nothing, message Charlotte.
- Atlas did something you didn't ask? Tell it to stop and revert. Then flag it to Charlotte.
- Not sure if it's right? Ask Atlas to show you what it changed before approving.
Day 4 Complete!
You now know how the agents work under the hood. Day 5 is now unlocked - the practical reality of working with them.
🎁 Bonus: Atlas Content Pipeline
A detailed look at every step Atlas follows when updating your pages.
Along for the Ride
Final lesson
This final day is the operator's guide: how to brief agents clearly, how to verify their work, what they can see, and how your corrections make the system better over time.
What You'll Learn
- Be specific, not clever — avoid assumption and ambiguity
- Trust, but verify — always check the output
- Your feedback trains the system permanently
- What the agent can see — privacy and boundaries
- When things go wrong — practical troubleshooting
- Philosophy: try it, watch carefully, improve every week
Team Reflection
Bring these questions to your next team meeting. There are no right answers — just honest ones.
- What surprised you most across all 5 days?
- What's one thing you'll do differently starting next week?
- Where do you see the biggest opportunity for agents in your specific role?
- What's your biggest remaining concern or question?
You've completed the Agentic Academy!
Five days of learning, from "what is AI?" to working alongside agents every day. The goal now: informed use, better reviews, and clearer workflows.