Prompt Playbook: AI Brand Voice PART 2

Prompt Playbook: AI Brand Voice

Want to build an audience as an AI Authority? A 30 Day content production accelerator to get you started in short form video and building your audience.

I’ll show you exactly how I built a 250,000+ person $300,000 audience in a year.

First run price: £100. Limited to 100 people only. 30 Days.

Starts Next Week.

Hey Prompt Entrepreneur,

Let’s talk about source data.

It’ll be fun - I promise! OK…fine it’s a bit dry. But it’s super important!! Sorry!

And when it goes wrong it’s pretty funny.

I worked with someone who was horrified by what their brand new AI tool was producing. They'd spent weeks building a system to write social media posts in their company's voice, but something was... off.

Every single post ended with "Thank you for reaching out! How would you rate my response today?” or some variant.

Bizarre, right? Totally inappropriate for social media.

After digging into their training data, we discovered the culprit. Instead of feeding the AI with their best marketing content, they'd used thousands of customer service chat logs.

The AI was faithfully replicating the tone and structure of those support conversations - complete with that signature service agent sign-off.

It was doing it’s job perfectly well! But it had been given the wrong information to work from. Not it’s fault really!

This is a perfect example of the cardinal rule of AI training: Garbage In, Garbage Out (GIGO). Your AI assistant can only be as good as the content you feed it.

Let's get started:

Summary

Garbage in, Garbage Out

  • The GIGO principle of AI training

  • Purpose-driven content collection

  • Personal vs. company voice considerations

  • Your content inventory checklist

  • Tools and methods for efficient collection

GIGO: Garbage In, Garbage Out

In the world of AI, there's a simple but powerful principle: what you put in determines what you get out.

Feed your tool outdated, off-brand, or irrelevant content, and you'll get outdated, off-brand, or irrelevant outputs. It's that simple.

Most of us get this. But that doesn’t necessarily mean we know how to action it. This Part of the Playbook is here to give you a specific action plan and a checklist. Converting best intentions into actually solid data.

This is especially important for brand voice. When you're building an AI system to replicate a tone or a voice, the examples you provide are everything.

The AI has zero inherent understanding of what makes your voice unique - it can only analyse and replicate patterns from the content you provide. Our source data is everything.

Purpose-Driven Collection

Before you start gathering content, you need to answer a crucial question: What exactly will you use this AI voice model for?

Different use cases require different types of content:

  • Social media posts need casual, engaging content examples

  • Blog articles need more in-depth, informative content

  • Customer service responses need empathetic, helpful language

  • Internal communications need clarity and appropriate formality

This alignment is critical.

Yes you can make a general purpose assistant that can do all the outputs.

But guess what? It’s not going to be as good as focused individual tools.

If you must combine everything into one tool you’ll need to label your inputs explicitly (ie. making it clear what is a transcript, what is a blog article, what is from an interview) and then also adjust your output prompts to specifically use certain sources.

It’s doable! But adds complexity. For now I’d recommend creating focused single purpose tools - one for social media, one for newsletters, one for email responses, one for customer service etc. etc.

Personal vs. Company Voice

Next consideration is whether this is using personal or company tonality. This questions comes after the usage question from before. We need to define usage first then what type of tonality.

The source collection process differs significantly depending on whether you're capturing your personal voice or a company's brand voice.

For your personal voice:

  • The process is simpler - any authentic content you've created works

  • Focus on content where your natural voice shines through

  • Include both formal and casual examples for flexibility

  • Consider how your voice changes across different contexts

For a company voice:

  • Be more selective and strategic

  • Identify what tonality the company wants to project

  • Get stakeholders to provide their "gold standard" examples

  • Consider brand guidelines

Content Inventory Checklist

OK those are the two main factors in play - purpose and tonality.

To help you create a tailored collection checklist, I've created this prompt that you can use with ChatGPT or Claude:

You are an AI voice training expert helping me collect content for an AI brand voice project. Based on my specific needs, create a detailed content collection checklist.

Ask me questions to determine the following:
- Purpose: What will the AI voice be used for? E.g., "Writing social media posts" or "Creating blog articles"
- Voice type: Personal or company voice

Then generate a list of potential content sources that'll be used as examples to capture brand voice. 

For each content type in your checklist, please include:
1. Description of what to look for
2. Why this content type is valuable
3. Minimum recommended quantity
4. Specific elements to pay attention to
5. Red flags or content to avoid

This prompt will generate a customised collection checklist tailored to your specific needs.

Obviously the next question is how to extract each type of data. And honestly - it depends a lot depending on what it is! Let me quickly run through the main options!

Website Copy Extraction Methods

Your company website is often the most polished representation of your brand voice. Here's how to capture it effectively:

  1. Manual copy/paste: For smaller sites, simply copy and paste content into a document, organising by page type. Works perfectly fine and AI can strip out the “formatting” elements no problem.

  2. Web scrapers: For larger sites, tools like Octoparse or ParseHub can extract all text content automatically. I used Octoparse personally.

  3. Browser extensions: SingleFile or Save Page WE can save entire webpages with their structure intact.

Blog Content Collection

Blog articles often contain the richest examples of your brand voice in action. They're typically longer-form content that addresses topics in depth.

Generally manually copy/pasting isn’t viable here. A scrape works but there are some additional methods here.

Collection methods:

  1. Direct access: If you have CMS access, export articles directly

  2. RSS feeds: Use an RSS reader to collect posts

Podcast Transcription

Spoken content can provide excellent examples of natural voice patterns, especially for conversational tones. Super helpful for personal tone of voice, especially because when transcribed podcasts give your thousands of words. Here are your options for transcription:

  1. Paid services:

  • Rev.com ($1.25/min) - High accuracy with quick turnaround, human transcription

  • Trint ($25/month) - Automated with manual correction options

  • Otter.ai Good for meetings and interviews

  1. The YouTube Trick: If your content is on YouTube, here's a free hack:

  • Upload your video (privately if needed)

  • YouTube will auto-generate captions

  • Download the .srt file

  • Convert to text using an online converter

Or use OpenAI’s Whisper model via the API.

When processing transcripts, clean up filler words and false starts unless these are part of the voice you want to capture. AI can do this for you - no need to do manually.

Social Media Content Gathering

Social posts often showcase your most conversational, engaging voice. Collection approaches vary entirely by platform:

Twitter/X:

  • Use the archive download feature in settings (do this in advance as it takes a while to be processed!)

  • Tools like Tweepi or Twitonomy for more organized collection

LinkedIn:

  • Request data export from privacy settings

  • For company pages, manually collect top-performing posts

Instagram:

  • Use Creator Studio to access post copy

  • Third-party tools like Iconosquare can export captions

Facebook:

  • Page content can be exported via Creator Studio

  • Personal content via Facebook's "Download Your Information"

All of these work with text. What about video posts?

You can use Apify or similar tools to mass scrape posts.

This is how I personally do it - Apify to scrape videos and their subtitles, throw transcript of video post over to ChatGPT to clean up then send it into an Airtable. Very cost effective and you can basically strip mine a company’s post (or your own!) into a table in minutes.

Organisation System

With all this content collected, you need a system to organise it effectively:

  1. Create a central repository:

  • Google Drive folder

  • Notion database

  • Dedicated project in tools like Airtable (allows tagging etc.)

  1. Categorise by content type:

  • Create separate documents/sections for different content types

  • Include metadata for each piece (source, date, performance if known)

  1. Tag content by voice characteristics:

  • Formal vs casual

  • Persuasive vs informative

  • Technical vs simplified

  • Emotional tone (inspiring, authoritative, friendly)

This organised approach will make the next step - voice extraction - much more effective.

If you are just doing this for your own voice tool (rather than a client) you can probably get away with just dumping everything into a Google Drive. We don’t need the same level of precision and sorting because it’s all our voice. We can play more fast and loose.

Minimum Viable Collection Size

How much content is enough? Here are my recommendations:

  • For personal voice: 25-50 samples across different content types

  • For company voice: 50-100 samples across relevant channels

  • Minimum of 10,000 words total

  • At least 5 examples of each specific content type you want to generate

These are rules of thumb. Got more? Fantastic. As long as quality is solid more is generally better!

What's Next?

In Part 3, we'll take all this organised content and extract the DNA of your brand voice. We'll create powerful prompts that capture the essence of your voice and allow any AI to replicate it consistently.

Keep Prompting,

Kyle

When you are ready

Select from these simple options:

60+ AI Business Courses
✓ Instantly unlock 60+ AI Business courses ✓ Get FUTURE courses for Free ✓ Kyle’s personal Prompt Library ✓ AI Business Starter Pack Course ✓ AI Niche Navigator Course Get Premium 

AI Workshop Kit : Cohort 3 Now Closed
Deliver AI Workshops and Presentations to Businesses with my Field Tested AI Workshop Kit  Get on Waitlist

Waitlist 
Do you want to build and market your very own AI tool that people want (and will pay you for)?  Join AI Accelerator Waitlist

Anything else? Hit reply to this email and let’s chat.

If you feel this — learning how to use AI in entrepreneurship and work — is not for you → Unsubscribe here.