Overview
The content pipeline is SEO Machine’s structured approach to moving content from initial idea to published article. Each stage has specific quality gates, automatic processes, and output locations.Pipeline Stages
Stage 1: Topics
Directory:topics/
Purpose: Capture raw content ideas and topic suggestions
Format: Free-form markdown files
How to use:
Stage 2: Research
Directory:research/
Purpose: Store research briefs, SERP analysis, and content audits
Commands that output here:
/research [topic]→research/brief-[topic]-[date].md/analyze-existing [URL]→research/analysis-[topic]-[date].md/research-serp [keyword]→research/serp-[keyword]-[date].md/research-gaps→research/competitor-gaps-[date].md/research-trending→research/trending-[date].md/research-performance→research/performance-priorities-[date].md
- SEO Foundation
- Competitive Analysis
- Content Strategy
- Primary keyword (volume, difficulty)
- Secondary keywords
- Target word count
- Featured snippet opportunity
- Search intent classification
Stage 3: Drafts
Directory:drafts/
Purpose: Work-in-progress articles and their analysis reports
Commands that output here:
/write [topic]→ Multiple files (see below)/article [topic]→ Multiple files
Automatic Scrubbing
Immediately after saving, the article is automatically scrubbed to remove AI signatures:- Invisible Unicode watermarks
- Zero-width spaces (U+200B)
- Byte order marks (BOM)
- Format-control characters
- Excessive em-dashes (replaced contextually)
Automatic Quality Scoring
After scrubbing, content is automatically scored:| Dimension | Weight | Evaluation |
|---|---|---|
| Humanity/Voice | 30% | Checks for AI phrases, contractions, natural flow |
| Specificity | 25% | Looks for concrete examples, numbers, names, details |
| Structure Balance | 20% | Ensures 40-70% prose (not all lists/bullets) |
| SEO Compliance | 15% | Validates keywords, meta, links, structure |
| Readability | 10% | Flesch 60-70, grade 8-10, sentence length |
Automatic Agent Execution
If quality score ≥70, five agents analyze the content:Content Analyzer
Runs 5 Python analysis modules:
search_intent_analyzer.py- Intent classificationkeyword_analyzer.py- Density, distribution, clusteringcontent_length_comparator.py- Benchmarks vs. SERPreadability_scorer.py- Flesch scores, grade levelseo_quality_rater.py- 0-100 SEO score
drafts/content-analysis-[topic]-[date].mdSEO Optimizer
Analyzes on-page SEO:
- Keyword placement and density
- Heading structure
- Link strategy (internal/external)
- Meta elements
- Featured snippet opportunities
drafts/seo-report-[topic]-[date].mdMeta Creator
Generates variations:
- 5 meta title options (50-60 chars)
- 5 meta description options (150-160 chars)
- Recommendations for each
- SERP preview
drafts/meta-options-[topic]-[date].mdInternal Linker
Strategic linking:
- Reviews internal-links-map.md
- Identifies 3-5 relevant pages
- Provides exact placement locations
- Suggests anchor text
drafts/link-suggestions-[topic]-[date].mdStage 4: Review Required
Directory:review-required/
Purpose: Content that needs human review before proceeding
Triggers that route content here:
- Quality score less than 70 after 2 revision attempts
- Critical SEO issues detected
- Keyword stuffing risk flagged
- Readability below 8th grade or above 12th grade
- Content too short (less than 1500 words)
- Missing required elements (meta, keywords, links)
- Human editor reviews article
- Implements fixes from review notes
- Re-runs quality scorer
- If score ≥70, moves back to
drafts/
Stage 5: Published
Directory:published/
Purpose: Final, polished articles ready for publication
Commands that move content here:
- Manual move after
/optimizeapproves - Indicates content has passed all quality gates
/publish-draft
Stage 6: WordPress
Command:/publish-draft [file]
What happens:
- Reads WordPress credentials from
.env - Converts markdown to WordPress block format
- Extracts meta title, description, keywords
- Sets Yoast SEO fields via custom REST API endpoint
- Uploads via WordPress REST API
- Returns published URL
- WordPress REST API enabled
- Application password created
- Custom MU-plugin installed (
wordpress/seo-machine-yoast-rest.php) - Yoast SEO plugin installed
Alternative Paths
Rewrites Path
Directory:rewrites/
Purpose: Updated versions of existing content
Flow:
Files in rewrites/:
- Before/after word count
- Sections added/removed
- Statistics updated
- SEO improvements made
- What was preserved from original
Landing Pages Path
Directory:landing-pages/
Purpose: Conversion-optimized landing page content
Flow:
Landing page files:
above_fold_analyzer.pycta_analyzer.pytrust_signal_analyzer.pylanding_page_scorer.pylanding_performance.pycro_checker.py
Audits Path
Directory:audits/
Purpose: Content health audits and analysis reports
Command: /analyze-existing [URL]
Output: audits/[site]-audit-[date].md
Audit includes:
- Content health score (0-100)
- SEO issues identified
- Competitive positioning
- Update priority (low/medium/high/critical)
- Update scope (refresh/update/rewrite)
- Recommended improvements
Pipeline Automation
What Runs Automatically
Content Scrubbing
Removes AI watermarks immediately after writing
Quality Scoring
Evaluates content on 5 dimensions
Content Routing
Moves to drafts/ or review-required/ based on score
Agent Execution
Runs 5 analysis agents on passing content
What Requires Manual Action
Review Fixes
Editing content in review-required/
Agent Review
Reading and acting on agent recommendations
Optimization
Running /optimize before publishing
Publication
Moving to published/ and running /publish-draft
Pipeline Best Practices
Do’s
Trust the quality gates - If content goes to review-required/, there’s a reason
Read all agent reports - They contain actionable insights
Address critical issues first - Don’t optimize until basics are correct
Keep pipeline flowing - Don’t let content sit in drafts/ indefinitely
Don’ts
Monitoring Pipeline Health
File Counts by Stage
Pipeline Metrics
Track these metrics to optimize workflow:- Research → Draft time - Should be less than 24 hours
- Draft → Published time - Target 1-2 days
- Review-required rate - Should be less than 20% of articles
- Quality score average - Target ≥75
- SEO score average - Target ≥80
The content pipeline ensures consistent quality while automating repetitive analysis tasks, letting you focus on strategy and creativity.