Analysis Framework
The agent follows a structured due diligence process:- Foundation Analysis: Company basics, team, mission
- Market Intelligence: Target market, competition, business model
- Financial Assessment: Funding, revenue indicators, growth
- Risk Evaluation: Market, technology, team, financial, regulatory risks
Prerequisites
- Python 3.12+
- OpenAI API key
- ScrapeGraph API key
Setup
1
Clone the repository
2
Create and activate virtual environment
3
Install dependencies
4
Get ScrapeGraph API key
Sign up at ScrapeGraph to get an API key.
5
Set environment variables
Run the Agent
Analyze Startup
Perform comprehensive analysis on a company:- Website crawling and data extraction
- Team and product analysis
- Risk assessment
Competitive Intel
Compare multiple companies:- Multi-company analysis
- Competitive positioning
- Market comparison
Quick Scan
Fast overview of a company:Agent Configuration
| Parameter | Purpose |
|---|---|
model | GPT for analysis and synthesis |
output_schema | Structured due diligence report |
ScrapeGraphTools | Website crawling and extraction |
enable_crawl | Multi-page site analysis |
enable_markdownify | Clean content extraction |
enable_searchscraper | External information search |
ReasoningTools | Plan research approach |
How It Works
Analysis Workflow
ScrapeGraph Tools
| Tool | Usage |
|---|---|
| Crawl | Comprehensive site analysis (first step) |
| SmartScraper | Extract specific data from pages |
| SearchScraper | Find external news and information |
| Markdownify | Convert pages to clean markdown |
Risk Categories
| Category | Examples |
|---|---|
| Market | Competition, market size, timing |
| Technology | Technical debt, scalability |
| Team | Key person dependencies |
| Financial | Runway, burn rate |
| Regulatory | Compliance, legal exposure |
Troubleshooting
ScrapeGraph API errors
ScrapeGraph API errors
Verify your API key:Check your ScrapeGraph dashboard for usage limits.
Incomplete website data
Incomplete website data
Some websites block crawlers. The agent will report what it could access and note limitations in the confidence score.
Missing funding information
Missing funding information
Funding data relies on public sources. Private companies may have limited information. Check the
due_diligence_focus field for areas needing manual research.