Data Analysis

A workflow for data analysis that runs from problem definition through visualization to final conclusions. It follows the CRISP-DM methodology and adds validation loops for data quality and EDA completeness.

mcp__moira__start({ workflowId: "data-analysis" })
flowchart LR
    A[Context] --> B[Problem]
    B --> C[Collect Data]
    C --> D[Prepare Data]
    D --> E[Explore/EDA]
    E --> F[Find Insights]
    F --> G[Visualize]
    G --> H[Conclude]

| Step | Action | Output |
| --- | --- | --- |
| 1. Get Context | Collect business question, context, data sources, constraints, audience | Context document |
| 2. Define Problem | Formulate research question, hypotheses, success criteria, scope | Problem definition |
| 3. Collect Data | Download, study structure, initial quality check | Raw dataset |
| 4. Prepare Data | Handle missing values, types, duplicates, outliers, transformations | Clean dataset |
| 5. Explore Data | Distributions, correlations, patterns, preliminary insights | EDA report |
| 6. Find Insights | Test hypotheses, answer research question, recommendations | Key insights |
| 7. Visualize | Create charts for key findings | Visualizations |
| 8. Conclude | Executive summary, findings, recommendations, limitations | Final report |

| Loop | Purpose | Criteria |
| --- | --- | --- |
| Data quality | Verify data readiness for EDA | No critical quality issues |
| EDA completeness | Verify research thoroughness | All hypotheses addressed |
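
The data quality loop can be read as a check-and-retry cycle: preparation repeats until the check reports no critical issues. The sketch below is illustrative only. `find_critical_issues`, `prepare`, and the `max_rounds` cap are hypothetical names that do not come from the workflow definition, and treating any missing value as a critical issue is an assumption made for the example.

```python
from typing import Optional

Row = dict[str, Optional[float]]


def find_critical_issues(rows: list[Row]) -> list[str]:
    """Hypothetical quality check: report every missing value as critical."""
    issues = []
    for index, row in enumerate(rows):
        for column, value in row.items():
            if value is None:
                issues.append(f"row {index}: missing {column}")
    return issues


def prepare(rows: list[Row]) -> list[Row]:
    """Hypothetical preparation pass: drop rows that contain missing values."""
    return [row for row in rows if all(v is not None for v in row.values())]


def data_quality_loop(rows: list[Row], max_rounds: int = 3) -> list[Row]:
    """Repeat preparation until the quality check passes or rounds run out."""
    for _ in range(max_rounds):
        if not find_critical_issues(rows):
            return rows  # criteria met: no critical quality issues
        rows = prepare(rows)
    if find_critical_issues(rows):
        raise ValueError("data still has critical quality issues")
    return rows


raw = [{"price": 10.0, "qty": 2.0}, {"price": None, "qty": 1.0}]
clean = data_quality_loop(raw)
```

Capping the number of rounds keeps the loop from cycling forever when an issue cannot be fixed automatically. In that case the sketch raises an error so the problem surfaces instead of passing silently into EDA.
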

| Gate | Decision |
| --- | --- |
| Problem definition | Confirm research question and scope |
| Conclusions | Approve final findings and recommendations |

| Standard | Description |
| --- | --- |
| Reproducibility | Analysis can be repeated with same results |
| Validity | Methods appropriate for data and question |
| Relevance | Findings address the business question |
| Clarity | Results understandable by audience |
| Actionability | Recommendations are practical |

| Task | Action |
| --- | --- |
| Missing values | Identify, understand, handle appropriately |
| Data types | Verify and correct column types |
| Duplicates | Detect and remove if necessary |
| Outliers | Identify, investigate, handle |
| Transformations | Apply needed transformations |

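
As a rough illustration of the preparation tasks above, the following sketch applies each one to a small made-up dataset using only the Python standard library. The records, the function names, median imputation, and the 1.5 * IQR outlier rule are all assumptions chosen for the example. The workflow itself does not prescribe any particular technique, and the right handling depends on why values are missing or extreme.

```python
import statistics

# Hypothetical raw records: every field arrives as text, as from a CSV export.
raw_rows = [
    {"id": "1", "amount": "10.5"},
    {"id": "2", "amount": ""},        # missing value
    {"id": "2", "amount": ""},        # exact duplicate of the previous row
    {"id": "3", "amount": "11.0"},
    {"id": "4", "amount": "9.5"},
    {"id": "5", "amount": "10.0"},
    {"id": "6", "amount": "500.0"},   # suspicious outlier
]


def to_typed(row):
    """Data types: convert id to int and amount to float (None if missing)."""
    amount = float(row["amount"]) if row["amount"] else None
    return {"id": int(row["id"]), "amount": amount}


def drop_duplicates(rows):
    """Duplicates: keep the first occurrence of each identical row."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def fill_missing(rows):
    """Missing values: impute with the median of the observed amounts."""
    median = statistics.median(r["amount"] for r in rows if r["amount"] is not None)
    return [{**r, "amount": median if r["amount"] is None else r["amount"]} for r in rows]


def flag_outliers(rows):
    """Outliers: flag amounts outside 1.5 * IQR rather than deleting them."""
    q1, _, q3 = statistics.quantiles([r["amount"] for r in rows], n=4)
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    return [{**r, "outlier": not (low <= r["amount"] <= high)} for r in rows]


typed = [to_typed(r) for r in raw_rows]
prepared = flag_outliers(fill_missing(drop_duplicates(typed)))
```

Flagging outliers instead of dropping them matches the "identify, investigate, handle" wording: the decision to keep or remove a value is left to a later, explicit step.
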
{
  "id": "explore-data",
  "type": "agent-directive",
  "directive": "Perform exploratory data analysis. Examine distributions, correlations, and patterns. Document preliminary insights.",
  "completionCondition": "EDA complete with distributions, correlations analyzed and preliminary insights documented",
  "connections": {
    "next": "validate-eda"
  }
}
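
The step definition above links to its successor through `connections.next`. One way to sanity-check such links is sketched below. This is not the workflow engine's own validation: the list wrapper, the `validate-eda` node with its `"validation"` type, the `REQUIRED_KEYS` set, and the `check_steps` helper are hypothetical and exist only to make the example self-contained.

```python
import json

# The explore-data step (text abbreviated) plus a hypothetical validate-eda
# node. The "validation" type and the list wrapper are assumptions.
workflow_json = """
[
  {
    "id": "explore-data",
    "type": "agent-directive",
    "directive": "Perform exploratory data analysis.",
    "completionCondition": "EDA complete",
    "connections": {"next": "validate-eda"}
  },
  {
    "id": "validate-eda",
    "type": "validation",
    "connections": {}
  }
]
"""

REQUIRED_KEYS = {"id", "type", "connections"}  # assumed minimum, not a schema


def check_steps(steps):
    """Return a list of problems: missing keys or dangling `next` targets."""
    known_ids = {step.get("id") for step in steps}
    problems = []
    for step in steps:
        missing = REQUIRED_KEYS - step.keys()
        if missing:
            problems.append(f"{step.get('id')}: missing {sorted(missing)}")
        target = step.get("connections", {}).get("next")
        if target is not None and target not in known_ids:
            problems.append(f"{step.get('id')}: unknown next step {target!r}")
    return problems


steps = json.loads(workflow_json)
problems = check_steps(steps)
```

With both nodes present, `problems` is empty. Removing the `validate-eda` node would leave `explore-data` pointing at a step that does not exist, which the check reports.
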