$ loading_

explore-data — askskill

$ ~/registry/skill/anthropics-data-skills-explore-data

SKILL

explore-data

快速剖析新数据集的结构、质量与分布特征，辅助后续分析决策

来源

GitHub

更新于

2026-06-07

// 安全评估低风险

仅提示词，不执行代码
开源可审计

正在进行安全审计…

凭证密钥
网络外发
代码执行
数据访问
来源供应链

// 安装

复制安装指令，让 AI 自动完成配置 · 推荐新手

请帮我安装 askskill 上的 "explore-data" 技能：
1. 下载 https://raw.githubusercontent.com/anthropics/knowledge-work-plugins/main/data/skills/explore-data/SKILL.md
2. 保存为 ~/.claude/skills/explore-data/SKILL.md
3. 装好后重载技能，告诉我可以用了

// 下载

下载 SKILL.md机读安装清单 ↗

// 用法示例

检查新表质量

输入

请分析这份数据表的基本结构，包括行数、列数、字段类型、缺失率、重复记录情况，以及每列的主要取值分布，并指出明显的数据质量问题。

预期产出

一份数据概况报告，包含结构统计、缺失与重复分析、异常值提示和质量风险总结。

识别异常字段

输入

帮我检查这份数据里哪些字段存在可疑值，例如异常极值、不合理日期、格式不一致或类别拼写混乱，并按风险高低排序说明。

预期产出

按字段列出的异常排查结果，说明问题类型、示例值和优先处理建议。

确定分析维度指标

输入

基于这份数据的字段内容，帮我判断适合做分析的维度和指标，并建议可以优先探索的几个业务问题或图表方向。

预期产出

一份分析规划建议，包含可用维度、核心指标、优先问题和推荐可视化方向。

// 文档

/explore-data - Profile and Explore a Dataset

If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md.

Generate a comprehensive data profile for a table or uploaded file. Understand its shape, quality, and patterns before diving into analysis.

Usage

/explore-data <table_name or file>

Workflow

1. Access the Data

If a data warehouse MCP server is connected:

Resolve the table name (handle schema prefixes, suggest matches if ambiguous)
Query table metadata: column names, types, descriptions if available
Run profiling queries against the live data

If a file is provided (CSV, Excel, Parquet, JSON):

Read the file and load into a working dataset
Infer column types from the data

If neither:

Ask the user to provide a table name (with their warehouse connected) or upload a file
If they describe a table schema, provide guidance on what profiling queries to run

2. Understand Structure

Before analyzing any data, understand its structure:

// 同源资产

技能

nextflow-development

运行 nf-core/Nextflow 流水线，完成 RNA-seq、变异检测与 ATAC-seq 数据分析

anthropics装→

技能

cowork-plugin-customizer

为特定组织定制 Claude Code 插件配置、连接器与工作流适配方案。

anthropics装→

// 功能相似

MCP 工具

MCP Analyst

帮助用户直接分析本地 CSV 或 Parquet 大数据文件并生成洞察。

—装→

MCP 工具

scherlok

自动分析数仓数据质量、识别异常并为 CI 提供只读质量门禁。

—装→

min, max, mean, median (p50)
standard deviation
percentiles: p1, p5, p25, p75, p95, p99
zero count
negative count (if unexpected)

min length, max length, avg length
empty string count
pattern analysis (do values follow a format?)
case consistency (all upper, all lower, mixed?)
leading/trailing whitespace count

min date, max date
null dates
future dates (if unexpected)
distribution by month/week
gaps in time series

true count, false count, null count
true rate

explore-data

// 用法示例

// 文档

/explore-data - Profile and Explore a Dataset

Usage

Workflow

1. Access the Data

2. Understand Structure

// 同源资产

nextflow-development

cowork-plugin-customizer

// 功能相似

MCP Analyst

scherlok

3. Generate Data Profile

4. Identify Data Quality Issues

5. Discover Relationships and Patterns

customer-research

analyze

statistical-analysis

data-visualization

ai.myriade/myriade

metrics-review

validate-data

dataHill