Skills リサーチ

Skills Research

ナビゲーション

Skillsとは？

Skills（SKILL.md）は、AIエージェント（Claude Code、Cursor、Codexなど）に特定の能力を追加するための設定ファイルです。

詳しく見る →

リンク

検索に戻る

Qwen3 ASR — Voice Transcription

https://github.com/second-state/qwen3_asr_rs

★225更新: 1週間前Claude Code🤖 AI・機械学習モデル構築自動化 Git・PREnglish

Transcribe speech from audio files to text.

Qwen3 ASR — Voice Transcription

Transcribe speech from audio files to text.

Binary

{baseDir}/scripts/asr — Speech-to-text transcription.

Models

{baseDir}/scripts/models/Qwen3-ASR-0.6B — Speech recognition model (0.6B parameters).

Transcription

Transcribe an audio file to text.

{baseDir}/scripts/asr \
  {baseDir}/scripts/models/Qwen3-ASR-0.6B \
  <audio_file>

Parameters

Parameter	Required	Description
model_path	Yes	Path to the model directory (0.6B or 1.7B)
audio_file	Yes	Path to the audio file (any FFmpeg-supported format)

Output

Prints the transcribed text to standard output.

Example

{baseDir}/scripts/asr \
  {baseDir}/scripts/models/Qwen3-ASR-0.6B \
  recording.wav

Supported Audio Formats

Any format supported by FFmpeg: WAV, MP3, M4A, FLAC, OGG, and more. Audio is automatically resampled to 16 kHz mono internally.

Workflow

1. Identify the Audio File

Get the path to the audio file the user wants to transcribe.

2. Run the Command

Run the asr binary with the full paths to the binary and model directory.

{baseDir}/scripts/asr \
  {baseDir}/scripts/models/Qwen3-ASR-0.6B \
  /path/to/audio.wav

3. Return the Transcription

The transcribed text is printed to stdout. Return it to the user.

GitHub で開く Raw を見るマーケットで出品する →

関連スキル(🤖 AI・機械学習)

../../../plugin/skills/make-plan/SKILL.md

../../../plugin/skills/do/SKILL.md

Kysy virallista Microsoftin dokumentaatiota löytääksesi käsitteitä, opetusohjelm

Запрашивайте официальную документацию Microsoft, чтобы находить концепции,

Запитвайте официалната документация на Microsoft, за да намерите концепции,

AI・機械学習のスキルをもっと見る →