Wenxi Chen
@cwx-worst-onePh.D.ing @X-LANCE & SII. Interested in Speech and Audio Understanding & Generation.
Language Breakdown
Lines of code distribution across 4 owned repositories
I-Shaped Developer
I-shapedSpecialist — deep expertise in Python
Collaboration Network
Global Impact visualization
Repos
15
PRs
0
Growth
+18%
Top Collaborators
No collaborator data yet.
Coding Streak
Contribution activity over the past year
Bohan Li
@bovod-sjtu
Samuel Xu
@xrysamuel
Yuanzhe Chen
@qq547276542
Shan Yang
@syang1993
jingyaogong
@jingyaogong
Top Repositories
[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer
WavTTS: Towards High-Quality Zero-Shot TTS via Direct Raw Waveform Modeling
Beta version for SLAM-LLM
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
An independent Python feature port of Claude Code, entirely rewritting from scratch using oh-my-codex. Educational Purpose only.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
GLM-4-Voice 简化本地单轮&多轮推理
Open Source Impact
Contributions to external projects