arizawan/vidlizer
vidlizer extracts frames from any video, image, or PDF with ffmpeg, sends them to a vision LLM, and returns a flow array with one entry per scene. Each entry describes what happened, who was on screen, what text was visible, and what changed. If the video has audio, vidlizer transcribes the audio with Apple MLX Whisper and merges the speech into each step.
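As a rough illustration of the pipeline described above, here is a minimal Python sketch. It is not vidlizer's actual code or schema: the `FlowStep` fields, the `frame_extract_cmd` and `merge_speech` helpers, the one-frame-per-second sampling rate, and the midpoint rule for attaching speech are all assumptions made for the example. The only real external interface used is ffmpeg's standard `fps` video filter, and the command is built but not run.

```python
from dataclasses import dataclass, field


@dataclass
class FlowStep:
    """One hypothetical flow entry (field names are assumed, not vidlizer's schema)."""
    start: float                      # scene start, in seconds
    end: float                        # scene end, in seconds
    action: str                       # what happened
    people: list = field(default_factory=list)        # who was on screen
    visible_text: list = field(default_factory=list)  # text seen in the frames
    change: str = ""                  # what changed since the previous step
    speech: str = ""                  # transcript merged into this step


def frame_extract_cmd(video_path, out_pattern="frame_%04d.png", fps=1.0):
    """Build (but do not run) an ffmpeg command that samples frames at `fps`."""
    return ["ffmpeg", "-i", video_path, "-vf", f"fps={fps}", out_pattern]


def merge_speech(steps, segments):
    """Attach each (start, end, text) transcript segment to the step that
    contains the segment's midpoint. Segments outside every step are dropped."""
    for seg_start, seg_end, text in segments:
        midpoint = (seg_start + seg_end) / 2
        for step in steps:
            if step.start <= midpoint < step.end:
                step.speech = (step.speech + " " + text).strip()
                break
    return steps


steps = [
    FlowStep(0.0, 5.0, "User opens the settings page"),
    FlowStep(5.0, 10.0, "User toggles dark mode"),
]
segments = [(1.0, 2.0, "hello"), (4.0, 7.0, "world"), (8.0, 9.0, "bye")]
merged = merge_speech(steps, segments)
cmd = frame_extract_cmd("in.mp4")
```

The midpoint rule is one simple way to assign a transcript segment that straddles a scene boundary to a single step; an overlap-based rule would be an equally reasonable choice.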
Scan Scheduled
This agent is queued for security scanning. It will be graded in the next scan batch.
What We Know
- URL: https://github.com/arizawan/vidlizer
- Framework: mcp
- Sources: glama
- First Seen: Apr 30, 2026
- Repository: github.com/arizawan/vidlizer