Tag

#Video Generation

KeyFrame-Compass는 386개 기본 샘플과 키프레임 실행·전체 비디오 품질을 분리한 진단 지표로, 멀티 키프레임 비디오 생성이 자연스러움과 제어 충실도 사이에서 어떤 trade-off를 보이는지 측정하...

Sangmin Lee2026.07.20

WorldDirector는 LLM이 3D 객체·카메라 궤적을 계획하고, 이를 2D 위치 조건·appearance binding·causal chunk memory로 내려보내 장기 비디오에서 동적 객체 영속성을 유...

Sangmin Lee2026.07.04

Meituan LongCat과 Fudan University가 공개한 WBench는 289개 테스트 케이스와 1,058개 상호작용 턴으로 비디오 월드 모델의 렌더링, 설정 준수, 상호작용, 일관성, 물리성을 함께...

Sangmin Lee2026.05.28

NVIDIA LongLive-2.0은 Balanced SP, NVFP4 학습·추론, KV-cache 양자화, asynchronous VAE decoding을 묶어 긴 비디오 생성의 학습 비용과 실시간 추론 병목을...

Sangmin Lee2026.05.20

ByteDance의 Lance는 3B active parameter급 native unified multimodal model로, 이미지·비디오 이해, 생성, 편집을 shared interleaved contex...

Sangmin Lee2026.05.20

SANA-WM은 Hybrid GDN-Softmax attention, 6-DoF camera control, long-video refiner, pose annotation pipeline을 묶어 720p 60초...

Sangmin Lee2026.05.18