Moviescc
[6] McInnes, L., Healy, J., & Astels, S. (2017). HDBSCAN: Hierarchical density based clustering. JOSS . : Sample scene embeddings (t-SNE visualization) and confusion matrix are available in the supplementary material. End of paper.
[3] Rao, A., et al. (2020). SceneFormer: Inductive bias for video scene segmentation. ECCV . moviescc
Existing video classification models (e.g., VideoBERT, TimeSformer) often treat movies as generic video streams, ignoring unique cinematic structures such as shot transitions, pacing, and narrative tropes. fills this gap by focusing specifically on scene-level classification —the atomic narrative unit typically comprising multiple shots unified by time and location. [6] McInnes, L
: 87.4% Macro F1-score : 0.85