IDT Lab

Publications - Journals

M. Ntrougkas, V. Mezaris, I. Patras, "P-TAME: Explain Any Image Classifier with Trained Perturbations", IEEE Open Journal of Signal Processing, vol. 6, pp. 536-545, 2025. DOI:10.1109/OJSP.2025.3568756. Software available at https://github.com/IDT-ITI/P-TAME.

K. Tsigos, E. Apostolidis, V. Mezaris, "An Integrated Framework for Multi-Granular Explanation of Video Summarization", Frontiers in Signal Processing, vol. 4, 2024. DOI:10.3389/frsip.2024.1433388. The accepted version of the paper is available at http://arxiv.org/abs/2405.10082.

M. Ntrougkas, N. Gkalelis, V. Mezaris, "T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers", IEEE Access, 2024. DOI:10.1109/ACCESS.2024.3405788. The accepted version of the paper is available at https://arxiv.org/abs/2403.04523. Software available at https://github.com/IDT-ITI/T-TAME.

L. Nixon, K. Apostolidis, E. Apostolidis, D. Galanopoulos, V. Mezaris, B. Philipp, R. Bocyte, "AI and data-driven media analysis of TV content for optimised digital content marketing", Multimedia Systems Journal (Springer), vol. 30, art. 25, 2024. DOI:10.1007/s00530-023-01195-7. [SharedIt link]

M. Papadogiorgaki, N. Grammalidis, A. Grammatikopoulou, K. Apostolidis, E.S. Bei, K. Grigoriadis, S. Zafeiris, G. Livanos, V. Mezaris, M.E. Zervakis, "An Integrated Support System for People with Intellectual Disability", Electronics, vol. 12, no. 18:3803, Sept. 2023. DOI:10.3390/electronics12183803.

K. Apostolidis, V. Mezaris, M. Papadogiorgaki, E.S. Bei, G. Livanos, M.E. Zervakis, "Content and Other Resources Recommendations for Individuals with Intellectual Disability: A Review", Electronics, vol. 11, no. 21:3472, Oct. 2022. DOI:10.3390/electronics11213472.

N. Gkalelis, D. Daskalakis, V. Mezaris, "ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network", IEEE Access, vol. 10, pp. 108797-108816, 2022. DOI:10.1109/ACCESS.2022.3213652. Software available at https://github.com/bmezaris/ViGAT.

L. Nixon, J. Foss, K. Apostolidis, V. Mezaris, "Data-driven personalisation of Television Content: A Survey", Multimedia Systems Journal (Springer), vol. 28, no. 6, pp. 2193-2225, 2022. DOI:10.1007/s00530-022-00926-6.

E. Apostolidis, E. Adamantidou, A. Metsai, V. Mezaris, I. Patras, "Video Summarization Using Deep Neural Networks: A Survey", Proceedings of the IEEE, vol. 109, no. 11, pp. 1838-1863, Nov. 2021. DOI:10.1109/JPROC.2021.3117472. The accepted version of the paper is available at https://arxiv.org/abs/2101.06072.

E. Apostolidis, E. Adamantidou, A. Metsai, V. Mezaris, I. Patras, "AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization", IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 8, pp. 3278-3292, Aug. 2021. DOI:10.1109/TCSVT.2020.3037883. Software available at https://github.com/e-apostolidis/AC-SUM-GAN.

Publications - Conferences

A. Goulas, D. Galanopoulos, E. Apostolidis, V. Mezaris, "Sens-VisualNews: A Benchmark Dataset for Sensational Image Detection", Proc. 2026 IEEE Int. Conf. on Image Processing (ICIP 2026), Tampere, Finland, Sept. 2026. Preprint: http://arxiv.org/abs/2605.10394. Software and dataset available at https://github.com/IDT-ITI/Sens-VisualNews.

D. Galanopoulos, V. Mezaris, "A Test-time Actor-Critic Approach to News Images Generation", Proc. 2026 Multimedia Evaluation Workshop (MediaEval'26), Amsterdam, NL, June 2026. Preprint: http://arxiv.org/abs/2606.21304.

N. Pantelidis, E. Kosmidou, D. Galanopoulos, D. Georgalis, S. Pasios, K. Apostolidis, A. Goulas, M. Pegia, G. Tsionkis, K. Gkountakos, G. Kouvrakis, A. Moumtzidou, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE in VBS 2026", Proc. 32nd Int. Conf. on MultiMedia Modeling (MMM 2026), Prague, CZ, Jan. 2026. DOI:10.1007/978-981-95-6963-2_24.

A. Goulas, D. Galanopoulos, I. Patras, V. Mezaris, "MLLM Frame Subset Ensembling for Audio-Visual Video QA and MLLM-based Reranking for Ad-hoc Video Search in TRECVID 2025", Proc. TRECVID 2025 Workshop, Dec. 2025. [AVS slides] [VQA slides]

E. Palogiannidi, S. Legkas, D. Vogiatzis, V. Koutsoupia, M. Mylonas, V. Mezaris, S. Markou, G. Zissis, P. Theodosiou, "The MediaPot Platform for News Analytics", Proc. 20th Int. Workshop on Semantic and Social Media Adaptation and Personalization (SMAP 2025), Mystras, Greece, Nov. 2025. DOI:10.1109/SMAP66932.2025.00011.

M. Mylonas, E. Apostolidis, V. Mezaris, "SD-VSum: A Method and Dataset for Script-Driven Video Summarization", Proc. ACM Multimedia (ACM MM 2025), Dublin, Ireland, Oct. 2025. DOI:10.1145/3746027.3755821. https://arxiv.org/abs/2505.03319. Software and dataset available at https://github.com/IDT-ITI/SD-VSum. [slides]

I. Kontostathis, E. Apostolidis, V. Mezaris, "TSalV360: A Method and Dataset for Text-driven Saliency Detection in 360-Degrees Videos", IEEE Int. Conf. on Content-Based Multimedia Indexing (CBMI 2025), Dublin, Ireland, Oct. 2025. DOI:10.1109/CBMI66578.2025.11339307. http://arxiv.org/abs/2509.26208. Software and dataset available at https://github.com/IDT-ITI/TSalV360. [slides] Runner-up for Best Paper Award

T. Eleftheriadis, E. Apostolidis, V. Mezaris, "An Experimental Study on Generating Plausible Textual Explanations for Video Summarization", IEEE Int. Conf. on Content-Based Multimedia Indexing (CBMI 2025), Dublin, Ireland, Oct. 2025. DOI:10.1109/CBMI66578.2025.11339337. http://arxiv.org/abs/2509.26225. Software available at https://github.com/IDT-ITI/Text-XAI-Video-Summaries. [slides]

D. Galanopoulos, A. Goulas, V. Mezaris, "Cross-modal Image Recommendation for News Articles by Multimodal Foundation Models-based Retrieval-Reranking", Proc. 2025 Multimedia Evaluation Workshop (MediaEval'25), Dublin, Ireland, Oct. 2025. [slides]

A. Goulas, V. Mezaris, I. Patras, "VidCtx: Context-aware Video Question Answering with Image Models", IEEE Int. Conf. on Multimedia and Expo (ICME 2025), Nantes, France, June-July 2025. DOI:10.1109/ICME59968.2025.11210080. https://arxiv.org/abs/2412.17415. Software available at https://github.com/IDT-ITI/VidCtx. [poster]

M. Tzelepi, V. Mezaris, "Improving Multimodal Hateful Meme Detection Exploiting LMM-Generated Knowledge", Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, June 2025. DOI:10.1109/CVPRW67362.2025.00025. CVF open access version. http://arxiv.org/abs/2504.09914. Software available at https://github.com/IDT-ITI/LMM-CLIP-meme. [poster]

D. Galanopoulos, A. Goulas, A. Leventakis, I. Patras, V. Mezaris, "An LLM Framework for Long-form Video Retrieval and Audio-Visual Question Answering Using Qwen2/2.5", Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, June 2025. DOI:10.1109/CVPRW67362.2025.00358. CVF open access version. [slides]

E. Palogiannidi, S. Legkas, D. Vogiatzis, M. Mylonas, V. Koutsoupia, V. Mezaris, S. Markou, G. Zissis, P. Theodosiou, "News, Social Media and Video Analytics: the MediaPot platform", Int. Conf. on Revisiting Disinformation: Critical Media Literacy Approaches (ISBN 978-618-5762-03-2), Rethymno, Crete, GR, June 2025.

K. Tsigos, E. Apostolidis, V. Mezaris, "Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated Samples", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2025), Tucson, AZ, USA, pp. 658-667, Feb. 2025. DOI:10.1109/WACVW65960.2025.00080. http://arxiv.org/abs/2502.03957. Software available at https://github.com/IDT-ITI/Adv-XAI-Deepfakes. [slides]

N. Kaparinos, V. Mezaris, "B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2025), Tucson, AZ, USA, pp. 844-853, Feb. 2025. DOI:10.1109/WACVW65960.2025.00100. http://arxiv.org/abs/2501.16917. Software available at https://github.com/IDT-ITI/B-FPGM. [slides]

I. Kontostathis, E. Apostolidis, K. Apostolidis, V. Mezaris, "Enhancing User Control in AI-Based Video Summarization for Social Media", Proc. 31st Int. Conf. on MultiMedia Modeling (MMM 2025), Nara, Japan, Jan. 2025. DOI:10.1007/978-981-96-2074-6_12.

N. Pantelidis, D. Georgalis, M. Pegia, D. Galanopoulos, K. Apostolidis, K. Stavrothanasopoulos, A. Moumtzidou, K. Gkountakos, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE in VBS 2025", Proc. 31st Int. Conf. on MultiMedia Modeling (MMM 2025), Nara, Japan, Jan. 2025. DOI:10.1007/978-981-96-2074-6_43.

M. Tzelepi, V. Mezaris, "LMM-Regularized CLIP Embeddings for Image Classification", Proc. 26th Int. Symp. on Multimedia (ISM 2024), Tokyo, Japan, Dec. 2024. DOI:10.1109/ISM63611.2024.00041. http://arxiv.org/abs/2412.11663. [slides]

K. Gkountakos, D. Galanopoulos, A. Leventakis, G. Tsionkis, K. Stavrothanasopoulos, K. Ioannidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "ITI-CERTH participation in ActEV and AVS Tracks of TRECVID 2024", Proc. TRECVID 2024 Workshop, Nov. 2024.

M. Tzelepi, V. Mezaris, "Disturbing Image Detection Using LMM-Elicited Emotion Embeddings", Proc. LVLM Workshop @ 2024 IEEE Int. Conf. on Image Processing (ICIP 2024), Abu Dhabi, UAE, Oct. 2024. DOI:10.1109/ICIPCW64161.2024.10769133. http://arxiv.org/abs/2406.12668. [slides]

M. Tzelepi, V. Mezaris, "Online Anchor-based Training for Image Classification Tasks", Proc. 2024 IEEE Int. Conf. on Image Processing (ICIP 2024), Abu Dhabi, UAE, pp. 1099-1105, Oct. 2024. DOI:10.1109/ICIP51287.2024.10648148. http://arxiv.org/abs/2406.12662. Software available at https://github.com/IDT-ITI/OAT.

L. Nixon, D. Galanopoulos, V. Mezaris, "Finding video shots for immersive journalism through text-to-video search", Proc. 21st Int. Conf. on Content-based Multimedia Indexing (CBMI), Reykjavik, Iceland, Sept. 2024. DOI:10.1109/CBMI62980.2024.10859220. https://zenodo.org/records/13791615.

L. Nixon, D. Galanopoulos, V. Mezaris, A. Hubmann-Haidvogel, D. Fischl, A. Scharl, "Video Shot Discovery through Text2Video Embeddings in a News Analytics Dashboard", Proc. 21st Int. Conf. on Content-based Multimedia Indexing (CBMI), Reykjavik, Iceland, Sept. 2024. DOI:10.1109/CBMI62980.2024.10859251. https://zenodo.org/records/13792237.

N. Pantelidis, M. Pegia, D. Galanopoulos, K. Apostolidis, D. Georgalis, K. Stavrothanasopoulos, A. Moumtzidou, K. Gkountakos, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE: Simplifying Video Search for Novice Users", Proc. 21st Int. Conf. on Content-based Multimedia Indexing (CBMI), Reykjavik, Iceland, Sept. 2024. DOI:10.1109/CBMI62980.2024.10859248.

M. Tzelepi, V. Mezaris, "Prototype Anchoring for Image Classification Tasks", Proc. 32nd European Signal Processing Conf. (EUSIPCO 2024), Lyon, France, Aug. 2024. https://ieeexplore.ieee.org/document/10715272.

I. Kontostathis, E. Apostolidis, V. Mezaris, "A Human-Annotated Video Dataset for Training and Evaluation of 360-Degree Video Summarization Methods", Proc. 1st Int. Workshop on Video for Immersive Experiences (Video4IMX-2024) at ACM IMX 2024, Stockholm, Sweden, June 2024. http://arxiv.org/abs/2406.02991. Software available at https://github.com/IDT-ITI/360-VSumm. [slides]

K. Apostolidis, J. Abesser, L. Cuccovillo, V. Mezaris, "Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol", Proc. ACM Int. Workshop on Multimedia AI against Disinformation (MAD’24) at the ACM Int. Conf. on Multimedia Retrieval (ICMR’24), Thailand, June 2024. DOI:10.1145/3643491.3660287. https://arxiv.org/abs/2405.00384. Software available at https://github.com/IDT-ITI/Visual-Audio-Discrepancy-Detection. [slides]

K. Tsigos, E. Apostolidis, S. Baxevanakis, S. Papadopoulos, V. Mezaris, "Towards Quantitative Evaluation of Explainable AI Methods for Deepfake Detection", Proc. ACM Int. Workshop on Multimedia AI against Disinformation (MAD’24) at the ACM Int. Conf. on Multimedia Retrieval (ICMR’24), Thailand, June 2024. DOI:10.1145/3643491.3660292. https://arxiv.org/abs/2404.18649. Software available at https://github.com/IDT-ITI/XAI-Deepfakes. [slides]

M. Tzelepi, V. Mezaris, "Exploiting LMM-based knowledge for image classification tasks", Proc. 25th Int. Conf. on Engineering Applications of Neural Networks (EANN/EAAAI 2024), Corfu, Greece, Springer CCIS vol. 2141, pp. 166-177, June 2024. DOI:10.1007/978-3-031-62495-7_13. http://arxiv.org/abs/2406.03071. [slides]

A. Leventakis, D. Galanopoulos, V. Mezaris, "Cross-modal Networks, Fine-Tuning, Data Augmentation and Dual Softmax Operation for MediaEval NewsImages 2023", Proc. 2023 Multimedia Evaluation Workshop (MediaEval'23), Amsterdam, NL, Feb. 2024. [slides]

K. Triaridis, V. Mezaris, "Exploring Multi-Modal Fusion for Image Manipulation Detection and Localization", Proc. 30th Int. Conf. on MultiMedia Modeling (MMM 2024), Amsterdam, NL, Springer LNCS vol. 14556, pp. 198–211, Jan.-Feb. 2024. DOI:10.1007/978-3-031-53311-2_15. https://arxiv.org/abs/2312.01790v1. Software available at https://github.com/IDT-ITI/MMFusion-IML. [slides]

I. Kontostathis, E. Apostolidis, V. Mezaris, "An Integrated System for Spatio-Temporal Summarization of 360-degrees Videos", Proc. 30th Int. Conf. on MultiMedia Modeling (MMM 2024), Amsterdam, NL, Springer LNCS vol. 14557, pp. 202–215, Jan.-Feb. 2024. DOI:10.1007/978-3-031-53302-0_15. https://arxiv.org/abs/2312.02576. Software available at https://github.com/IDT-ITI/CA-SUM-360. [slides]

E. Apostolidis, K. Apostolidis, V. Mezaris, "Facilitating the Production of Well-tailored Video Summaries for Sharing on Social Media", Proc. 30th Int. Conf. on MultiMedia Modeling (MMM 2024), Amsterdam, NL, Springer LNCS vol. 14557, pp. 271-278, Jan.-Feb. 2024. DOI:10.1007/978-3-031-53302-0_21. https://arxiv.org/abs/2312.02616.

N. Pantelidis, M. Pegia, D. Galanopoulos, K. Apostolidis, K. Stavrothanasopoulos, A. Moumtzidou, K. Gkountakos, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, B. Jonsson, "VERGE in VBS 2024", Proc. 30th Int. Conf. on MultiMedia Modeling (MMM 2024), Amsterdam, NL, Springer LNCS vol. 14557, pp. 356–363, Jan.-Feb. 2024. DOI:10.1007/978-3-031-53302-0_32.

K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2024), Waikoloa, Hawaii, USA, Jan. 2024. DOI:10.1109/WACVW60836.2024.00037. https://arxiv.org/abs/2311.16613. Software available at https://github.com/IDT-ITI/Lightweight-Face-Detector-Pruning.

D. Daskalakis, N. Gkalelis, V. Mezaris, "Masked Feature Modelling for the unsupervised pre-training of a Graph Attention Network block for bottom-up video event recognition", Proc. 25th IEEE Int. Symp. on Multimedia (ISM 2023), Laguna Hills, CA, USA, Dec. 2023. DOI:10.1109/ISM59092.2023.00047. https://arxiv.org/abs/2308.12673. Software available at https://github.com/bmezaris/masked-ViGAT. [slides]

D. Galanopoulos, V. Mezaris, "ITI-CERTH participation in AVS Task of TRECVID 2023", Proc. TRECVID 2023 Workshop, Nov. 2023.

E. Apostolidis, V. Mezaris, I. Patras, "A Study on the Use of Attention for Explaining Video Summarization", Proc. NarSUM workshop at ACM Multimedia 2023 (ACM MM), Ottawa, Canada, Oct.-Nov. 2023. DOI:10.1145/3607540.3617138.

E. Apostolidis, G. Balaouras, V. Mezaris, I. Patras, "Selecting a Diverse Set of Aesthetically-pleasing and Representative Video Thumbnails using Reinforcement Learning", IEEE Int. Conf. on Image Processing (ICIP 2023), Kuala Lumpur, Malaysia, Oct. 2023. DOI:10.1109/ICIP49359.2023.10222743. Software available at https://github.com/e-apostolidis/RL-DiVTS.

M. Papadogiorgaki, K. Apostolidis, G. Livanos, E. Bei, S. Zafeiris, G. Klados, V. Mezaris, M. Zervakis, "A Content Recommendation Platform for People with Intellectual Disability", 3rd Int. Workshop on Artificial Intelligence for Information, Communications, and Applications (AIICA 2023) at the 14th Int. Conf. on Ubiquitous and Future Networks (ICUFN), Paris, France, July 2023. DOI:10.1109/ICUFN57995.2023.10199882.

D. Galanopoulos, V. Mezaris, "Cross-modal networks and dual softmax operation for MediaEval NewsImages 2022", Proc. 2022 Multimedia Evaluation Workshop (MediaEval'22), CEUR vol. 3583, Bergen, Norway, Jan. 2023. [slides]

N. Pantelidis, S. Andreadis, M. Pegia, A. Moumtzidou, D. Galanopoulos, K. Apostolidis, D. Touska, K. Gkountakos, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE in VBS 2023", Proc. 29th Int. Conf. on Multimedia Modeling (MMM), Bergen, Norway, Springer LNCS vol. 13833, pp. 658-664, Jan. 2023. DOI:10.1007/978-3-031-27077-2_55.

M. Ntrougkas, N. Gkalelis, V. Mezaris, "TAME: Attention Mechanism Based Feature Fusion for Generating Explanation Maps of Convolutional Neural Networks", Proc. IEEE Int. Symposium on Multimedia (ISM), Naples, Italy, pp. 58-65, Dec. 2022. DOI:10.1109/ISM55400.2022.00014. http://arxiv.org/abs/2301.07407. Software available at https://github.com/bmezaris/TAME. [slides] Best Paper Award

N. Gkalelis, D. Daskalakis, V. Mezaris, "Gated-ViGAT: Efficient bottom-up event recognition and explanation using a new frame selection policy and gating mechanism", Proc. IEEE Int. Symposium on Multimedia (ISM), Naples, Italy, pp. 113-120, Dec. 2022. DOI:10.1109/ISM55400.2022.00024. http://arxiv.org/abs/2301.07565. Software available at https://github.com/bmezaris/gated-vigat. [slides]

E. Apostolidis, G. Balaouras, V. Mezaris, I. Patras, "Explaining video summarization based on the focus of attention", Proc. IEEE Int. Symposium on Multimedia (ISM), Naples, Italy, pp. 146-150, Dec. 2022. DOI:10.1109/ISM55400.2022.00029. Software available at https://github.com/e-apostolidis/XAI-SUM. [slides]

K. Gkountakos, D. Galanopoulos, D. Touska, K. Ioannidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "ITI-CERTH participation in ActEV and AVS Tracks of TRECVID 2022", Proc. TRECVID 2022 Workshop, Dec. 2022. [slides]

I. Gkartzonika, N. Gkalelis, V. Mezaris, "Learning Visual Explanations for DCNN-Based Image Classifiers Using an Attention Mechanism", Proc. ECCV 2022 Workshop on Vision with Biased or Scarce Data (VBSD), Springer LNCS vol. 13808, pp. 396-411, Oct. 2022. DOI:10.1007/978-3-031-25085-9_23. https://arxiv.org/abs/2209.11189. Software available at https://github.com/bmezaris/L-CAM. [slides]

D. Galanopoulos, V. Mezaris, "Are All Combinations Equal? Combining Textual and Visual Features with Multiple Space Learning for Text-Based Video Retrieval", Proc. ECCV 2022 Workshop on AI for Creative Video Editing and Understanding (CVEU), Springer LNCS vol. 13804, pp. 627–643, Oct. 2022. DOI:10.1007/978-3-031-25069-9_40. https://arxiv.org/abs/2211.11351. Software available at https://github.com/bmezaris/TextToVideoRetrieval-TtimesV. [slides]

E. Apostolidis, G. Balaouras, V. Mezaris, I. Patras, "Summarizing videos using concentrated attention and considering the uniqueness and diversity of the video frames", Proc. ACM Int. Conf. on Multimedia Retrieval (ICMR’22), Newark, NJ, USA, pp. 407-415, June 2022. DOI:10.1145/3512527.3531404. Software available at https://github.com/e-apostolidis/CA-SUM. [slides]

S. Andreadis, A. Moumtzidou, D. Galanopoulos, N. Pantelidis, K. Apostolidis, D. Touska, K. Gkountakos, M. Pegia, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE in VBS 2022", Proc. 28th Int. Conf. on Multimedia Modeling (MMM), Phu Quoc, Vietnam, Springer LNCS vol. 13142, pp. 530–536, June 2022. DOI:10.1007/978-3-030-98355-0_50.

E. Apostolidis, G. Balaouras, V. Mezaris, I. Patras, "Combining Global and Local Attention with Positional Encoding for Video Summarization", Proc. IEEE Int. Symposium on Multimedia (ISM), Dec. 2021. DOI:10.1109/ISM52913.2021.00045. Software available at https://github.com/e-apostolidis/PGL-SUM. [slides]

K. Apostolidis, V. Mezaris, "A Web Service for Video Smart-Cropping", Proc. IEEE Int. Symposium on Multimedia (ISM), Dec. 2021. DOI:10.1109/ISM52913.2021.00011. Software and dataset available at https://github.com/bmezaris/RetargetVid. [slides]

A. Pournaras, N. Gkalelis, D. Galanopoulos, V. Mezaris, "Combining Multiple Deep-learning-based Image Features for Visual Sentiment Analysis", Proc. MediaEval 2021 Workshop, CEUR vol. 3181, Dec. 2021.

K. Gkountakos, D. Galanopoulos, D. Touska, K. Ioannidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "ITI-CERTH participation in ActEV and AVS Tracks of TRECVID 2021", Proc. TRECVID 2021 Workshop, Dec. 2021.

A. Pournaras, N. Gkalelis, D. Galanopoulos, V. Mezaris, "Exploiting Out-of-Domain Datasets and Visual Representations for Image Sentiment Classification", Proc. 16th Int. Workshop on Semantic and Social Media Adaptation & Personalization (SMAP), Nov. 2021. DOI:10.1109/SMAP53521.2021.9610801.

E. Apostolidis, E. Adamantidou, V. Mezaris, I. Patras, "Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection", ACM Int. Conf. on Multimedia Retrieval (ICMR), Taipei, Taiwan, Nov. 2021. DOI:10.1145/3460426.3463630. Software available at https://github.com/e-apostolidis/Video-Thumbnail-Selector. [slides]

D. Galanopoulos, V. Mezaris, "Hard-negatives or Non-negatives? A hard-negative selection strategy for cross-modal retrieval using the improved marginal ranking loss", Proc. IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), pp. 2312-2316, Oct. 2021. DOI:10.1109/ICCVW54120.2021.00261. [slides]

K. Apostolidis, V. Mezaris, "A Fast Smart-Cropping Method and Dataset for Video Retargeting", Proc. 28th IEEE Int. Conf. on Image Processing (ICIP), Anchorage, Alaska, US, Sept. 2021. DOI:10.1109/ICIP42928.2021.9506390. Software and dataset available at https://github.com/bmezaris/RetargetVid. [slides]

D. Galanopoulos, E. Elejalde, A. Pournaras, C. Niederée, V. Mezaris, "Automatic and Semi-automatic Augmentation of Migration Related Semantic Concepts for Visual Media Retrieval", Proc. Workshop on Open Challenges in Online Social Networks (OASIS) @ the 32nd ACM Conference on Hypertext and Social Media (ACM HT'21), Dublin, Ireland, Aug.-Sept. 2021. DOI:10.1145/3472720.3483618. [slides]

N. Gkalelis, A. Goulas, D. Galanopoulos, V. Mezaris, "ObjectGraphs: Using Objects and a Graph Convolutional Network for the Bottom-up Recognition and Explanation of Events in Video", Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 3370-3378, June 2021. DOI:10.1109/CVPRW53098.2021.00376. Software available at https://github.com/bmezaris/ObjectGraphs. [slides]

L. Nixon, K. Apostolidis, E. Apostolidis, D. Galanopoulos, V. Mezaris, B. Philipp, R. Bocyte, "Content Wizard: demo of a trans-vector digital video publication tool", Proc. ACM Int. Conf. on Interactive Media Experiences (IMX), June 2021. DOI:10.1145/3452918.3468083.

F. Tsalakanidou, S. Papadopoulos, V. Mezaris, I. Kompatsiaris, B. Gray, D. Tsabouraki, M. Kalogerini, F. Negro, M. Montagnuolo, J. de Vos, P. van Kemenade, D. Gravina, R. Mignot, A. Ozerov, F. Schnitzler, A. Garcia-Saez, G. Yannakakis, A. Liapis, G. Kostadinov, "The AI4Media project: Use of Next-generation Artificial Intelligence Technologies for Media Sector Applications", Proc. 17th Artificial Intelligence Applications and Innovations Conference (AIAI), Crete, Greece, June 2021. DOI:10.1007/978-3-030-79150-6_7.

S. Andreadis, A. Moumtzidou, K. Gkountakos, N. Pantelidis, K. Apostolidis, D. Galanopoulos, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "VERGE in VBS 2021", Proc. 27th Int. Conf. on Multimedia Modeling (MMM2021), Prague, CZ, Springer LNCS vol. 12573, pp. 398–404, June 2021. DOI:10.1007/978-3-030-67835-7_35.

A. Apaolaza, T. Backes, S. Barthold, I. Bienia, T. Blume, C. Collyda, A. Fessl, S. Gottfried, P. Grunewald, F. Günther, T. Köhler, R. Lorenz, M. Heinz, S. Herbst, V. Mezaris, C. Nishioka, A. Pournaras, V. Sabol, A. Saleh, A. Scherp, I. Simic, A. Skulimowski, I. Vagliano, M. Vigo, M. Wiese, T. Zdolsek Draksler, "MOVING: A User-Centric Platform for Online Literacy Training and Learning", in "e-Science: Open, Social and Virtual Technology for Research Collaboration", C. Koschtial, T. Köhler, C. Felden (Eds.), Springer series "Progress in IS", pp. 77-97, 2021. DOI:10.1007/978-3-030-66262-2_6.

N. Gkalelis, V. Mezaris, "Structured Pruning of LSTMs via Eigenanalysis and Geometric Median for Mobile Multimedia and Deep Learning Applications", Proc. 22nd IEEE Int. Symposium on Multimedia (ISM), Dec. 2020. DOI:10.1109/ISM.2020.00028. Software available at https://github.com/bmezaris/lstm_structured_pruning_geometric_median. [slides]

K. Gkountakos, D. Galanopoulos, M. Mpakratsas, D. Touska, A. Moumtzidou, K. Ioannidis, I. Gialampoukidis, S. Vrochidis, V. Mezaris, I. Kompatsiaris, "ITI-CERTH participation in TRECVID 2020", Proc. TRECVID 2020 Workshop, Gaithersburg, MD, USA, Dec. 2020.

E. Apostolidis, E. Adamantidou, A. Metsai, V. Mezaris, I. Patras, "Performance over Random: A robust evaluation protocol for video summarization methods", Proc. ACM Multimedia 2020 (ACM MM), Seattle, WA, USA, Oct. 2020. DOI:10.1145/3394171.3413632. Software available at https://github.com/e-apostolidis/PoR-Summarization-Measure. [slides]

M. Zwicklbauer, W. Lamm, M. Gordon, K. Apostolidis, B. Philipp, V. Mezaris, "Video Analysis for Interactive Story Creation: The Sandmännchen Showcase", Proc. AI4TV workshop at ACM Multimedia 2020 (ACM MM), Seattle, WA, USA, Oct. 2020. DOI:10.1145/3422839.3423061.

R. Troncy, J. Laaksonen, H. Tavakoli, L. Nixon, V. Mezaris, M. Hosseini, "AI4TV 2020: 2nd International Workshop on AI for Smart TV Content Production, Access and Delivery", Proc. ACM Multimedia 2020 (ACM MM), Seattle, WA, USA, Oct. 2020. DOI:10.1145/3394171.3421894.

E. Elejalde, D. Galanopoulos, C. Niederee, V. Mezaris, "Migration-Related Semantic Concepts for the Retrieval of Relevant Video Content", Proc. Int. Workshop on Artificial Intelligence and Robotics for Law Enforcement Agencies (AIRLEAs) at the 3rd Int. Conf. on Intelligent Technologies and Applications (INTAP 2020), Gjovik, Norway, Springer CCIS, vol. 1382, pp. 404-416, Sept. 2020. DOI:10.1007/978-3-030-71711-7_34. [slides]

D. Galanopoulos, V. Mezaris, "Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks", Proc. ACM Int. Conf. on Multimedia Retrieval (ICMR 2020), Dublin, Ireland, 2020. DOI:10.1145/3372278.3390737. Software available at https://github.com/bmezaris/AVS_dual_encoding_attention_network.

N. Gkalelis, V. Mezaris, "Fractional Step Discriminant Pruning: A Filter Pruning Framework for Deep Convolutional Neural Networks", Proc. 7th IEEE Int. Workshop on Mobile Multimedia Computing (MMC2020) at the IEEE Int. Conf. on Multimedia and Expo (ICME), London, UK, July 2020. DOI:10.1109/ICMEW46912.2020.9105979. Software available at https://github.com/bmezaris/fractional_step_discriminant_pruning_dcnn. [slides]

Publications - Edited Books

S. Papadopoulos, K. Bontcheva, V. Mezaris, R. Rogers (Eds.), "Countering Disinformation in the Era of Generative AI", Springer, 2026. DOI:10.1007/978-3-032-11782-3.

Publications - Book Chapters

K. Apostolidis, V. Mezaris, "Video Decomposition and Key Visual Elements Extraction and Enhancement", in book "Countering Disinformation in the Era of Generative AI", S. Papadopoulos, K. Bontcheva, V. Mezaris, R. Rogers (Eds.), pp. 343-380, Springer, 2026. DOI:10.1007/978-3-032-11782-3_12.

D. Galanopoulos, V. Mezaris, "Cross-Modal Learning for Free-Text Video Search", Encyclopedia of Information Science and Technology, Sixth Edition, IGI Global, 2024. DOI:10.4018/978-1-6684-7366-5.ch088.

E. Apostolidis, G. Balaouras, I. Patras, V. Mezaris, "Explainable Video Summarization for Advancing Media Content Production", Encyclopedia of Information Science and Technology, Sixth Edition, IGI Global, 2023. DOI:10.4018/978-1-6684-7366-5.ch065.

For a complete list of the publications of Dr. Mezaris's group, see https://www.iti.gr/~bmezaris/publications.html