image of ai research scientist

Hi, I'm Filippos🤚

I'm a GenAI Research Scientist 🤖.

  • 💼 Research Scientist at Meta (GenAI org.)
  • 🌎 based in the UK
  • 🎓 Ph.D from UCL
  • 📧 filipposkokkinos[at]gmail[.]com

🌟 Hello! I’m Senior Research Scientist at Meta

I specialize in Multimodal Reasoning as part of the Llama Team. In this role, I delve into the fascinating world of multimodal reasoning, pushing the boundaries of how different modalities interact and enhance AI capabilities. Prior to joining Llama, I worked on multi-view video diffusion models and 3D technologies, enabling the generation of 3D assets and advanced media editing capabilities. 🎨✨ My journey also includes impactful internships at Microsoft Research Lab and Huawei Noah's Ark.

🚀 Collaborations

I’m fortunate to work alongside amazing professionals such as Andrea Vedaldi, Natalia Neverova, and Ce Liu, among other amazing members of the Llama team. Together, we’re advancing the frontiers of computer vision and multimodal reasoning.

🎓 My Mentors

👩‍🎓 Supervising Students

The Llama 3 Herd of Models

Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.

arxiv, 2024

Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation

Junlin Han, Jianyuan Wang, Andrea Vedaldi, Philip Torr, Filippos Kokkinos

arxiv, 2024

Meta 3D Gen

Raphael Bensadoun*, Tom Monnier*, Yanir Kleiman*, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi

arxiv, 2024

Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials

Yawar Siddiqui*, Tom Monnier*, Filippos Kokkinos*, Mahendra Kariya, Yanir Kleiman, Emilien Garreau, Oran Gafni, Natalia Neverova, Andrea Vedaldi, David Novotny*, Roman Shapovalov*

Neurips, 2024

DreamCraft: Text-Guided Generation of Functional 3D Environments in Minecraft

Sam Earle, Filippos Kokkinos, Yuhe Nie, Julian Togelius, Roberta Raileanu

FDG, 2024

Vfusion3d: Learning scalable 3d generative models from video diffusion models

Junlin Han, Filippos Kokkinos, Philip Torr

ECCV, 2024

IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation

Luke Melas-Kyriazi*, Iro Laina, Christian Rupprecht, Natalia Neverova, Andrea Vedaldi, Oran Gafni, Filippos Kokkinos*

ICML, 2024

Real-time volumetric rendering of dynamic humans

Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

arXiv, 2023

Linear Complexity Self-Attention With 3rd Order Polynomials

Francesca Babiloni, Ioannis Marras, Jiankang Deng, Filippos Kokkinos, Matteo Maggioni, Grigorios Chrysos, Philip Torr, Stefanos Zafeiriou

PAMI, 2023

Deep structured layers for instance-level optimization in 2D and 3D vision

Filippos Kokkinos

Ph.D Thesis

Text-To-4D Dynamic Scene Generation

Uriel Singer*, Shelly Sheynin*, Adam Polyak*, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman

ICML, 2023

Replay: Multi-modal Multi-view Acted Videos for Casual Holography

Roman Shapovalov, Yanir Kleiman, Ignacio Rocco, David Novotny, Andrea Vedaldi, Changan Chen, Filippos Kokkinos, Ben Graham, Natalia Neverova

ICCV, 2023

Poly-NL: Linear Complexity Non-local Layers with Polynomials

Francesca Babiloni, Ioannis Marras, Filippos Kokkinos, Jiankang Deng, Grigorios Chrysos, Stefanos Zafeiriou

ICCV, 2021

To The Point: Correspondence-driven monocular 3D category reconstruction

Filippos Kokkinos, Iasonas Kokkinos

Neurips, 2021

Learning monocular 3D reconstruction of articulated categories from motion

Filippos Kokkinos, Iasonas Kokkinos

CVPR, 2021

Microscopy Image Restoration with Deep Wiener-Kolmogorov filters

Valeriya Pronina, Filippos Kokkinos, Dmitry V. Dylov, Stamatios Lefkimmiatis

ECCV, 2020

Pixel Adaptive Filtering Units

Filippos Kokkinos, Ioannis Marras, Matteo Maggioni, Gregory Slabaugh, Stefanos Zafeiriou

ArXiv, 2019

Iterative joint image demosaicking and denoising using a residual denoising network

Filippos Kokkinos, Stamatios Lefkimmiatis

IEEE TIP, 2019

Iterative residual cnns for burst photography applications

Filippos Kokkinos, Stamatios Lefkimmiatis

CVPR, 2019

Iterative joint image demosaicking and denoising using a residual denoising network

Filippos Kokkinos, Stamatios Lefkimmiatis

ECCV, 2018

Βαθιά μηχανική μάθηση για κατηγοριοποίηση προτάσεων

Filippos Kokkinos

Diploma Thesis, 2017

Tweester at SemEval-2017 Task 4: Fusion of Semantic-Affective and pairwise classification models for sentiment analysis in Twitter

Athanasia Kolovou, Filippos Kokkinos, Aris Fergadis, Pinelopi Papalampidi, Elias Iosif, Nikolaos Malandrakis, Elisavet Palogiannidi, Harris Papageorgiou, Shrikanth Narayanan, Alexandros Potamianos

ACL SemEval, 2017

Structural Attention Neural Networks for improved sentiment analysis

Filippos Kokkinos, Alexandros Potamianos

EACL , 2017

Tweester at SemEval-2016 Task 4: Sentiment analysis in Twitter using semantic-affective model adaptation

Elisavet Palogiannidi, Athanasia Kolovou, Fenia Christopoulou, Filippos Kokkinos, Elias Iosif, Nikolaos Malandrakis, Harris Papageorgiou, Shrikanth Narayanan, Alexandros Potamianos

ACL SemEval, 2016