Publications – Aude Oliva

Publications (Google scholar)

Lahner, B., Deb, M., Murty, N.A.R., & Oliva, A. (2026). MOSAIC: A scalable framework for fMRI dataset aggregation and modeling of human vision. In revision, Plos Biology. biorxiv paper

Kondic, J., et al (2026). ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 26, pp. 15922-15932. Paper Data MIT News

Golbari, Y., Wasserman, N., Cosarinsky, M., Beliy, R., Oliva, A., Torralba, A., Irani, M., & Rott Shaham, T. (2026). From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain. Submitted. arXiv:2605.23895. Paper

Wasserman, N, Cosarinsky, M., Golbari, Y., Oliva, A., Torralba, A., Rott Shaham, T., & Irani, M. (2025). BrainExplore: Large-Scale Discovery of Interpretable Visual Representations in the Human Brain. submitted. Arxiv:2512.08560. Paper

Lahner, B., Luo, A.F. , Prince, J.S., Deb, M., Henderson, M.M., Pyles, J.A, Oliva, A., Ratan Murty, N.A., Wehbe, L., Tarr, M.J, (2025). CONFORM: A Project to Create Crowd-Sourced Open Neuroscience fMRI Foundation Models. Neural Information Processing Systems Workshop on Foundation Models for the Brain and Body (NeurIPS 2025). Paper

Tarr, M.J, Prince, J.S., Lahner, B., Deb, M., Oliva, A., Ratan Murty, N.A., Pyles, J.A, Henderson, M.M., Wehbe, L., Luo, A.F. (2025). CONFORM: A Project to Create Crowd-Sourced Open Neuroscience fMRI Foundation Models. Neural Information Processing Systems Workshop on Data on the Brain & Mind, Tutorials (Neurips 2025).

Kondic, J., Li, P., Joshi, D., He,Z., Abedin, S., Sun, J.Z., Wiesel, B., Schwartz, E., Nassar, A., Wu, B., Arbelle, A., Oliva, A., Gutfreund, D., Karlinsky, L., Feris, R. (2025). ChartGen: Scaling Chart Understanding via Code-Guided Synthetic Chart Generation. Presented at the Graphic Design Understanding and Generation Workshop, International Conference on Computer Vision (ICCV), Honolulu, HI, October 19, 2025. arxiv Paper

Tuckute, G., Mahowald, K., Isola, P., Oliva, A., Gibson, E., & Fedorenko, E. (2025). Intrinsically memorable words have unique associations with their meanings. Journal of Experimental Psychology: General. APA Editor Choice 2025 Paper

Granite Vision Team, IBM Research. (2025). Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence. arxiv Paper

Fosco*, C., Josephs*, E., Andonian, A., & Oliva, A. (2025). Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines. arXiv:2206.00535. arXiv Paper Website

Lahner, B., Dwivedi, K., Iamshchinina, P., Graumann, M., Lascelles, A., Roig, G., Gifford, A.T., Pan, B., Jin, S., Murty, N.A.R., Kay, K., Oliva^†, A., & Cichy^†, R.M. (2024). Modeling short visual events through the BOLD moments video fMRI dataset and metadata. Nature Communication, 15(1), 6241. Paper Project page Dataset

Fosco*, C., Lahner*, B., , Pan, B., Andonian, A., Josephs, E., Lascelles, A., & Oliva, A. (2024). Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals. European Conference on Computer Vision (ECCV), 457-474. Paper Website

Fosco, C., Lahner, B., , Pan, B., Andonian, A., Josephs, E., Lascelles, A., & Oliva, A. (2024). A Systematic Comparison of fMRI-to-video Reconstruction Techniques. First Workshop on Controllable Video Generation@ ICML24. Paper Website

Mohsenzadeh*, Y., Lahner*, B., Mullin*, C., & Oliva, A. (2024). Visual perception of highly memorable images is mediated by a distributed network of ventral visual regions that enable a late memorability response. Plos Biology, 22(4). Fusion Video Paper Website Repository News

Josephs, E., Fosco, C., & Oliva, A. (2024). Effects of browsing conditions and visual alert design on human susceptibility to deepfakes. Journal of Online Trust and Safety, 2 (2) Paper

Pan, B., Panda, R., Jin, S., Feris, R., Oliva, A., Isola, P., & Kim, Y. (2024). LangNav: Language as a Perceptual Representation for Navigation. Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Paper Code Model News

Sun, X., Panda, R., Chen, C-F., Wang, N., Pan, B., Oliva, A., Rogerio, R., & Saenko, K. (2024). Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024). Paper

Huang, I., Lin, W., Mirza, M.J., Hansen, J., Doveh, S., Butoi, V., Herzig, R., Arbelle, A., Kuehne, H., Darrell, T., Gan, C., Oliva, A., Feris, R., & Karlinsky, L. (2024). Conme: Rethinking evaluation of compositional reasoning for modern vlms. Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks, 37, 22927-22946. Paper

Wang, R., Ghosh, S., Cox, D., Antognini, D., Oliva, A., Feris, R., & Karlinsky, L. (2024). Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning. Neural Information Processing Systems (NeurIPS 2024). Paper

Pan, B., Shen, Y., Liu, H., Mishra, M., Zhang, G., Oliva, A., Raffel, C., & Panda, R. (2024). Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models. Submitted. arXiv Paper blog

Teng, S., Cichy, R.M., Pantazis, D., & Oliva, A. (2024). Touch to text: Spatiotemporal evolution of braille letter representations in blind readers. Submitted. bioRxiv paper

Gifford, A.T., Bersch, D., St-Laurent, M., Pinsard, B., Boyle, J., Bellec, L., Oliva, A., Roig, G., & Cichy, R.M. (2024). The Algonauts Project 2025 Challenge: How the Human Brain Makes Sense of Multimodal Movies. arXiv Paper Website

Pan, B., Panda, R., Feris, RS., Oliva, AJ. (2024). Interpretability-aware redundancy reduction for vision transformers. US Patent 12,154,307. Document

Panda, R., Meng, Y., Lin, CC., Feris, RS, Oliva, AJ. (2024). Dynamic multi-resolution processing for video classification. US Patent 11,954,910. Document

Cascante-Bonilla, P., Shehada, K., Smith, J.S., Doveh, S., Kim, D., Panda, R., Varol, G., Oliva A., Ordonez, V., Feris, R., & Karlinsky, L. (2023). Going Beyond Nouns With Vision & Language Models Using Synthetic Data. The 19th International Conference on Computer Vision (ICCV 2023). Paper Supplementary Material Website News

Zhong, H., Mishra, S., Kim, D., Jin, S., Panda, R., Kuehne, H., Karlinsky, L., Saligrama, V., Oliva, A., & Feris, R. (2023). Learning Human Action Recognition Representations Without Real Humans. NeurIPS 2023 Datasets and Benchmarks. Paper Website Poster

Fosco, C., Jin, S., Josephs, E., & Oliva, A. (2023). Leveraging Temporal Context in Low Representational Power Regimes. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), pp. 10693–10703 Paper Supplementary Material Website News

Gifford, A.T., Lahner, B., Saba-Sadiya, S., Vilas, M.G., Lascelles, A., Oliva, A., Kay, K., Roig, G., & Cichy, R.M. (2023). The Algonauts Project 2023 Challenge: How the Human Brain Makes Sense of Natural Scenes. arXiv, 2301.03198. arXiv Paper Website

B Pan, R Panda, CL Fosco, RS Feris, AJ Oliva. (2023). Adaptive redundancy reduction for efficient video understanding. US Patent App. 17/476,437. Document

Kim, Y., Mishra, S., Jin, S., Panda, R., Kuehne, H., Karlinsky, L., Saligrama, V., Saenko, K., Oliva, A., & Feris, R. (2022). How Transferable are Video Representations Based on Synthetic Data? Conference on Neural Information Processing Systems (NeurIPS 2022) Datasets and Benchmarks Track, 35, 35710–35723 Paper Supplementary Material News

Grauman, K. et al. (2022). Ego4D: Around the World in 3,000 Hours of Egocentric Video. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), pp. 18995–19012 arXiv Paper Paper Website

Lowe*, M., Mohsenzadeh*, Y., Lahner, B., Charest, I., Oliva^†, A., & Teng^†, S. (2022). Cochlea to categories: The spatiotemporal dynamics of semantic auditory representations. Cognitive Neuropsychology, 38(7–8), 468–489. Paper Fusion Video Webpage

Liu, A.H., Jin, S., Lai, C.J., Rouditchenko, A., Oliva, A., & Glass, J. (2022). Cross-Modal Discrete Representation Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), (Volume 1: Long Papers), 3013–3035 Paper News

Panda*, R., Chen*, C., Fan, Q., Sun, X., Saenko, K., Oliva, A., & Feris, R. (2021). AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition. International Conference on Computer Vision (ICCV 2021), (pp. 7576-7585). Paper Website

Sun, X., Panda, R., Chen, C., Oliva, A., Feris, R., & Saenko K. (2021). Dynamic Network Quantization for Efficient Video Inference. International Conference on Computer Vision (ICCV 2021), (pp. 7375-7385). Paper Website

Pan, B., Jiang, Y., Panda, R., Wang, Z., Feris, R., & Oliva, A. (2021). IA-RED²: Interpretability-Aware Redundancy Reduction for Vision Transformer. Advances in Neural Information Processing Systems (NeurIPS 2021), 34, 24898-24911 arXiv Paper Paper Website

Cichy, R.M., Dwivedi, K., Lahner, B., Lascelles, A., Iamshchinina, P., Graumann, M., Andonian, A., Murty, N.A.R., Kay, K., Roig, G., & Oliva A. (2021). The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion. arXiv, 2104.13714 arXiv Paper Website GitHub Code

Bylinskii, Z., Goetschalckx, L., Newman, A., & Oliva, A. (2021). Memorability: An image-computable measure of information utility. Chapter in Human Perception of Visual Information (pp. 207–239). Springer, Cham. arXiv Paper

Monfort*, M., Jin*, S., Liu, A., Harwath, D., Feris, R., Glass, J., & Oliva, A. (2021). Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), (pp. 14871-14881). Paper Supplementary Material Website

Chen, C., Panda, R., Ramakrishnan, K., Feris, R., Cohn, J., Oliva, A., & Fan, Q. (2021). Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), (pp. 6165-6175). arXiv Paper

Pan, B., Panda, R., Fosco, C., Lin, C., Andonian, A., Meng, Y., Saenko, K., Oliva, A., & Feris, R. (2021). VA-RED²: Video Adaptive Redundancy Reduction. Ninth International Conference on Learning Representations (ICLR 2021). Paper Website

Meng, Y., Panda, R., Lin, C., Sattigeri, P., Karlinsky, L., Saenko, K., Oliva, A., & Feris, R. (2021). AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition. Ninth International Conference on Learning Representations (ICLR 2021). Paper Website

Bylinskii, Z., Madan, S., Tancik, M., Recasens, A., Zhong, K., Alsheikh, S., Pfister, H., Oliva, A., & Durand, F. (2021). Parsing and Summarizing Infographics with Synthetically Trained Icon Proposals. In 2021 IEEE 14th Pacific Visualization Symposium (PacificVis), (pp. 31-40). arXiv Paper Website

Monfort, M., Ramakrishnan, K., Andonian, A., McNamara, B., Lascelles, A., Pan, B., Fan, Q., Gutfreund, D., Feris, R., & Oliva, A. (2021) Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding. IEEE Pattern Analysis and Machine Intelligence (PAMI), 44(12), 9434-9445. arXiv Paper Paper Website

Murty, N.A.R., Teng, S., Beeler, D., Mynick, A., Oliva, A., & Kanwisher, N. (2020). Visual experience is not necessary for the development of face selectivity in the lateral fusiform gyrus. Proceedings of the National Academy of Sciences, 117(37), 23011–23020. Paper Supp. News

Cichy, R.M. & Oliva, A. (2020). A M/EEG-fMRI Fusion Primer: Resolving Human Brain Responses in Space and Time. Neuron, 107(5), 772–781. Paper

Newman*, A., Fosco*, C., Casser, V., Lee, A., McNamara, B., & Oliva, A. (2020). Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) Paper Website

Andonian*, A., Fosco*, C, Monfort, M., Lee, A., Feris, R., Vondrick, C., & Oliva, A. (2020). We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) arXiv Paper Website News

Meng, Y., Lin, C., Panda, R., Sattigeri, P., Karlinsky, L., Oliva, A. Saenko, K., & Feris, R. (2020). AR-Net: Adaptive Frame Resolution for Efficient Action Recognition. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) arXiv Paper Website

Fosco*, C., Newman*, A., Sukhum, P., Zhang, Y., Zhao, N., Oliva, A., & Bylinskii, Z. (2020). How much time do you have? Modeling multi-duration saliency. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020) Paper Website News

Mohsenzadeh, Y., Mullin, C., Lahner, B., & Oliva, A. (2020). Emergence of Visual Center-Periphery Spatial Organization in Deep Neural Networks. Scientific Reports, 10:4638 Paper Website

Monfort, M., Andonian, A., Zhou, B., Ramakrishnan, K., Adel Bargal, S., Yan, T., Brown, L., Fan, Q., Gutfreund, D., Vondrick, C., & Oliva, A. (2020). Moments in Time dataset: one million videos for event understanding. IEEE Pattern Analysis and Machine Intelligence (PAMI), 42(2), 502–508.Paper Website

Oliva, A. (2020). Computational Models of Human Object and Scene Recognition.In The Cognitive Neurosciences, 6th Edition. Edited by Gazzaniga et al. MIT Press (pp. 151–157). Paper

Amini, L., Chen, C.-H., Cox, D., Oliva, A., & Torralba, A. (2020). Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence. AI Magazine. 41(1), 70–81. Paper

Ramakrishnan, K., Panda, R., Fan, Q., Henning, J., Oliva, A. & Feris, R. (2020). Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPR-W 2020), Continual Learning in Computer Vision (CLVISION) Paper

Cichy, R.M., Roig, G., & Oliva, A. (2019). The Algonauts Project. Nature Machine Intelligence, 1(613) Paper Website

Jaegle*, A., Mehrpour*, V., Mohsenzadeh*, Y., Meyer, T., Oliva, A., & Rust, N. (2019). Population response magnitude variation in inferotemporal cortex predicts image memorability.eLife, 8:e47596 Paper

Goetschalckx*, L., Andonian*, A., Oliva, A., & Isola, P. (2019). GANalyze: Towards Visual Definition of Cognitive Image Properties. IEEE International Conference on Computer Vision (ICCV), 5744–5753 PaperWebsiteNews

Xiao, T., Fan, Q., Gutfreund, D., Monfort, M., Oliva, A., & Zhou, B. (2019). Reasoning About Human-Object Interactions Through Dual Attention Networks. IEEE International Conference on Computer Vision (ICCV), 3919–3928 Paper Website

Zhou,B., Bau, D., Oliva, A., & Torralba, A. (2019). Interpreting Deep Visual Representations via Network Dissection. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 41, 9, 2131–2145 Paper Website

Ramakrishnan*, K., Monfort*, M., McNamara, B., Lascelles, A., Gutfreund, D., Feris, R., & Oliva, A. (2019). Identifying Interpretable Action Units in Deep Networks.IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019) Workshop on Explainable AI. Paper

Monfort, M., Ramakrishnan, K., McNamara, B., Lascelles, A., Gutfreund, D., Feris, R., & Oliva, A. (2019). Examining Interpretable Feature Relationships in Deep Networks for Action recognition.ICML 2019 Workshop Deep Phenomena. Paper

Mohsenzadeh, Y., Mullin, C., Oliva, A., & Pantazis, D. (2019). The perceptual neural trace of memorable unseen scenes. Scientific Reports, 9, 6033 Paper

Mohsenzadeh, Y., Mullin, C., Lahner, B., Cichy, R.M., & Oliva, A. (2019). Reliability and generalizability of similarity-based fusion of fMRI and MEG data in the ventral and dorsal visual streams. Vision, 3(1), 8. Paper Website

Bylinskii, Z., Judd, T., Oliva, A., Torralba, A., & Durand, F. (2019). What do different evaluation metrics tell us about saliency models? IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 41(3), 740–757. Paper Website

Zhou,B., Andonian. A., Oliva, A., & Torralba, A. (2018). Temporal Relational Reasoning in Videos. European Conference on Computer Vision (ECCV)Website News

Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., & Torralba, A. (2018). Places: A 10 million Image Database for Scene Recognition.IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 40(6), 1452–1464.Paper Demo Website Places CNN Models

Khaligh-Razavi, S.M., Cichy, R.M., Pantazis, D., & Oliva, A. (2018). Tracking the spatiotemporal neural dynamics of object properties in the human brain. Journal of Cognitive Neuroscience, 30(11), 1559–1576.Paper

Singh, I., Oliva, A., & Howard, M. Visual memories are stored along a compressed timeline. Submitted manuscript. bioRxiv Paper

Madan, S., Bylinskii, Z., Tancik, M., Recasens, A., Zhong, K., Alsheikh, S., Pfister, H., Oliva, A. & Durand, F. (2018). Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics. arXiv, 1807.10441. arXiv Paper Website

Bau*,D., Zhou*,B., Khosla,A., Oliva, A., & Torralba, A. (2017). Network Dissection: Quantifying Interpretability of Deep Visual Representations. IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2695–2702. PaperWebsite TalkVideo News

Cichy, R.M., Khosla, A., Pantazis , D., & Oliva, A. (2017). Dynamics of scene representations in the human brain revealed by magnetoencephalography and deep neural networks. Neuroimage, 153:346–358 Paper Website

Bainbridge, W.A., Dilks, D., & Oliva, A. (2017). Memorability: A Stimulus-Driven Perceptual Neural Signature Distinctive from Memory. Neuroimage, 149, 141–152Paper Brain ROIs

Teng, S., Sommer, V., Pantazis, D., & Oliva, A. (2017). Hearing scenes: A neuromagnetic signature of auditory source and reverberant space separation. eNeuro 4(1) ENEURO.0007-17.2017Paper News

Monfort, M., Johnson, M., Oliva, A., & Hofmann, K. (2017). Asynchronous Data Aggregation for Training Visual Navigation Networks. Lifelong Learning: A Reinforcement Learning Approach Workshop at ICML, Sydney, Australia. Paper

Bylinskii, Z., Borkin, M.A., Kim, N.W., Pfister, H., & Oliva, A. (2017). Eye Fixation Metrics for Large Scale Evaluation and Comparison of Information Visualizations. In Burch, M., Chuang, L., Fisher, B., Schmidt, A., Weiskopf, D. (Eds.), Eye Tracking and Visualization: Foundations, Techniques, and Applications (pp. 235–255). Springer International Publishing. Paper Website

Kim, N.*, Bylinskii, Z.*, Borkin, M., Gajos, K., Oliva, A., Durand, F., & Pfister, H. (2017). BubbleView: replacing eye-tracking to crowdsource image importance. ACM Transactions on Computer-Human Interaction (TOCHI), 24(5), 36 Paper Website

Oliva, A., & Schyns, P.G. (2017). Hybrid Image Illusion. In The Oxford Compendium of Visual Illusions, Ed. Shapiro and Todorovic, Oxford University Press, Chapter 111 (p 763–766). Paper

Bylinskii, Z.*, Alsheikh, S.*, Madan, S.*, Recasens, A.*, Zhong, K., Pfister, H., Durand, F., & Oliva, A. (2017). Understanding Infographics through Textual and Visual Tag Prediction. arXiv 1709.09215 arXiv Paper

Vo, M., Bylinskii, Z., & Oliva, A. (2017). Image Memorability in the Eye of the Beholder: Tracking the Decay of Visual Scene Representations. BioRxiv:141044 2017 bioRxiv Paper

Cichy, R.M., Pantazis , D., & Oliva, A. (2016). Similarity-based fusion of MEG and fMRI reveals spatio-temporal dynamics in human cortex during visual object recognition. Cerebral Cortex, 26 (8): 3563–3579. Paper Website

Cichy, R.M., Khosla, A., Pantazis , D., Torralba, A., & Oliva, A. (2016). Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence. Scientific Reports, 6, 27755. PaperWebsite

Borkin*, A.M., Bylinskii*, Z., Kim, N.W., Bainbridge, C.M., Yeh, C.S., Borkin, D., Pfister, H., & Oliva, A. (2016). Beyond Memorability: Visualization Recognition and Recall. InfoVis 2015 IEEE Transactions on Visualization and Computer Graphics, 22(1), 519–528. Paper Supplementary Material Website Video News

Zhou, B., Khosla, A., Lapedriza , A., Oliva, A., & Torralba, A. (2016). Learning Deep Features for Discriminative Localization. IEEE Conference on Computer Vision and Pattern Recogntion (CVPR), pp. 2921–2929. arXiv Paper Website

Bylinskii, Z., Recasens, A., Borji , A., Oliva, A., Torralba, A., & Durand, F. (2016). Where should saliency models look next? Proceedings of the European Conference in Computer Vision (ECCV), Amsterdam Paper Supplementary Material Poster

Oliva, A., & Teng, S. (2016). The Cognitive Society.In Handbook of Society and Technology Convergence (W.S Bainbridge and M. C. Roco, Eds), Springer International Publishing Switzerland (pp 743–751). Paper

Bainbridge, W.A., & Oliva, A. (2015). Interaction Envelope: Local Spatial Representations of Objects at All Scales in Scene-Selective Regions.Neuroimage, 122, 408–416 Paper Dataset

Bainbridge, W.A., & Oliva, A. (2015). A toolbox and sample object perception data for equalization of natural images.Data in Brief, 5, 846–851 Paper Dataset

Park*, S.J, Konkle*, T., & Oliva, A. (2015). Parametric coding of the size and clutter of natural scenes in the human brain. Cerebral Cortex, 25, 1792–1805. Paper Stimuli

Khosla, A, Raju, A.S., Torralba, A., & Oliva, A. (2015). Understanding and Predicting Image Memorability at a Large Scale.IEEE International Conference on Computer Vision (ICCV), pp. 2390–2398.Paper Website News

Vondrick, C, Pirsiavash, H., Oliva, A., & Torralba, A. (2015). Learning Visual Biases from Human Imagination.Advances in Neural Information Processing Systems (NIPS), 28. Paper

Zhou, B., Khosla, A., Lapedriza , A., Oliva, A., & Torralba, A. (2015). Object Detectors emerge in Deep Scene CNNs .International Conference on Learning Representations (ICLR 2015). arXiv Paper Slides Visualization Places-CNN Visualization ImageNet-CNN

Bylinskii, Z., Isola, P., Bainbridge, C.M., Torralba, A., & Oliva, A. (2015). Intrinsic and Extrinsic Effects on Image Memorability. Vision Research, 116, 165-178. Paper Supplementary Material Poster Website

Kim, N.W., Bylinskii, Z., Borkin, A.M., Oliva, A., Gajos, K.Z., & Pfister, H. (2015). A Crowdsourced Alternative to Eye-tracking for Visualization Understanding.ACM Conference on Human Factors in Computing Systems (CHI) 2015 Extended Abstracts . Paper Poster

Bainbridge, C. M., Bainbridge, W.A, & Oliva, A. (2015). Quadri-stability of a spatially ambiguous auditory illusion.Frontiers in Human Neuroscience. The Future of Perceptual Illusions : From Phenomenology to Neuroscience. PaperSound file Continuous Sound

Zhou, B., Lapedriza , A., Xiao, J., Torralba, A., & Oliva, A. (2014). Learning Deep Features for Scene Recognition using Places Database.Conference on Neural Information Processing Systems (NIPS). Paper Website Demo

Cichy, R.M., Pantazis , D., & Oliva, A. (2014). Resolving human object recognition in space and time.Nature Neuroscience, 17(3), 455–464. Paper Supplementary Material WebsiteNews News

Isola, P., Xiao, J., Parikh, D, Torralba, A., & Oliva, A. (2014). What makes a photograph memorable? IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 36(7), 1469–1482. Paper

Zhou, B., Liu, L., Oliva, A., & Torralba, A. (2014). Recognizing City Identity via Attribute Analysis of Geo-tagged Images. Proceedings of the 13th European Conference on Computer Vision (ECCV). Paper

Vu, T.H., Olsson, C., Laptev, I., Oliva, A., & Sivic, J. (2014). Predicting Actions from Static Scenes. Proceedings of the 13th European Conference on Computer Vision (ECCV). PaperWebsite

Khosla, A., Bainbridge, W.A., Torralba, A., & Oliva, A. (2013). Modifying the Memorability of Face Photographs. Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp 3200–3207. Paper News Website

Bainbridge, W.A., Isola, P., & Oliva, A. (2013). The Intrinsic Memorability of Face Photographs. Journal of Experimental Psychology: General, 142(4), 1323–1334. Paper Website

Borkin, M.A, Vo, A.A., Bylinskii, Z., Isola, P., Sunkavalli, S., Oliva, A., & Pfister, H. (2013). What makes a visualization memorable? IEEE Transactions on Visualization and Computer Graphics (TVCG), 19(12), 2306–2315. Paper News Website

Oliva, A., Isola, P., Khosla, A., & Bainbridge, W.A. (2013). What makes a picture memorable? SPIE Newsroom Article, 7 May 2013, DOI: 10.1117/2.1201304.004806 Paper SPIE Website

Brady, T.F., Konkle, T., Gill, J., Oliva, A., & Alvarez, G.A. (2013). Visual long-term memory has the same limit on fidelity as visual working memory. Psychological Science, 24(6), 981–990. Paper Website

Brady, T.F., Konkle, T., Alvarez, G.A, & Oliva, A. (2013). Real-world objects are not represented as bound units: Independent forgetting of different object details from visual memory. Journal of Experimental Psychology: General, 142(3), 791–808. Paper Stimuli

Oliva, A. (2013). The Art of Hybrid Images: Two for the View of One. Art & Perception, 1(1-2), 65-74. Paper

Laprevote, V., Oliva, A., Ternois, A.S., Schwan, R., Thomas, P., & Boucart, M. (2013). Low spatial frequency bias in schizophrenia is not face specific: when the integration of coarse and fine information fails. Frontiers in Psychopathology, 4:248 Paper Frontiers Website

Oliva, A. (2013). Scene Perception. Chapter (51) in the New Visual Neurosciences. Eds John S. Werner and Leo. M. Chalupa (pp. 725–732) Chapter

MacGregor, D., Baba, M., Oliva, A., McLaughlin, A.C., Scacchi, W., Scassellati, B., Rubin, P., Mason, R.M., & Spohrer, J.R. (2013). Convergence Platforms: Human-Scale Convergence and the Quality of Life. In Convergence of Knowledge, Technology and Society: Beyond Convergence of Nano-Bio-Info-Cognitive Technologies (M. C. Roco, W. S. Bainbridge, B. Tonn and G. Whitesides, Eds), Springer Publishing (pp. 53–93) Chapter

Olds, J.L., MacGregor, D., Madou, M., McLaughlin, A., Oliva, A., Scassellati, B., & Wong, P. (2013). Implications: Human Cognition and Communication and the Emergence of the Cognitive Society In Convergence of Knowledge, Technology and Society: Beyond Convergence of Nano-Bio-Info-Cognitive Technologies (M. C. Roco, W. S. Bainbridge, B. Tonn and G. Whitesides, Eds), Springer Publishing (pp. 223–253) Chapter

Konkle, T. & Oliva, A. (2012). A real-world size organization of object responses in occipitotemporal cortex Neuron, 74 (6), 1114–1124. Paper News Stimuli

Konkle, T. & Oliva, A. (2012). A familiar-size Stroop effect: Real-world Size is an automatic property of object representation.Journal of Experimental Psychology: Human Perception and Performance, (3), 561–569. Paper Stimuli

Khosla, A., Xiao, J., Torralba, A., & Oliva, A. (2012). Memorability of Image Regions.Advances in Neural Information Processing Systems (NIPS), P. Bartlett and F.C.N. Pereira and C.J.C. Burges and L. Bottou and K.Q. Weinberge (Eds.), 25 ,305–313. Paper PosterWebsite

Brady, T.F., & Oliva, A. (2012). Spatial Frequency Integration During Active Perception: Perceptual Hysteresis When an Object Recedes. Frontiers in Perception Science, 3:462. Paper Frontiers Website

Khosla, A., Xiao, J., Isola, P., Torralba, A., & Oliva, A. (2021). Image memorability and visual inception. In SIGGRAPH Asia 2012 technical briefs, (pp. 1–4). Paper

Park, S., Brady, T.F., Greene, M.R., & Oliva, A. (2011). Disentangling scene content from its spatial boundary: Complementary roles for the parahippocampal place area and lateral occipital complex in representing real-world scenes Journal of Neuroscience, 31(4), 1333–1340. Paper

Konkle, T. & Oliva, A. (2011). Canonical visual size for real-world objects. Journal of Experimental Psychology: Human Perception & Performance, 37(1), 23. Paper Stimuli Download

Isola, P., Parikh, D., Torralba, A., & Oliva, A. (2011). Understanding the Intrinsic Memorability of Images. Advances in Neural Information Processing Systems (NIPS), J. Shawe-Taylor and R.S. Zemel and P. Bartlett and F.C.N. Pereira and K.Q. Weinberger (Eds.), 24, 2429–2437. Paper Poster Website

Isola, P., Xiao, J., Torralba, A., & Oliva, A. (2011). What makes an image memorable? Proceedings of the 24rd IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 145–152). Paper Poster Website

Gagnier, K.M., Intraub, H., Oliva, A., & Wolfe, J. (2011). Why Does Vantage Point Affect Boundary Extension? Visual Cognition, 19:2, 234–257. Paper

Oliva, A., Park, S., & Konkle, T. (2011). Representing, perceiving and remembering the shape of visual space. In L.R. Harris and M. Jenkin (Eds.), Vision in 3D Environments. Cambridge: Cambridge University Press (pp 308–339). Chapter

Konkle, T.*, Brady, T.F.*, Alvarez, G.A., & Oliva, A. (2010). Scene memory is more detailed than you think: the role of categories in visual long-term memory. Psychological Science, 21(11), 1551–1556. Paper Website Press

Konkle, T., Brady, T.F., Alvarez, G.A & Oliva, A. (2010). Conceptual distinctiveness supports detailed visual long-term memory for real-world objects. Journal of Experimental Psychology: General, 139 (3), 558–578. Website Paper

Greene, M.R. & Oliva, A. (2010). High-Level Aftereffects to Global Scene Properties. Journal of Experimental Psychology: Human Perception & Performance, 36(6), 1430–1442. Paper

Xiao, J., Hayes, J., Ehinger, K., Oliva, A., & Torralba, A. (2010). SUN Database: Large-scale Scene Recognition from Abbey to Zoo. Proceedings of the 23rd IEEE Conference on Computer Vision and Pattern Recognition, (pp. 3485–3492), IEEE Computer Society. Paper Website

Hidalgo-Sotelo, B. & Oliva, A. (2010). Person, place, and past influence eye movements during visual search. In S. Ohlsson & R. Catrambone (Eds.), Proceedings of the 32nd Annual Conference of the Cognitive Science Society (CSS), (pp. 820–825). Austin, TX: Cognitive Science Society. Paper

Ross, M.G., & Oliva, A. (2010). Estimating perception of scene layout properties from global image features. Journal of Vision, 10(1), 2-2. Website Paper

Laprevote, V., Oliva, A., Delerue, C., Thomas, P., & Boucart, M. (2010). Patients with schizophrenia are biased toward low spatial frequency to decode facial expression at a glance. Neuropsychologia, 48, 4164–4168. Paper

Oliva, A. (2010). Seeing and Thinking in the Mist (Book review of the Invisible Gorilla). Science, 329, 1017. Paper

Oliva, A. (2010). Understanding the Physics of the Mind: a proposal for a Perceptual Science Initiative. White paper submitted to the National Science Foundation SBE 2020. Paper

Greene, M.R., & Oliva, A. (2009). The briefest of glances: the time course of natural scene understanding. Psychological Science, 20 (4), 464–472. Paper

Greene, M.R., & Oliva, A. (2009). Recognition of Natural Scenes from Global Properties: Seeing the Forest Without Representing the Trees. Cognitive Psychology, 58(2), 137–179. Paper

Brady, T.F., Konkle, T., Oliva, A., & Alvarez, G. A. (2009). Detecting changes in real-world objects: The relationship between visual long-term memory and change blindness. Communicative & Integrative Biology 2(1), 1–3. Paper

Alvarez, G.A., & Oliva, A. (2009). Spatial Ensemble Statistics are Efficient Codes that Can be Represented with Reduced Attention. Proceedings of the National Academy of Sciences, 106, 7345–7350. Paper

Oliva, A. (2009). Visual Scene Perception. In Encyclopaedia of Perception, Ed: Bruce Goldstein. Sage Edition. Paper

Ehinger, K.A. *, Hidalgo-Sotelo, B. *, Torralba, A., & Oliva, A. (2009). Modelling Search for People in 900 Scenes: A combined source model of eye guidance. Visual Cognition, 17, 945–978. Website Paper

Brady, T.F., Konkle, T., Alvarez, G.A., & Oliva, A. (2008). Visual long-term memory has a massive storage capacity for object details. Proceedings of the National Academy of Sciences, USA, vol 105 (38), 14325–14329. Website Paper

Alvarez, G.A., & Oliva, A. (2008). The Representation of Simple Ensemble Visual Features Outside the Focus of Attention. Psychological Science, 19(4), 392–398. Paper

Brady, T.F., & Oliva, A. (2008). Statistical Learning using Real World Scenes: Extracting Categorical Regularities without Conscious Intent. Psychological Science, 19(7), 678–685. Poster Paper

Boucart, M., Dinon, J.F., Despretz, P., Desmettre, T., Hladiuk, K., & Oliva, A. (2008). Recognition of facial emotion in low vision: A flexible usage of facial features. Visual Neuroscience, 25(4), 603–609. Paper

Oliva, A. & Torralba, A. (2007). The Role of Context in Object Recognition. Trends in Cognitive Sciences, 11(12), 520–527. Paper

Alvarez, G. A, Konkle, T., & Oliva, A. (2007). Searching in Dynamic Displays: Effects of Configural and Spatiotemporal Predictability. Journal of Vision, 7(14), 12-12. Paper

Serre, T., Oliva, A., & Poggio, T. A. (2007). A feedforward architecture accounts for rapid categorization. Proceedings of the National Academy of Sciences, 104 (15), 6424–6429 Paper Poster Slides

Konkle, T., & Oliva, A. (2007). Normative representation of objects: evidence for an ecological bias in object perception and memory. In D.S. McNamara & J. G. Trafton (Eds.). Proceedings of the 29th Annual Conference of the Cognitive Science Society (CSS), (p 407–412), Nashville, TN: Cognitive Science Society 2007. Paper

Alvarez, G., & Oliva, A. (2007). The Role of Global Layout in Visual Short-term Memory. Visual Cognition, 15(1), 70–73. Paper

Torralba, A., Oliva, A., Castelhano, M., & Henderson, J.M. (2006). Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. Psychological Review, 113, 766–786. Paper Website

Oliva, A., Torralba, A., & Schyns, P.G. (2006). Hybrid Images. ACM Transactions on Graphics (Siggraph), 25, 3, 527–532. Paper Slides

Greene, M.R., & Oliva, A. (2006). Natural scene categorization from the conjunction of ecological global properties. Proceedings of the 28th Annual Conference of the Cognitive Science Society (CSS), 291–296. Paper Poster

Oliva, A. & Torralba, A. (2006). Building the Gist of a Scene: The Role of Global Image Features in Recognition. Progress in Brain Research: Visual perception, 155, 23–36. Paper

Goffaux, V., Jacques, C., Mouraux, A., Oliva, A., Rossion, B., & Schyns. P.G. (2005). Diagnostic colors contribute to early stages of scene categorization: behavioral and neurophysiological evidences. Visual Cognition, 12, 878–892. Paper

Oliva, A. (2005). Gist of the scene. In the Encyclopedia of Neurobiology of Attention. L. Itti, G. Rees, and J.K. Tsotsos (Eds.), Elsevier, San Diego, CA (pages 251–256).Paper

Hidalgo-Sotelo, B., Oliva, A., & Torralba, A. (2005). Human Learning of Contextual Priors for Object Search: Where does the time go? Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) – Workshops – vol. 3, p86–86.Paper

Oliva, A., Mack, M., Shrestha, M., & Peeper, A. (2004). Identifying the Perceptual Dimensions of Visual Complexity of Scenes. Proceedings of the Annual Meeting of the Cognitive Science Society (CSS), 26 (26) Paper

Oliva, A., Wolfe, J. M, & Arsenio, H. (2004). Panoramic Search: The interaction of Memory and Vision in Search through a Familiar Scene. Journal of Experimental Psychology: Human Perception and Performance, 30, 1132–1146. Paper

Torralba, A., & Oliva, A. (2003). Statistics of Natural Images Categories.Network: Computation in Neural Systems, 14, 391–412. Paper

Oliva, A., Torralba, A. Casthelano, M., & Henderson, J. (2003). Top-Down control of visual attention in object detection. Proceeding of the IEEE International Conference Image Processing, vol1 (pp 253–256).Paper

Torralba, A., & Oliva, A. (2002). Depth estimation from image structure. IEEE Pattern Analysis and Machine Intelligence (PAMI), 24, 1226–1238. Paper

Oliva, A., & Torralba, A. (2002). Scene-centered description from spatial envelope properties. Lecture Note in Computer Science Serie Proc. Second International Workshop on Biologically Motivated Computer Vision, Eds: H. Bulthoff, S.W. Lee, T. Poggio, & C. Wallraven. Srpinger-Verlag, Tuebingen, Germany (pp.263–272).Paper

Oliva, A., & Torralba, A. (2001). Modeling the Shape of the Scene: a Holistic Representation of the Spatial Envelope. International Journal in Computer Vision, 42, 145–175. Paper Database

Oliva, A., & Schyns, P.G. (2000). Diagnostic colors mediate scene recognition. Cognitive Psychology, 41, 176–210. Paper

Guérin-Dugué, A., & Oliva, A. (2000). Classification of scene photographs from local orientations features. Pattern Recognition Letters, 21 (13–14), 1135–1140. Paper

Schyns, P.G. & Oliva, A. (1999). Dr. Angry and Mr. Smile: when categorization flexibly modifies the perception of faces in rapid visual presentations. Cognition, 69, 243–265. Paper

Torralba, A. & Oliva, A. (1999). Semantic organization of scenes using discriminant structural templates. Proceedings of the Seventh IEEE International Conference on Computer Vision, Vol. 2, pp. 1253–1258. Paper

Oliva, A., Torralba, A., Guérin-Dugué, A., & Hérault, J. (1999). Global semantic classification of scenes using power spectrum templates. In Challenge of image retrieval, pp. 1–11. Paper

Oliva, A. & Schyns, P.G. (1997). Coarse blobs or fine edges? Evidence that information diagnosticity changes the perception of complex visual stimuli. Cognitive Psychology, 34, 72–107. Paper

Schyns, P.G. & Oliva, A. (1997). Flexible, diagnostically-driven, rather than fixed, perceptually determined scale selection in scene and face recognition. Perception, 26, 1027–1038.

Hérault, J., Oliva, A., & Guérin-Dugué, A. (1997). Scene categorisation by curvilinear component analysis of low frequency spectra. In ESANN’97: European symposium on artificial neural networks, pp. 91–96. Paper

Oliva, A., & Schyns, P. G. (1995). Mandatory scale perception promotes flexible scene categorization. In Proceedings of the XVII Meeting of the Cognitive Science Society, pp. 159–163.

Schyns, P.G. & Oliva, A. (1994). From blobs to boundary edges: Evidence for time- and spatial-scale-dependent scene recognition. Psychological Science, 5, 195–200. Paper

Habib, M., Gayraud, D., Oliva, A., Regis, J., Salamon, G., & Khalil, R. (1991). Effects of handedness and sex on the morphology of the corpus callosum: A study with brain magnetic resonance imaging Brain and cognition, 16(1), 41–61. Paper