Publications (Google scholar)
Lahner, B., Deb, M., Murty, N.A.R., & Oliva, A. (2025). MOSAIC: Meta-Organized Stimuli And fMRI Imaging data for Computational modeling. Submitted.
Kondic, J., Li, P., Joshi, D., He,Z., Abedin, S., Sun, J.Z., Wiesel, B., Schwartz, E., Nassar, A., Wu, B., Arbelle, A., Oliva, A., Gutfreund, D., Karlinsky, L., Feris, R. (2025). ChartGen: Scaling Chart Understanding via Code-Guided Synthetic Chart Generation. Presented at the Graphic Design Understanding and Generation Workshop, International Conference on Computer Vision (ICCV), Honolulu, HI, October 19, 2025. arxiv Paper
Tuckute, G., Mahowald, K., Isola, P., Oliva, A., Gibson, E., & Fedorenko, E. (2025). Intrinsically memorable words have unique associations with their meanings. Journal of Experimental Psychology: General. APA Editor Choice 2025 Paper
Granite Vision Team, IBM Research. (2025). Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence. arxiv Paper
Fosco*, C., Josephs*, E., Andonian, A., & Oliva, A. (2025). Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines. arXiv:2206.00535. arXiv Paper Website
Lahner, B., Dwivedi, K., Iamshchinina, P., Graumann, M., Lascelles, A., Roig, G., Gifford, A.T., Pan, B., Jin, S., Murty, N.A.R., Kay, K., Oliva†, A., & Cichy†, R.M. (2024). Modeling short visual events through the BOLD moments video fMRI dataset and metadata. Nature Communication, 15(1), 6241. Paper Project page Dataset
Fosco*, C., Lahner*, B., , Pan, B., Andonian, A., Josephs, E., Lascelles, A., & Oliva, A. (2024). Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals. European Conference on Computer Vision (ECCV), 457-474. Paper Website
Fosco, C., Lahner, B., , Pan, B., Andonian, A., Josephs, E., Lascelles, A., & Oliva, A. (2024). A Systematic Comparison of fMRI-to-video Reconstruction Techniques. First Workshop on Controllable Video Generation@ ICML24. Paper Website
Mohsenzadeh*, Y., Lahner*, B., Mullin*, C., & Oliva, A. (2024). Visual perception of highly memorable images is mediated by a distributed network of ventral visual regions that enable a late memorability response. Plos Biology, 22(4). Fusion Video Paper Website Repository News
Josephs, E., Fosco, C., & Oliva, A. (2024). Effects of browsing conditions and visual alert design on human susceptibility to deepfakes. Journal of Online Trust and Safety, 2 (2) Paper
Pan, B., Panda, R., Jin, S., Feris, R., Oliva, A., Isola, P., & Kim, Y. (2024). LangNav: Language as a Perceptual Representation for Navigation. Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Paper Code Model News
Sun, X., Panda, R., Chen, C-F., Wang, N., Pan, B., Oliva, A., Rogerio, R., & Saenko, K. (2024). Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024). Paper
Huang, I., Lin, W., Mirza, M.J., Hansen, J., Doveh, S., Butoi, V., Herzig, R., Arbelle, A., Kuehne, H., Darrell, T., Gan, C., Oliva, A., Feris, R., & Karlinsky, L. (2024). Conme: Rethinking evaluation of compositional reasoning for modern vlms. Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks, 37, 22927-22946. Paper
Wang, R., Ghosh, S., Cox, D., Antognini, D., Oliva, A., Feris, R., & Karlinsky, L. (2024). Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning. Neural Information Processing Systems (NeurIPS 2024). Paper
Pan, B., Shen, Y., Liu, H., Mishra, M., Zhang, G., Oliva, A., Raffel, C., & Panda, R. (2024). Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models. Submitted. arXiv Paper blog
Teng, S., Cichy, R.M., Pantazis, D., & Oliva, A. (2024). Touch to text: Spatiotemporal evolution of braille letter representations in blind readers. Submitted. bioRxiv paper
Gifford, A.T., Bersch, D., St-Laurent, M., Pinsard, B., Boyle, J., Bellec, L., Oliva, A., Roig, G., & Cichy, R.M. (2024). The Algonauts Project 2025 Challenge: How the Human Brain Makes Sense of Multimodal Movies. arXiv Paper Website
Pan, B., Panda, R., Feris, RS., Oliva, AJ. (2024). Interpretability-aware redundancy reduction for vision transformers. US Patent 12,154,307. Document
Panda, R., Meng, Y., Lin, CC., Feris, RS, Oliva, AJ. (2024). Dynamic multi-resolution processing for video classification. US Patent 11,954,910. Document
Cascante-Bonilla, P., Shehada, K., Smith, J.S., Doveh, S., Kim, D., Panda, R., Varol, G., Oliva A., Ordonez, V., Feris, R., & Karlinsky, L. (2023). Going Beyond Nouns With Vision & Language Models Using Synthetic Data. The 19th International Conference on Computer Vision (ICCV 2023). Paper Supplementary Material Website News
Zhong, H., Mishra, S., Kim, D., Jin, S., Panda, R., Kuehne, H., Karlinsky, L., Saligrama, V., Oliva, A., & Feris, R. (2023). Learning Human Action Recognition Representations Without Real Humans. NeurIPS 2023 Datasets and Benchmarks. Paper Website Poster
Fosco, C., Jin, S., Josephs, E., & Oliva, A. (2023). Leveraging Temporal Context in Low Representational Power Regimes. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), pp. 10693–10703 Paper Supplementary Material Website News
Gifford, A.T., Lahner, B., Saba-Sadiya, S., Vilas, M.G., Lascelles, A., Oliva, A., Kay, K., Roig, G., & Cichy, R.M. (2023). The Algonauts Project 2023 Challenge: How the Human Brain Makes Sense of Natural Scenes. arXiv, 2301.03198. arXiv Paper Website
B Pan, R Panda, CL Fosco, RS Feris, AJ Oliva. (2023). Adaptive redundancy reduction for efficient video understanding. US Patent App. 17/476,437. Document
Kim, Y., Mishra, S., Jin, S., Panda, R., Kuehne, H., Karlinsky, L., Saligrama, V., Saenko, K., Oliva, A., & Feris, R. (2022). How Transferable are Video Representations Based on Synthetic Data? Conference on Neural Information Processing Systems (NeurIPS 2022) Datasets and Benchmarks Track, 35, 35710–35723 Paper Supplementary Material News
Grauman, K. et al. (2022). Ego4D: Around the World in 3,000 Hours of Egocentric Video. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), pp. 18995–19012 arXiv Paper Paper Website
Lowe*, M., Mohsenzadeh*, Y., Lahner, B., Charest, I., Oliva†, A., & Teng†, S. (2022). Cochlea to categories: The spatiotemporal dynamics of semantic auditory representations. Cognitive Neuropsychology, 38(7–8), 468–489. Paper Fusion Video Webpage
Liu, A.H., Jin, S., Lai, C.J., Rouditchenko, A., Oliva, A., & Glass, J. (2022). Cross-Modal Discrete Representation Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), (Volume 1: Long Papers), 3013–3035 Paper News
Panda*, R., Chen*, C., Fan, Q., Sun, X., Saenko, K., Oliva, A., & Feris, R. (2021). AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition. International Conference on Computer Vision (ICCV 2021), (pp. 7576-7585). Paper Website
Sun, X., Panda, R., Chen, C., Oliva, A., Feris, R., & Saenko K. (2021). Dynamic Network Quantization for Efficient Video Inference. International Conference on Computer Vision (ICCV 2021), (pp. 7375-7385). Paper Website
Pan, B., Jiang, Y., Panda, R., Wang, Z., Feris, R., & Oliva, A. (2021). IA-RED²: Interpretability-Aware Redundancy Reduction for Vision Transformer. Advances in Neural Information Processing Systems (NeurIPS 2021), 34, 24898-24911 arXiv Paper Paper Website
Cichy, R.M., Dwivedi, K., Lahner, B., Lascelles, A., Iamshchinina, P., Graumann, M., Andonian, A., Murty, N.A.R., Kay, K., Roig, G., & Oliva A. (2021). The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion. arXiv, 2104.13714 arXiv Paper Website GitHub Code
Bylinskii, Z., Goetschalckx, L., Newman, A., & Oliva, A. (2021). Memorability: An image-computable measure of information utility. Chapter in Human Perception of Visual Information (pp. 207–239). Springer, Cham. arXiv Paper
Monfort*, M., Jin*, S., Liu, A., Harwath, D., Feris, R., Glass, J., & Oliva, A. (2021). Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), (pp. 14871-14881). Paper Supplementary Material Website
Chen, C., Panda, R., Ramakrishnan, K., Feris, R., Cohn, J., Oliva, A., & Fan, Q. (2021). Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2021), (pp. 6165-6175). arXiv Paper
Pan, B., Panda, R., Fosco, C., Lin, C., Andonian, A., Meng, Y., Saenko, K., Oliva, A., & Feris, R. (2021). VA-RED²: Video Adaptive Redundancy Reduction. Ninth International Conference on Learning Representations (ICLR 2021). Paper Website
Meng, Y., Panda, R., Lin, C., Sattigeri, P., Karlinsky, L., Saenko, K., Oliva, A., & Feris, R. (2021). AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition. Ninth International Conference on Learning Representations (ICLR 2021). Paper Website
Bylinskii, Z., Madan, S., Tancik, M., Recasens, A., Zhong, K., Alsheikh, S., Pfister, H., Oliva, A., & Durand, F. (2021). Parsing and Summarizing Infographics with Synthetically Trained Icon Proposals. In 2021 IEEE 14th Pacific Visualization Symposium (PacificVis), (pp. 31-40). arXiv Paper Website
Monfort, M., Ramakrishnan, K., Andonian, A., McNamara, B., Lascelles, A., Pan, B., Fan, Q., Gutfreund, D., Feris, R., & Oliva, A. (2021) Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding. IEEE Pattern Analysis and Machine Intelligence (PAMI), 44(12), 9434-9445. arXiv Paper Paper Website
Murty, N.A.R., Teng, S., Beeler, D., Mynick, A., Oliva, A., & Kanwisher, N. (2020). Visual experience is not necessary for the development of face selectivity in the lateral fusiform gyrus. Proceedings of the National Academy of Sciences, 117(37), 23011–23020. Paper Supp. News
Cichy, R.M. & Oliva, A. (2020). A M/EEG-fMRI Fusion Primer: Resolving Human Brain Responses in Space and Time. Neuron, 107(5), 772–781. Paper
Newman*, A., Fosco*, C., Casser, V., Lee, A., McNamara, B., & Oliva, A. (2020). Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) Paper Website
Andonian*, A., Fosco*, C, Monfort, M., Lee, A., Feris, R., Vondrick, C., & Oliva, A. (2020). We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) arXiv Paper Website News
Meng, Y., Lin, C., Panda, R., Sattigeri, P., Karlinsky, L., Oliva, A. Saenko, K., & Feris, R. (2020). AR-Net: Adaptive Frame Resolution for Efficient Action Recognition. Proceedings of the 16th European Conference on Computer Vision (ECCV 2020) arXiv Paper Website
Fosco*, C., Newman*, A., Sukhum, P., Zhang, Y., Zhao, N., Oliva, A., & Bylinskii, Z. (2020). How much time do you have? Modeling multi-duration saliency. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020) Paper Website News
Mohsenzadeh, Y., Mullin, C., Lahner, B., & Oliva, A. (2020). Emergence of Visual Center-Periphery Spatial Organization in Deep Neural Networks. Scientific Reports, 10:4638 Paper Website
Monfort, M., Andonian, A., Zhou, B., Ramakrishnan, K., Adel Bargal, S., Yan, T., Brown, L., Fan, Q., Gutfreund, D., Vondrick, C., & Oliva, A. (2020). Moments in Time dataset: one million videos for event understanding. IEEE Pattern Analysis and Machine Intelligence (PAMI), 42(2), 502–508.Paper Website
Oliva, A. (2020). Computational Models of Human Object and Scene Recognition.In The Cognitive Neurosciences, 6th Edition. Edited by Gazzaniga et al. MIT Press (pp. 151–157). Paper
Amini, L., Chen, C.-H., Cox, D., Oliva, A., & Torralba, A. (2020). Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence. AI Magazine. 41(1), 70–81. Paper
Ramakrishnan, K., Panda, R., Fan, Q., Henning, J., Oliva, A. & Feris, R. (2020). Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors. IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPR-W 2020), Continual Learning in Computer Vision (CLVISION) Paper
Cichy, R.M., Roig, G., & Oliva, A. (2019). The Algonauts Project. Nature Machine Intelligence, 1(613) Paper Website
Jaegle*, A., Mehrpour*, V., Mohsenzadeh*, Y., Meyer, T., Oliva, A., & Rust, N. (2019). Population response magnitude variation in inferotemporal cortex predicts image memorability.eLife, 8:e47596 Paper
Goetschalckx*, L., Andonian*, A., Oliva, A., & Isola, P. (2019). GANalyze: Towards Visual Definition of Cognitive Image Properties. IEEE International Conference on Computer Vision (ICCV), 5744–5753 PaperWebsiteNews
Xiao, T., Fan, Q., Gutfreund, D., Monfort, M., Oliva, A., & Zhou, B. (2019). Reasoning About Human-Object Interactions Through Dual Attention Networks. IEEE International Conference on Computer Vision (ICCV), 3919–3928 Paper Website
Zhou,B., Bau, D., Oliva, A., & Torralba, A. (2019). Interpreting Deep Visual Representations via Network Dissection. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 41, 9, 2131–2145 Paper Website
Ramakrishnan*, K., Monfort*, M., McNamara, B., Lascelles, A., Gutfreund, D., Feris, R., & Oliva, A. (2019). Identifying Interpretable Action Units in Deep Networks.IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019) Workshop on Explainable AI. Paper
Monfort, M., Ramakrishnan, K., McNamara, B., Lascelles, A., Gutfreund, D., Feris, R., & Oliva, A. (2019). Examining Interpretable Feature Relationships in Deep Networks for Action recognition.ICML 2019 Workshop Deep Phenomena. Paper
Mohsenzadeh, Y., Mullin, C., Oliva, A., & Pantazis, D. (2019). The perceptual neural trace of memorable unseen scenes. Scientific Reports, 9, 6033 Paper
Mohsenzadeh, Y., Mullin, C., Lahner, B., Cichy, R.M., & Oliva, A. (2019). Reliability and generalizability of similarity-based fusion of fMRI and MEG data in the ventral and dorsal visual streams. Vision, 3(1), 8. Paper Website
Bylinskii, Z., Judd, T., Oliva, A., Torralba, A., & Durand, F. (2019). What do different evaluation metrics tell us about saliency models? IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 41(3), 740–757. Paper Website
Zhou,B., Andonian. A., Oliva, A., & Torralba, A. (2018). Temporal Relational Reasoning in Videos. European Conference on Computer Vision (ECCV)Website News
Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., & Torralba, A. (2018). Places: A 10 million Image Database for Scene Recognition.IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 40(6), 1452–1464.PaperDemo Website Places CNN Models
Khaligh-Razavi, S.M., Cichy, R.M., Pantazis, D., & Oliva, A. (2018). Tracking the spatiotemporal neural dynamics of object properties in the human brain. Journal of Cognitive Neuroscience, 30(11), 1559–1576.Paper
Singh, I., Oliva, A., & Howard, M. Visual memories are stored along a compressed timeline. Submitted manuscript. bioRxiv Paper
Madan, S., Bylinskii, Z., Tancik, M., Recasens, A., Zhong, K., Alsheikh, S., Pfister, H., Oliva, A. & Durand, F. (2018). Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics. arXiv, 1807.10441. arXiv Paper Website
Bau*,D., Zhou*,B., Khosla,A., Oliva, A., & Torralba, A. (2017). Network Dissection: Quantifying Interpretability of Deep Visual Representations. IEEE Computer Vision and Pattern Recognition (CVPR), pp. 2695–2702. PaperWebsite TalkVideo News
Cichy, R.M., Khosla, A., Pantazis , D., & Oliva, A. (2017). Dynamics of scene representations in the human brain revealed by magnetoencephalography and deep neural networks. Neuroimage, 153:346–358 Paper Website
Bainbridge, W.A., Dilks, D., & Oliva, A. (2017). Memorability: A Stimulus-Driven Perceptual Neural Signature Distinctive from Memory. Neuroimage, 149, 141–152Paper Brain ROIs
Teng, S., Sommer, V., Pantazis, D., & Oliva, A. (2017). Hearing scenes: A neuromagnetic signature of auditory source and reverberant space separation. eNeuro 4(1) ENEURO.0007-17.2017Paper News
Monfort, M., Johnson, M., Oliva, A., & Hofmann, K. (2017). Asynchronous Data Aggregation for Training Visual Navigation Networks. Lifelong Learning: A Reinforcement Learning Approach Workshop at ICML, Sydney, Australia. Paper
Bylinskii, Z., Borkin, M.A., Kim, N.W., Pfister, H., & Oliva, A. (2017). Eye Fixation Metrics for Large Scale Evaluation and Comparison of Information Visualizations. In Burch, M., Chuang, L., Fisher, B., Schmidt, A., Weiskopf, D. (Eds.), Eye Tracking and Visualization: Foundations, Techniques, and Applications (pp. 235–255). Springer International Publishing. Paper Website
Kim, N.*, Bylinskii, Z.*, Borkin, M., Gajos, K., Oliva, A., Durand, F., & Pfister, H. (2017). BubbleView: replacing eye-tracking to crowdsource image importance. ACM Transactions on Computer-Human Interaction (TOCHI), 24(5), 36 Paper Website
Oliva, A., & Schyns, P.G. (2017). Hybrid Image Illusion. In The Oxford Compendium of Visual Illusions, Ed. Shapiro and Todorovic, Oxford University Press, Chapter 111 (p 763–766). Paper
Bylinskii, Z.*, Alsheikh, S.*, Madan, S.*, Recasens, A.*, Zhong, K., Pfister, H., Durand, F., & Oliva, A. (2017). Understanding Infographics through Textual and Visual Tag Prediction. arXiv 1709.09215 arXiv Paper
Vo, M., Bylinskii, Z., & Oliva, A. (2017). Image Memorability in the Eye of the Beholder: Tracking the Decay of Visual Scene Representations. BioRxiv:141044 2017 bioRxiv Paper
Cichy, R.M., Pantazis , D., & Oliva, A. (2016). Similarity-based fusion of MEG and fMRI reveals spatio-temporal dynamics in human cortex during visual object recognition. Cerebral Cortex, 26 (8): 3563–3579. Paper Website
Cichy, R.M., Khosla, A., Pantazis , D., Torralba, A., & Oliva, A. (2016). Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence. Scientific Reports, 6, 27755. PaperWebsite
Borkin*, A.M., Bylinskii*, Z., Kim, N.W., Bainbridge, C.M., Yeh, C.S., Borkin, D., Pfister, H., & Oliva, A. (2016). Beyond Memorability: Visualization Recognition and Recall. InfoVis 2015 IEEE Transactions on Visualization and Computer Graphics, 22(1), 519–528. Paper Supplementary Material Website Video News
Zhou, B., Khosla, A., Lapedriza , A., Oliva, A., & Torralba, A. (2016). Learning Deep Features for Discriminative Localization. IEEE Conference on Computer Vision and Pattern Recogntion (CVPR), pp. 2921–2929. arXiv Paper Website
Bylinskii, Z., Recasens, A., Borji , A., Oliva, A., Torralba, A., & Durand, F. (2016). Where should saliency models look next? Proceedings of the European Conference in Computer Vision (ECCV), Amsterdam Paper Supplementary Material Poster
Oliva, A., & Teng, S. (2016). The Cognitive Society.In Handbook of Society and Technology Convergence (W.S Bainbridge and M. C. Roco, Eds), Springer International Publishing Switzerland (pp 743–751). Paper
Bainbridge, W.A., & Oliva, A. (2015). Interaction Envelope: Local Spatial Representations of Objects at All Scales in Scene-Selective Regions.Neuroimage, 122, 408–416 Paper Dataset
Bainbridge, W.A., & Oliva, A. (2015). A toolbox and sample object perception data for equalization of natural images.Data in Brief, 5, 846–851 Paper Dataset
Park*, S.J, Konkle*, T., & Oliva, A. (2015). Parametric coding of the size and clutter of natural scenes in the human brain. Cerebral Cortex, 25, 1792–1805. Paper Stimuli
Khosla, A, Raju, A.S., Torralba, A., & Oliva, A. (2015). Understanding and Predicting Image Memorability at a Large Scale.IEEE International Conference on Computer Vision (ICCV), pp. 2390–2398.PaperWebsite News
Vondrick, C, Pirsiavash, H., Oliva, A., & Torralba, A. (2015). Learning Visual Biases from Human Imagination.Advances in Neural Information Processing Systems (NIPS), 28. Paper
Zhou, B., Khosla, A., Lapedriza , A., Oliva, A., & Torralba, A. (2015). Object Detectors emerge in Deep Scene CNNs .International Conference on Learning Representations (ICLR 2015). arXiv Paper Slides Visualization Places-CNN Visualization ImageNet-CNN
Bylinskii, Z., Isola, P., Bainbridge, C.M., Torralba, A., & Oliva, A. (2015). Intrinsic and Extrinsic Effects on Image Memorability. Vision Research, 116, 165-178. Paper Supplementary Material Poster Website
Kim, N.W., Bylinskii, Z., Borkin, A.M., Oliva, A., Gajos, K.Z., & Pfister, H. (2015). A Crowdsourced Alternative to Eye-tracking for Visualization Understanding.ACM Conference on Human Factors in Computing Systems (CHI) 2015 Extended Abstracts . Paper Poster
Bainbridge, C. M., Bainbridge, W.A, & Oliva, A. (2015). Quadri-stability of a spatially ambiguous auditory illusion.Frontiers in Human Neuroscience. The Future of Perceptual Illusions : From Phenomenology to Neuroscience. PaperSound file Continuous Sound
Zhou, B., Lapedriza , A., Xiao, J., Torralba, A., & Oliva, A. (2014). Learning Deep Features for Scene Recognition using Places Database.Conference on Neural Information Processing Systems (NIPS). Paper Website Demo
Cichy, R.M., Pantazis , D., & Oliva, A. (2014). Resolving human object recognition in space and time.Nature Neuroscience, 17(3), 455–464. Paper Supplementary Material WebsiteNews News
Isola, P., Xiao, J., Parikh, D, Torralba, A., & Oliva, A. (2014). What makes a photograph memorable? IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 36(7), 1469–1482. Paper
Zhou, B., Liu, L., Oliva, A., & Torralba, A. (2014). Recognizing City Identity via Attribute Analysis of Geo-tagged Images. Proceedings of the 13th European Conference on Computer Vision (ECCV). Paper
Vu, T.H., Olsson, C., Laptev, I., Oliva, A., & Sivic, J. (2014). Predicting Actions from Static Scenes. Proceedings of the 13th European Conference on Computer Vision (ECCV). PaperWebsite
Khosla, A., Bainbridge, W.A., Torralba, A., & Oliva, A. (2013). Modifying the Memorability of Face Photographs. Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp 3200–3207. Paper News Website
Bainbridge, W.A., Isola, P., & Oliva, A. (2013). The Intrinsic Memorability of Face Photographs. Journal of Experimental Psychology: General, 142(4), 1323–1334. Paper Website
Borkin, M.A, Vo, A.A., Bylinskii, Z., Isola, P., Sunkavalli, S., Oliva, A., & Pfister, H. (2013). What makes a visualization memorable? IEEE Transactions on Visualization and Computer Graphics (TVCG), 19(12), 2306–2315. Paper News Website
Oliva, A., Isola, P., Khosla, A., & Bainbridge, W.A. (2013). What makes a picture memorable? SPIE Newsroom Article, 7 May 2013, DOI: 10.1117/2.1201304.004806 Paper SPIE Website
Brady, T.F., Konkle, T., Gill, J., Oliva, A., & Alvarez, G.A. (2013). Visual long-term memory has the same limit on fidelity as visual working memory. Psychological Science, 24(6), 981–990. Paper Website
Brady, T.F., Konkle, T., Alvarez, G.A, & Oliva, A. (2013). Real-world objects are not represented as bound units: Independent forgetting of different object details from visual memory. Journal of Experimental Psychology: General, 142(3), 791–808. Paper Stimuli
Oliva, A. (2013). The Art of Hybrid Images: Two for the View of One. Art & Perception, 1(1-2), 65-74. Paper
Laprevote, V., Oliva, A., Ternois, A.S., Schwan, R., Thomas, P., & Boucart, M. (2013). Low spatial frequency bias in schizophrenia is not face specific: when the integration of coarse and fine information fails. Frontiers in Psychopathology, 4:248 Paper Frontiers Website
Oliva, A. (2013). Scene Perception. Chapter (51) in the New Visual Neurosciences. Eds John S. Werner and Leo. M. Chalupa (pp. 725–732) Chapter
MacGregor, D., Baba, M., Oliva, A., McLaughlin, A.C., Scacchi, W., Scassellati, B., Rubin, P., Mason, R.M., & Spohrer, J.R. (2013). Convergence Platforms: Human-Scale Convergence and the Quality of Life. In Convergence of Knowledge, Technology and Society: Beyond Convergence of Nano-Bio-Info-Cognitive Technologies (M. C. Roco, W. S. Bainbridge, B. Tonn and G. Whitesides, Eds), Springer Publishing (pp. 53–93) Chapter
Olds, J.L., MacGregor, D., Madou, M., McLaughlin, A., Oliva, A., Scassellati, B., & Wong, P. (2013). Implications: Human Cognition and Communication and the Emergence of the Cognitive Society In Convergence of Knowledge, Technology and Society: Beyond Convergence of Nano-Bio-Info-Cognitive Technologies (M. C. Roco, W. S. Bainbridge, B. Tonn and G. Whitesides, Eds), Springer Publishing (pp. 223–253) Chapter
Konkle, T. & Oliva, A. (2012). A real-world size organization of object responses in occipitotemporal cortex Neuron, 74 (6), 1114–1124. Paper News Stimuli
Konkle, T. & Oliva, A. (2012). A familiar-size Stroop effect: Real-world Size is an automatic property of object representation.Journal of Experimental Psychology: Human Perception and Performance, (3), 561–569. Paper Stimuli
Khosla, A., Xiao, J., Torralba, A., & Oliva, A. (2012). Memorability of Image Regions.Advances in Neural Information Processing Systems (NIPS), P. Bartlett and F.C.N. Pereira and C.J.C. Burges and L. Bottou and K.Q. Weinberge (Eds.), 25 ,305–313. Paper PosterWebsite
Brady, T.F., & Oliva, A. (2012). Spatial Frequency Integration During Active Perception: Perceptual Hysteresis When an Object Recedes. Frontiers in Perception Science, 3:462. Paper Frontiers Website
Khosla, A., Xiao, J., Isola, P., Torralba, A., & Oliva, A. (2021). Image memorability and visual inception. In SIGGRAPH Asia 2012 technical briefs, (pp. 1–4). Paper
Park, S., Brady, T.F., Greene, M.R., & Oliva, A. (2011). Disentangling scene content from its spatial boundary: Complementary roles for the parahippocampal place area and lateral occipital complex in representing real-world scenes Journal of Neuroscience, 31(4), 1333–1340. Paper
Konkle, T. & Oliva, A. (2011). Canonical visual size for real-world objects. Journal of Experimental Psychology: Human Perception & Performance, 37(1), 23. Paper Stimuli Download
Isola, P., Parikh, D., Torralba, A., & Oliva, A. (2011). Understanding the Intrinsic Memorability of Images. Advances in Neural Information Processing Systems (NIPS), J. Shawe-Taylor and R.S. Zemel and P. Bartlett and F.C.N. Pereira and K.Q. Weinberger (Eds.), 24, 2429–2437. Paper Poster Website
Isola, P., Xiao, J., Torralba, A., & Oliva, A. (2011). What makes an image memorable? Proceedings of the 24rd IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 145–152). Paper Poster Website
Gagnier, K.M., Intraub, H., Oliva, A., & Wolfe, J. (2011). Why Does Vantage Point Affect Boundary Extension? Visual Cognition, 19:2, 234–257. Paper
Oliva, A., Park, S., & Konkle, T. (2011). Representing, perceiving and remembering the shape of visual space. In L.R. Harris and M. Jenkin (Eds.), Vision in 3D Environments. Cambridge: Cambridge University Press (pp 308–339). Chapter
Konkle, T.*, Brady, T.F.*, Alvarez, G.A., & Oliva, A. (2010). Scene memory is more detailed than you think: the role of categories in visual long-term memory. Psychological Science, 21(11), 1551–1556. Paper Website Press
Konkle, T., Brady, T.F., Alvarez, G.A & Oliva, A. (2010). Conceptual distinctiveness supports detailed visual long-term memory for real-world objects. Journal of Experimental Psychology: General, 139 (3), 558–578. Website Paper
Greene, M.R. & Oliva, A. (2010). High-Level Aftereffects to Global Scene Properties. Journal of Experimental Psychology: Human Perception & Performance, 36(6), 1430–1442. Paper
Xiao, J., Hayes, J., Ehinger, K., Oliva, A., & Torralba, A. (2010). SUN Database: Large-scale Scene Recognition from Abbey to Zoo. Proceedings of the 23rd IEEE Conference on Computer Vision and Pattern Recognition, (pp. 3485–3492), IEEE Computer Society. Paper Website
Hidalgo-Sotelo, B. & Oliva, A. (2010). Person, place, and past influence eye movements during visual search. In S. Ohlsson & R. Catrambone (Eds.), Proceedings of the 32nd Annual Conference of the Cognitive Science Society (CSS), (pp. 820–825). Austin, TX: Cognitive Science Society. Paper
Ross, M.G., & Oliva, A. (2010). Estimating perception of scene layout properties from global image features. Journal of Vision, 10(1), 2-2. Website Paper
Laprevote, V., Oliva, A., Delerue, C., Thomas, P., & Boucart, M. (2010). Patients with schizophrenia are biased toward low spatial frequency to decode facial expression at a glance. Neuropsychologia, 48, 4164–4168. Paper
Oliva, A. (2010). Seeing and Thinking in the Mist (Book review of the Invisible Gorilla). Science, 329, 1017. Paper
Oliva, A. (2010). Understanding the Physics of the Mind: a proposal for a Perceptual Science Initiative. White paper submitted to the National Science Foundation SBE 2020. Paper
Greene, M.R., & Oliva, A. (2009). The briefest of glances: the time course of natural scene understanding. Psychological Science, 20 (4), 464–472. Paper
Greene, M.R., & Oliva, A. (2009). Recognition of Natural Scenes from Global Properties: Seeing the Forest Without Representing the Trees. Cognitive Psychology, 58(2), 137–179. Paper
Brady, T.F., Konkle, T., Oliva, A., & Alvarez, G. A. (2009). Detecting changes in real-world objects: The relationship between visual long-term memory and change blindness. Communicative & Integrative Biology 2(1), 1–3. Paper
Alvarez, G.A., & Oliva, A. (2009). Spatial Ensemble Statistics are Efficient Codes that Can be Represented with Reduced Attention. Proceedings of the National Academy of Sciences, 106, 7345–7350. Paper
Oliva, A. (2009). Visual Scene Perception. In Encyclopaedia of Perception, Ed: Bruce Goldstein. Sage Edition. Paper
Ehinger, K.A. *, Hidalgo-Sotelo, B. *, Torralba, A., & Oliva, A. (2009). Modelling Search for People in 900 Scenes: A combined source model of eye guidance. Visual Cognition, 17, 945–978. Website Paper
Brady, T.F., Konkle, T., Alvarez, G.A., & Oliva, A. (2008). Visual long-term memory has a massive storage capacity for object details. Proceedings of the National Academy of Sciences, USA, vol 105 (38), 14325–14329. Website Paper
Alvarez, G.A., & Oliva, A. (2008). The Representation of Simple Ensemble Visual Features Outside the Focus of Attention. Psychological Science, 19(4), 392–398. Paper
Brady, T.F., & Oliva, A. (2008). Statistical Learning using Real World Scenes: Extracting Categorical Regularities without Conscious Intent. Psychological Science, 19(7), 678–685. Poster Paper
Boucart, M., Dinon, J.F., Despretz, P., Desmettre, T., Hladiuk, K., & Oliva, A. (2008). Recognition of facial emotion in low vision: A flexible usage of facial features. Visual Neuroscience, 25(4), 603–609. Paper
Oliva, A. & Torralba, A. (2007). The Role of Context in Object Recognition. Trends in Cognitive Sciences, 11(12), 520–527. Paper
Alvarez, G. A, Konkle, T., & Oliva, A. (2007). Searching in Dynamic Displays: Effects of Configural and Spatiotemporal Predictability. Journal of Vision, 7(14), 12-12. Paper
Serre, T., Oliva, A., & Poggio, T. A. (2007). A feedforward architecture accounts for rapid categorization. Proceedings of the National Academy of Sciences, 104 (15), 6424–6429 Paper Poster Slides
Konkle, T., & Oliva, A. (2007). Normative representation of objects: evidence for an ecological bias in object perception and memory. In D.S. McNamara & J. G. Trafton (Eds.). Proceedings of the 29th Annual Conference of the Cognitive Science Society (CSS), (p 407–412), Nashville, TN: Cognitive Science Society 2007. Paper
Alvarez, G., & Oliva, A. (2007). The Role of Global Layout in Visual Short-term Memory. Visual Cognition, 15(1), 70–73. Paper
Torralba, A., Oliva, A., Castelhano, M., & Henderson, J.M. (2006). Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. Psychological Review, 113, 766–786. Paper Website
Oliva, A., Torralba, A., & Schyns, P.G. (2006). Hybrid Images. ACM Transactions on Graphics (Siggraph), 25, 3, 527–532. Paper Slides
Greene, M.R., & Oliva, A. (2006). Natural scene categorization from the conjunction of ecological global properties. Proceedings of the 28th Annual Conference of the Cognitive Science Society (CSS), 291–296. Paper Poster
Oliva, A. & Torralba, A. (2006). Building the Gist of a Scene: The Role of Global Image Features in Recognition. Progress in Brain Research: Visual perception, 155, 23–36. Paper
Goffaux, V., Jacques, C., Mouraux, A., Oliva, A., Rossion, B., & Schyns. P.G. (2005). Diagnostic colors contribute to early stages of scene categorization: behavioral and neurophysiological evidences. Visual Cognition, 12, 878–892. Paper
Oliva, A. (2005). Gist of the scene. In the Encyclopedia of Neurobiology of Attention. L. Itti, G. Rees, and J.K. Tsotsos (Eds.), Elsevier, San Diego, CA (pages 251–256).Paper
Hidalgo-Sotelo, B., Oliva, A., & Torralba, A. (2005). Human Learning of Contextual Priors for Object Search: Where does the time go? Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05) – Workshops – vol. 3, p86–86.Paper
Oliva, A., Mack, M., Shrestha, M., & Peeper, A. (2004). Identifying the Perceptual Dimensions of Visual Complexity of Scenes. Proceedings of the Annual Meeting of the Cognitive Science Society (CSS), 26 (26) Paper
Oliva, A., Wolfe, J. M, & Arsenio, H. (2004). Panoramic Search: The interaction of Memory and Vision in Search through a Familiar Scene. Journal of Experimental Psychology: Human Perception and Performance, 30, 1132–1146. Paper
Torralba, A., & Oliva, A. (2003). Statistics of Natural Images Categories.Network: Computation in Neural Systems, 14, 391–412. Paper
Oliva, A., Torralba, A. Casthelano, M., & Henderson, J. (2003). Top-Down control of visual attention in object detection. Proceeding of the IEEE International Conference Image Processing, vol1 (pp 253–256).Paper
Torralba, A., & Oliva, A. (2002). Depth estimation from image structure. IEEE Pattern Analysis and Machine Intelligence (PAMI), 24, 1226–1238. Paper
Oliva, A., & Torralba, A. (2002). Scene-centered description from spatial envelope properties. Lecture Note in Computer Science Serie Proc. Second International Workshop on Biologically Motivated Computer Vision, Eds: H. Bulthoff, S.W. Lee, T. Poggio, & C. Wallraven. Srpinger-Verlag, Tuebingen, Germany (pp.263–272).Paper
Oliva, A., & Torralba, A. (2001). Modeling the Shape of the Scene: a Holistic Representation of the Spatial Envelope. International Journal in Computer Vision, 42, 145–175. Paper Database
Oliva, A., & Schyns, P.G. (2000). Diagnostic colors mediate scene recognition. Cognitive Psychology, 41, 176–210. Paper
Guérin-Dugué, A., & Oliva, A. (2000). Classification of scene photographs from local orientations features. Pattern Recognition Letters, 21 (13–14), 1135–1140. Paper
Schyns, P.G. & Oliva, A. (1999). Dr. Angry and Mr. Smile: when categorization flexibly modifies the perception of faces in rapid visual presentations. Cognition, 69, 243–265. Paper
Torralba, A. & Oliva, A. (1999). Semantic organization of scenes using discriminant structural templates. Proceedings of the Seventh IEEE International Conference on Computer Vision, Vol. 2, pp. 1253–1258. Paper
Oliva, A., Torralba, A., Guérin-Dugué, A., & Hérault, J. (1999). Global semantic classification of scenes using power spectrum templates. In Challenge of image retrieval, pp. 1–11. Paper
Oliva, A. & Schyns, P.G. (1997). Coarse blobs or fine edges? Evidence that information diagnosticity changes the perception of complex visual stimuli. Cognitive Psychology, 34, 72–107. Paper
Schyns, P.G. & Oliva, A. (1997). Flexible, diagnostically-driven, rather than fixed, perceptually determined scale selection in scene and face recognition. Perception, 26, 1027–1038.
Hérault, J., Oliva, A., & Guérin-Dugué, A. (1997). Scene categorisation by curvilinear component analysis of low frequency spectra. In ESANN’97: European symposium on artificial neural networks, pp. 91–96. Paper
Oliva, A., & Schyns, P. G. (1995). Mandatory scale perception promotes flexible scene categorization. In Proceedings of the XVII Meeting of the Cognitive Science Society, pp. 159–163.
Schyns, P.G. & Oliva, A. (1994). From blobs to boundary edges: Evidence for time- and spatial-scale-dependent scene recognition. Psychological Science, 5, 195–200. Paper
Habib, M., Gayraud, D., Oliva, A., Regis, J., Salamon, G., & Khalil, R. (1991). Effects of handedness and sex on the morphology of the corpus callosum: A study with brain magnetic resonance imaging Brain and cognition, 16(1), 41–61. Paper
