AI – Tuesday, August 13, 2024: Notable and Interesting News, Articles, and Papers

Advanced AI data center

A selection of the most important recent news, articles, and papers about AI.

News, Articles, and Analyses

Why larger LLM context windows are all the rage – IBM Research

https://research.ibm.com/blog/larger-context-window

(Wednesday, July 24, 2024) “IBM has scaled the context window of its open-source Granite 3B and 8B models to 128,000 tokens, the new industry standard.”

Generative AI’s slop era – The Atlantic

https://www.theatlantic.com/newsletters/archive/2024/08/ai-search-bots-war/679429/

Author: Damon Beres

(Friday, August 09, 2024) “New search bots underscore familiar problems with the technology.”

California, NVIDIA launch first-of-its-kind AI collaboration | Governor of California

https://www.gov.ca.gov/2024/08/09/california-nvidia-launch-first-of-its-kind-ai-collaboration/

(Friday, August 09, 2024) “New state initiative with NVIDIA kick-starts efforts to expand artificial intelligence (AI) tools and resources so students, educators, and workers – especially in community colleges – can learn new skills and advance their careers.

As Alexa turns 10, Amazon looks to generative AI | TechCrunch

https://techcrunch.com/2024/08/10/as-alexa-turns-10-amazon-looks-to-generative-ai/

Author: Brian Heater

(Saturday, August 10, 2024) “While Amazon has continued releasing Echo devices, including an upgraded Spot announced last month, the company has taken its foot off the gas.”

Intel’s Q2 2024 Earnings: Navigating Challenges & Strategic Shifts – The Futurum Group

https://futurumgroup.com/insights/intels-q2-2024-earnings-release-navigating-challenges-and-strategic-shifts/

Author: Ron Westfall

“Intel’s Q2 2024 earnings usher in a $10B cost reduction plan to bolster near-term competitiveness in key segments and land long game strategy.”

Technical Papers, Articles, and Preprints

[2408.05253] A Systematic Literature Map on Big Data

https://arxiv.org/abs/2408.05253

Authors: Rossi, Rogerio; Hirama, Kechi; Franco, Eduardo Ferreira

arXiv logo(Thursday, August 08, 2024) “The paradigm of Big Data has been established as a solid field of studies in many areas such as healthcare, science, transport, education, government services, among others. Despite widely discussed, there is no agreed definition about the paradigm although there are many concepts proposed by the academy and industry. This work aims to provide an analytical view of the studies conducted and published regarding the Big Data paradigm. The approach used is the systematic map of the literature, combining bibliometric analysis and content analysis to depict the panorama of research works, identifying patterns, trends, and gaps. The results indicate that there is still a long way to go, both in research and in concepts, such as building and defining adequate infrastructures and standards, to meet future challenges and for the paradigm to become effective and bring the expected benefits.”

[2408.05924] Adapting a Foundation Model for Space-based Tasks

https://arxiv.org/abs/2408.05924

Authors: Foutter, Matthew; Bhoj, Praneet; Sinha, Rohan; Elhafsi, Amine; Banerjee, Somrita; Agia, Christopher; Kruger, Justin; Guffanti, Tommaso; Gammelli, Daniele; D’Amico, Simone; Pavone, Marco

arXiv logo(Monday, August 12, 2024) “Foundation models, e.g., large language models, possess attributes of intelligence which offer promise to endow a robot with the contextual understanding necessary to navigate complex, unstructured tasks in the wild. In the future of space robotics, we see three core challenges which motivate the use of a foundation model adapted to space-based applications: 1) Scalability of ground-in-the-loop operations; 2) Generalizing prior knowledge to novel environments; and 3) Multi-modality in tasks and sensor data. Therefore, as a first-step towards building a foundation model for space-based applications, we automatically label the AI4Mars dataset to curate a language annotated dataset of visual-question-answer tuples. We fine-tune a pretrained LLaVA checkpoint on this dataset to endow a vision-language model with the ability to perform spatial reasoning and navigation on Mars’ surface. In this work, we demonstrate that 1) existing vision-language models are deficient visual reasoners in space-based applications, and 2) fine-tuning a vision-language model on extraterrestrial data significantly improves the quality of responses even with a limited training dataset of only a few thousand samples.”