{"id":1636,"date":"2023-10-04T11:31:42","date_gmt":"2023-10-04T11:31:42","guid":{"rendered":"https:\/\/geneea.com\/news\/?p=1636"},"modified":"2026-01-28T08:17:32","modified_gmt":"2026-01-28T08:17:32","slug":"geneeas-ai-spotlight-5","status":"publish","type":"post","link":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5","title":{"rendered":"Geneea&#8217;s AI Spotlight #5"},"content":{"rendered":"\n<p id=\"ember48\">The fifth edition of our newsletter on Large Language Models is here.&nbsp;<\/p>\n\n\n\n<p id=\"ember49\">Today, we take a look at&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>the race among industry leaders,<\/li>\n\n\n\n<li>the challenges of real-world applications,<\/li>\n\n\n\n<li>some new findings and framework releases, and<\/li>\n\n\n\n<li>how more and more websites are blocking AI data crawlers.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"ember51\">Clash of the Titans&nbsp;<\/h2>\n\n\n\n<p id=\"ember52\"><strong>Google \u2013 the empire strikes back<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Google&nbsp;<a href=\"https:\/\/cloud.google.com\/blog\/products\/ai-machine-learning\/vertex-ai-next-2023-announcements\">announced<\/a>&nbsp;the addition of&nbsp;<strong>Llama 2<\/strong>&nbsp;and&nbsp;<strong>Falcon<\/strong>&nbsp;support to Vertex AI, its AI platform.&nbsp;<strong>Claude 2<\/strong>&nbsp;should be available soon. The models are easy to use but&nbsp;<strong>not as easy<\/strong>&nbsp;as calling the PaLM 2 API. You need to deploy them yourself. Google provides wizards for this, but you still need to pick the right hardware depending on the model and your expected load. 
Even though the models are free, you might end up paying much more than you would for the PaLM 2 or GPT API.&nbsp;<\/li>\n\n\n\n<li>PaLM 2&nbsp;<a href=\"https:\/\/cloud.google.com\/vertex-ai\/docs\/generative-ai\/models\/tune-text-models\">added support<\/a>&nbsp;for&nbsp;<strong>32k context&nbsp;<\/strong>windows and&nbsp;<strong>fine-tuning<\/strong>.<\/li>\n\n\n\n<li><a href=\"https:\/\/www.theinformation.com\/articles\/google-nears-release-of-gemini-ai-to-rival-openai\">According to The Information<\/a>&nbsp;(see the&nbsp;<a href=\"https:\/\/www.reuters.com\/technology\/google-nears-release-ai-software-gemini-information-2023-09-15\/\">Reuters<\/a>&nbsp;article), Google is&nbsp;<strong>close to releasing Gemini<\/strong>, its new powerful model suite. A handful of businesses have been given early access to some of these models. Gemini is being positioned as a direct competitor to GPT-4, but Demis Hassabis, Google DeepMind\u2019s CEO,&nbsp;<a href=\"https:\/\/www.wired.com\/story\/google-deepmind-demis-hassabis-chatgpt\/\">says<\/a>&nbsp;that it will combine a large language model (LLM) with&nbsp;<strong>planning and problem-solving<\/strong>&nbsp;abilities. (It was DeepMind&#8217;s AlphaGo that defeated the world&#8217;s top-ranked Go player.) There are&nbsp;<a href=\"https:\/\/www.semianalysis.com\/p\/google-gemini-eats-the-world-gemini\">rumors<\/a>&nbsp;that Gemini is&nbsp;<strong>significantly more powerful than GPT-4<\/strong>.<\/li>\n<\/ul>\n\n\n\n<p id=\"ember54\"><strong>Meta\u2019s big plans<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.wsj.com\/tech\/ai\/meta-is-developing-a-new-more-powerful-ai-system-as-technology-race-escalates-decf9451\">According to the Wall Street Journal<\/a>, Meta has big plans in the AI domain after it fell behind the other big players in AI commercialization. It is working hard on a new model&nbsp;<strong>comparable to GPT-4<\/strong>. 
Currently, it is expanding its data centers and acquiring the necessary GPUs.&nbsp;<\/li>\n\n\n\n<li>It is hard to say how much difference this will make. About&nbsp;<a href=\"https:\/\/www.wsj.com\/articles\/mark-zuckerberg-was-early-in-ai-now-meta-is-trying-to-catch-up-94a86284\">one-third of Meta&#8217;s LLM researchers left<\/a>&nbsp;the company last year (some voluntarily, some not). Also, GPT-4 is here now, and Meta is only planning to start training the new model in early 2024. This also means it will probably be released after Google&#8217;s Gemini.&nbsp;<\/li>\n\n\n\n<li>According to the WSJ, Zuckerberg wants the model to be open-source and free, but Meta&#8217;s lawyers think this might be too risky.<\/li>\n<\/ul>\n\n\n\n<p id=\"ember56\"><strong>Microsoft Copilot, Ernie, and chips<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Microsoft&nbsp;<a href=\"https:\/\/blogs.microsoft.com\/blog\/2023\/09\/21\/announcing-microsoft-copilot-your-everyday-ai-companion\/\">announced<\/a>&nbsp;<strong>Copilot<\/strong>, a unified AI assistant available in Windows 11, Microsoft 365, Edge, and Bing. Even Paint will get some AI. Sounds like Cortana 2.0.&nbsp;<\/li>\n\n\n\n<li>A few days ago, Anthropic&nbsp;<a href=\"https:\/\/www.anthropic.com\/index\/anthropic-amazon\">announced<\/a>&nbsp;a $4B investment from Amazon and tighter integration with AWS.<\/li>\n\n\n\n<li>Last month, OpenAI launched a business version of ChatGPT that competes with the ChatGPT deployment in Microsoft Azure (<a href=\"https:\/\/www.reuters.com\/technology\/openai-releasing-version-chatgpt-large-businesses-2023-08-28\/\">Reuters<\/a>).<\/li>\n\n\n\n<li>Meanwhile,&nbsp;<strong>Baidu<\/strong>&nbsp;has introduced&nbsp;<strong>Ernie<\/strong>, its own alternative to ChatGPT. 
This launch had been delayed a few times, initially&nbsp;<a href=\"http:\/\/research.baidu.com\/Blog\/index-view?id=183\">scheduled for March<\/a>&nbsp;but scrapped at the last moment.&nbsp;<a href=\"https:\/\/www.economist.com\/business\/2023\/09\/03\/meet-ernie-chinas-answer-to-chatgpt\">The Economist has taken a closer look<\/a>&nbsp;at the challenges of running such a system in China. According to local regulations, the chatbot must align with the&nbsp;<strong>fundamental principles of socialism<\/strong>. Interestingly, the chatbot claims that COVID-19 originally came from the United States and was later transmitted to Wuhan in China.&nbsp;<a href=\"https:\/\/www.nytimes.com\/2023\/07\/14\/business\/baidu-ernie-openai-chatgpt-chinese.html\">The New York Times compared<\/a>&nbsp;Ernie\u2019s answers with those of ChatGPT.<\/li>\n\n\n\n<li>One of the main bottlenecks to AI development \u2013 the&nbsp;<strong>shortage of GPUs<\/strong>&nbsp;\u2013 remains (see articles by&nbsp;<a href=\"https:\/\/www.ft.com\/content\/bec85749-9354-4470-ac88-9323541c7bce\">FT<\/a>&nbsp;and&nbsp;<a href=\"https:\/\/www.wsj.com\/tech\/ai\/nvidia-supply-concerns-ease-but-long-term-challenges-remain-a259dc54\">WSJ<\/a>). All of Nvidia&#8217;s chips are made by a single company in Taiwan: Taiwan Semiconductor Manufacturing Company, and as TSMC&nbsp;<a href=\"https:\/\/asia.nikkei.com\/Business\/Tech\/Semiconductors\/TSMC-sees-AI-chip-output-constraints-lasting-1.5-years\">explains<\/a>, the shortage will last until 2025. We wonder what Ernie thinks about this.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"ember58\">Hype meets reality<\/h2>\n\n\n\n<p id=\"ember59\">As we move from being astonished that LLMs can suggest ten ideas for a blog to more practical applications, more and more challenges surface. 
Finally, there is some correction in expectations.&nbsp;<\/p>\n\n\n\n<p id=\"ember60\"><a href=\"https:\/\/garymarcus.substack.com\/\">Gary Marcus<\/a>&nbsp;has been skeptical since the start. Maybe too skeptical. He has been stressing that AI is much more than language models (e.g., planning of complex workflows), that AGI is not imminent, etc. Ted Gioia even&nbsp;<a href=\"https:\/\/www.honest-broker.com\/p\/ugly-numbers-from-microsoft-and-chatgpt\">argues<\/a>&nbsp;that Microsoft\u2019s bet on AI just created a new version of&nbsp;<a href=\"https:\/\/en.wikipedia.org\/wiki\/Office_Assistant\">Clippy<\/a>.<\/p>\n\n\n\n<p id=\"ember61\">Other experts were less pessimistic, but they still stressed that bringing LLMs to&nbsp;<strong>production<\/strong>&nbsp;takes some&nbsp;<strong>nontrivial effort<\/strong>. We mentioned some of those concerns before:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>\u201cBuilding LLM applications for production\u201d, an excellent post by Chip Huyen (see&nbsp;<a href=\"https:\/\/www.linkedin.com\/pulse\/geneeas-ai-spotlight-1-geneea\">issue #1<\/a>)<\/li>\n\n\n\n<li>\u201cAll the Hard Stuff Nobody Talks About when Building Products with LLMs\u201d by Phillip Carter from&nbsp;<a href=\"http:\/\/honeycomb.io\/\">Honeycomb.io<\/a>&nbsp;(<a href=\"https:\/\/www.linkedin.com\/pulse\/geneeas-ai-spotlight-2-geneea\">issue #2<\/a>)<\/li>\n\n\n\n<li>\u201cLost in the Middle: How Language Models Use Long Contexts\u201d (Longer contexts are not the silver bullet in&nbsp;<a href=\"https:\/\/www.linkedin.com\/pulse\/geneeas-ai-spotlight-4-geneea\">issue #4<\/a>)<\/li>\n\n\n\n<li>\u201cHow is ChatGPT&#8217;s behavior changing over time?\u201d (Changing quality of GPT results in&nbsp;<a href=\"https:\/\/www.linkedin.com\/pulse\/geneeas-ai-spotlight-4-geneea\">issue #4<\/a>)<\/li>\n<\/ul>\n\n\n\n<p id=\"ember63\"><a 
href=\"https:\/\/apnews.com\/article\/artificial-intelligence-hallucination-chatbots-chatgpt-falsehoods-ac4672c5b06e6f91050aa46ee731bcf4\">Associated Press<\/a>&nbsp;explores the problem of&nbsp;<strong>hallucinations<\/strong>. While Sam Altman, the CEO of OpenAI, thinks that hallucinations will be alleviated in two years, for now, he trusts ChatGPT&#8217;s answers \u201cthe least of anybody on Earth\u201d. Emily M. Bender, a linguistics professor, considers them an&nbsp;<strong>inherent property of LLMs<\/strong>&nbsp;as they are \u201cdesigned to make things up\u201d. For some use cases, such as marketing, \u201challucinations are actually an added bonus\u201d suggests Shane Orlick, president of Jasper AI.<\/p>\n\n\n\n<p id=\"ember64\">Also, according to&nbsp;<a href=\"http:\/\/similarweb.com\/\">Similarweb.com<\/a>&nbsp;(as&nbsp;<a href=\"https:\/\/www.reuters.com\/technology\/chatgpt-traffic-slips-again-third-month-row-2023-09-07\/\">reported by Reuters<\/a>), the number of ChatGPT users has declined for three months in a row. This might be AI fatigue, or it might be just school kids being on vacation.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"ember65\">LLMs are hungry, thirsty, and take deep breaths<\/h2>\n\n\n\n<p id=\"ember66\">We knew that LLMs are great electricity hogs, but they are also quite&nbsp;<strong>thirsty<\/strong>, and now it seems they work better when taking&nbsp;<strong>deep breaths<\/strong>:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Researchers from the University of California, Riverside,&nbsp;<a href=\"https:\/\/arxiv.org\/pdf\/2304.03271.pdf\">show<\/a>&nbsp;that LLMs use a surprisingly large amount of water in both training and inference, mostly for cooling and during electricity generation. 
Microsoft\u2019s water consumption rose by a third between 2021 and 2022, mainly due to AI development.&nbsp;<\/li>\n\n\n\n<li>According to the&nbsp;<a href=\"https:\/\/arxiv.org\/abs\/2309.03409\">paper<\/a>&nbsp;from DeepMind about prompt optimization discussed below, PaLM 2 was best at solving certain mathematical tasks when instructed with prompts starting with \u201cTake a deep breath and work on this problem step by step.\u201d Without taking a deep breath, the results were worse.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"ember68\">Fine-tuning, prompts, and LlamaIndex<\/h2>\n\n\n\n<p id=\"ember69\"><a href=\"https:\/\/arxiv.org\/abs\/2308.10792\"><strong>Instruction Tuning for Large Language Models: A Survey<\/strong><\/a><strong>&nbsp;(2023-08)<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A great survey of approaches to&nbsp;<strong>instruction tuning<\/strong>. Instruction tuning is what turns language models, i.e., devices predicting the most likely next word, into chatbots.&nbsp;<\/li>\n\n\n\n<li>The authors review instruction-tuning datasets, efficient methods for fine-tuning (LoRA, HINT, LOMO\u2026), and various model types (imitation models, multimodal models, and models tuned for specific domains, such as writing, coding, medical, \u2026).&nbsp;<\/li>\n\n\n\n<li>They discuss the main challenges, including limited dataset diversity and the tendency of models to learn only surface patterns from training tasks.&nbsp;<\/li>\n\n\n\n<li>The paper covers so much important information that we decided to dedicate a&nbsp;<a href=\"https:\/\/geneea.com\/news\/reading-notes-instruction-tuning-for-llms\/\"><strong>separate post<\/strong><\/a>&nbsp;to our notes.<\/li>\n<\/ul>\n\n\n\n<p id=\"ember71\"><a href=\"https:\/\/arxiv.org\/abs\/2309.03409\"><strong>Large Language Models as Optimizers (2023-09)<\/strong><\/a><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>DeepMind shows that&nbsp;<strong>prompts&nbsp;<\/strong>can be very 
effectively&nbsp;<strong>optimized with LLMs<\/strong>.&nbsp;<\/li>\n\n\n\n<li>Two LLMs cooperate on the optimization: the&nbsp;<strong>scorer<\/strong>&nbsp;assigns a score to prompts generated by the&nbsp;<strong>optimizer<\/strong>. The optimizer&#8217;s task, defined in a meta-prompt, is to find a prompt with the highest score based on previous prompts and their scores.&nbsp;<\/li>\n\n\n\n<li>The open questions include how to avoid overfitting to training data, how to use error examples, and how to select the initial conditions.<\/li>\n<\/ul>\n\n\n\n<p id=\"ember73\"><strong>LlamaIndex Updates&nbsp;<\/strong>(<a href=\"https:\/\/medium.com\/llamaindex-blog\/llamaindex-update-09-03-2023-4a7c21c0f60b\">Sep 3<\/a>&nbsp;&amp;&nbsp;<a href=\"https:\/\/medium.com\/llamaindex-blog\/llamaindex-update-20-09-2023-86ed66f78bac\">Sep 20<\/a>)&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A&nbsp;<strong>fully working RAG application&nbsp;<\/strong>based on LlamaIndex, including UI, was open-sourced (<a href=\"https:\/\/github.com\/run-llama\/sec-insights\">GitHub<\/a>). A RAG (Retrieval Augmented Generation) application searches external data and uses an LLM to generate answers.&nbsp;<\/li>\n\n\n\n<li>Linear adapters allow&nbsp;<strong>tuning embeddings<\/strong>&nbsp;to a particular use case without re-embedding (more details&nbsp;<a href=\"https:\/\/medium.com\/llamaindex-blog\/fine-tuning-a-linear-adapter-for-any-embedding-model-8dd0a142d383\">here<\/a>).&nbsp;<\/li>\n\n\n\n<li><strong>Agents<\/strong>&nbsp;can now be composed&nbsp;<strong>hierarchically,&nbsp;<\/strong>which means you can easily combine agents, each specialized for a particular task. 
See&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1qIb09SyuLeiwGy_FGcRcQpM78yQ2p0_3?usp=sharing\">this notebook<\/a>&nbsp;for an example.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"trainingdata\">Training Data &amp; Intellectual Property<\/h2>\n\n\n\n<p id=\"ember76\">A growing number of websites are labeling their pages as off-limits for AI crawlers.&nbsp;&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>As of September 22, nearly 26% of the top&nbsp;<a href=\"https:\/\/dataforseo.com\/free-seo-stats\/top-1000-websites\">1,000 websites<\/a>&nbsp;(including Amazon, Quora, Bloomberg, CNN, NYT, and Reuters) were using&nbsp;<strong>robots.txt<\/strong>&nbsp;to block GPTBot,&nbsp;<a href=\"https:\/\/originality.ai\/blog\/study-websites-blocking-gptbot\">according to<\/a>&nbsp;<a href=\"http:\/\/originality.ai\/\">Originality.AI<\/a>, an AI content detection service.&nbsp;<\/li>\n\n\n\n<li>Only 14% blocked Common Crawl Bot. This&nbsp;<strong>does not make much sense<\/strong>&nbsp;because OpenAI is also training on Common Crawl.&nbsp;<\/li>\n\n\n\n<li>Also, blocking only GPTBot means the pages are not included in the training of OpenAI\u2019s models. However, they can still be downloaded for use by&nbsp;<strong>plugins<\/strong>. To prevent that, it is necessary to block ChatGPT-User.<\/li>\n\n\n\n<li>For some reason,&nbsp;<strong>only two sites were blocking Anthropic<\/strong>.&nbsp;<\/li>\n\n\n\n<li>We think that the different approach to crawlers is not intentional. 
For example, Reuters was not blocking Common Crawl Bot on September 22, but when we inspected it on September 24, it already was.&nbsp;<\/li>\n\n\n\n<li>An&nbsp;<a href=\"https:\/\/www.theguardian.com\/technology\/2023\/sep\/01\/the-guardian-blocks-chatgpt-owner-openai-from-trawling-its-content\">article<\/a>&nbsp;by The Guardian explains why they are blocking GPTBot and that they are open to \u201cmutually beneficial commercial relationships with developers\u201d.&nbsp;<\/li>\n\n\n\n<li>If you do not want your pages to be included in the training of AI bots, read the&nbsp;<a href=\"http:\/\/originality.ai\/\">Originality.AI<\/a>&nbsp;post for instructions on how to set up robots.txt properly. But be aware that there is no common standard, so other players might still crawl it.&nbsp;<\/li>\n<\/ul>\n\n\n\n<p>Please <a href=\"https:\/\/www.linkedin.com\/pulse\/geneeas-ai-spotlight-5-geneea\/?trackingId=k66Q839WStmzujEIZlk4ig%3D%3D\">subscribe<\/a> and stay tuned for the next issue of Geneea\u2019s AI Spotlight newsletter!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The fifth edition of our newsletter on Large Language Models is here.\u00a0<\/p>\n<p>Today, we take a look at\u00a0<\/p>\n<p>\u2022  the race among industry leaders,<br \/>\n\u2022  the challenges of real-world applications,<br \/>\n\u2022  some new findings and framework releases, and<br \/>\n\u2022  how more and more websites are blocking AI data crawlers.<\/p>\n","protected":false},"author":15,"featured_media":1640,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[378,374],"tags":[244,240,242],"class_list":["post-1636","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-large-language-models","category-newsletter","tag-ai","tag-generativeai","tag-newsletter"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - 
https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Geneea&#039;s AI Spotlight #5 - Geneea News<\/title>\n<meta name=\"description\" content=\"LLM newsletter #5: competition among industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for AI.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Geneea&#039;s AI Spotlight #5 - Geneea News\" \/>\n<meta property=\"og:description\" content=\"LLM newsletter #5: competition among industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for AI.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5\" \/>\n<meta property=\"og:site_name\" content=\"Geneea News\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-04T11:31:42+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-01-28T08:17:32+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1019\" \/>\n\t<meta property=\"og:image:height\" content=\"573\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Marcela Soukupova\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Marcela Soukupova\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5\"},\"author\":{\"name\":\"Marcela Soukupova\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#\\\/schema\\\/person\\\/69c8751a4c026723f4bac2e892f52cd8\"},\"headline\":\"Geneea&#8217;s AI Spotlight #5\",\"datePublished\":\"2023-10-04T11:31:42+00:00\",\"dateModified\":\"2026-01-28T08:17:32+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5\"},\"wordCount\":1617,\"publisher\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2023\\\/09\\\/newsletter5-robot-1.png\",\"keywords\":[\"AI\",\"generativeAI\",\"newsletter\"],\"articleSection\":[\"Large language models\",\"Newsletter\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5\",\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5\",\"name\":\"Geneea's AI Spotlight #5 - Geneea News\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2023\\\/09\\\/newsletter5-robot-1.png\",\"datePublished\":\"2023-10-04T11:31:42+00:00\",\"dateModified\":\"2026-01-28T08:17:32+00:00\",\"description\":\"LLM newsletter #5: competition among 
industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for AI.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#primaryimage\",\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2023\\\/09\\\/newsletter5-robot-1.png\",\"contentUrl\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2023\\\/09\\\/newsletter5-robot-1.png\",\"width\":1019,\"height\":573},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/geneeas-ai-spotlight-5#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/geneea.com\\\/news\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Geneea&#8217;s AI Spotlight #5\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#website\",\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/\",\"name\":\"Geneea News\",\"description\":\"Learn more about what&#039;s happening at Geneea: new NLP features, newest case studies, tutoring projects, conferences we attended, etc.\",\"publisher\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/geneea.com\\\/news\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#organization\",\"name\":\"Geneea 
News\",\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/cropped-geneea-logo-50pc.png\",\"contentUrl\":\"https:\\\/\\\/geneea.com\\\/news\\\/wp-content\\\/uploads\\\/2022\\\/02\\\/cropped-geneea-logo-50pc.png\",\"width\":242,\"height\":64,\"caption\":\"Geneea News\"},\"image\":{\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/geneea.com\\\/news\\\/#\\\/schema\\\/person\\\/69c8751a4c026723f4bac2e892f52cd8\",\"name\":\"Marcela Soukupova\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g\",\"caption\":\"Marcela Soukupova\"},\"sameAs\":[\"http:\\\/\\\/Marcela%20Soukupova\"],\"url\":\"https:\\\/\\\/geneea.com\\\/news\\\/author\\\/marcela-soukupova\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Geneea's AI Spotlight #5 - Geneea News","description":"LLM newsletter #5: competition among industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for AI.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5","og_locale":"en_US","og_type":"article","og_title":"Geneea's AI Spotlight #5 - Geneea News","og_description":"LLM newsletter #5: competition among industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for AI.","og_url":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5","og_site_name":"Geneea News","article_published_time":"2023-10-04T11:31:42+00:00","article_modified_time":"2026-01-28T08:17:32+00:00","og_image":[{"width":1019,"height":573,"url":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png","type":"image\/png"}],"author":"Marcela Soukupova","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Marcela Soukupova","Est. 
reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#article","isPartOf":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5"},"author":{"name":"Marcela Soukupova","@id":"https:\/\/geneea.com\/news\/#\/schema\/person\/69c8751a4c026723f4bac2e892f52cd8"},"headline":"Geneea&#8217;s AI Spotlight #5","datePublished":"2023-10-04T11:31:42+00:00","dateModified":"2026-01-28T08:17:32+00:00","mainEntityOfPage":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5"},"wordCount":1617,"publisher":{"@id":"https:\/\/geneea.com\/news\/#organization"},"image":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#primaryimage"},"thumbnailUrl":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png","keywords":["AI","generativeAI","newsletter"],"articleSection":["Large language models","Newsletter"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5","url":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5","name":"Geneea's AI Spotlight #5 - Geneea News","isPartOf":{"@id":"https:\/\/geneea.com\/news\/#website"},"primaryImageOfPage":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#primaryimage"},"image":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#primaryimage"},"thumbnailUrl":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png","datePublished":"2023-10-04T11:31:42+00:00","dateModified":"2026-01-28T08:17:32+00:00","description":"LLM newsletter #5: competition among industry leaders, challenges of application deployment, new articles and platforms, websites blocking data crawlers for 
AI.","breadcrumb":{"@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#primaryimage","url":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png","contentUrl":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2023\/09\/newsletter5-robot-1.png","width":1019,"height":573},{"@type":"BreadcrumbList","@id":"https:\/\/geneea.com\/news\/geneeas-ai-spotlight-5#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/geneea.com\/news"},{"@type":"ListItem","position":2,"name":"Geneea&#8217;s AI Spotlight #5"}]},{"@type":"WebSite","@id":"https:\/\/geneea.com\/news\/#website","url":"https:\/\/geneea.com\/news\/","name":"Geneea News","description":"Learn more about what&#039;s happening at Geneea: new NLP features, newest case studies, tutoring projects, conferences we attended, etc.","publisher":{"@id":"https:\/\/geneea.com\/news\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/geneea.com\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/geneea.com\/news\/#organization","name":"Geneea News","url":"https:\/\/geneea.com\/news\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/geneea.com\/news\/#\/schema\/logo\/image\/","url":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2022\/02\/cropped-geneea-logo-50pc.png","contentUrl":"https:\/\/geneea.com\/news\/wp-content\/uploads\/2022\/02\/cropped-geneea-logo-50pc.png","width":242,"height":64,"caption":"Geneea 
News"},"image":{"@id":"https:\/\/geneea.com\/news\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/geneea.com\/news\/#\/schema\/person\/69c8751a4c026723f4bac2e892f52cd8","name":"Marcela Soukupova","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/44f35824640c6a5b31bfef2f478d704874dc3d81bfad511c158ab12274072e16?s=96&d=mm&r=g","caption":"Marcela Soukupova"},"sameAs":["http:\/\/Marcela%20Soukupova"],"url":"https:\/\/geneea.com\/news\/author\/marcela-soukupova"}]}},"_links":{"self":[{"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/posts\/1636","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/users\/15"}],"replies":[{"embeddable":true,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/comments?post=1636"}],"version-history":[{"count":5,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/posts\/1636\/revisions"}],"predecessor-version":[{"id":1771,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/posts\/1636\/revisions\/1771"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/media\/1640"}],"wp:attachment":[{"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/media?parent=1636"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/categories?post=1636"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/geneea.com\/news\/wp-json\/wp\/v2\/tags?post=1636"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":
true}]}}