{"id":97455,"date":"2024-06-29T01:13:29","date_gmt":"2024-06-29T01:13:29","guid":{"rendered":"https:\/\/www.techrepublic.com\/?p=4247730"},"modified":"2024-06-29T01:13:29","modified_gmt":"2024-06-29T01:13:29","slug":"google-adds-new-gemini-models-to-vertex-ai","status":"publish","type":"post","link":"https:\/\/cloudnewshub.com\/?p=97455","title":{"rendered":"Google Adds New Gemini Models to Vertex AI"},"content":{"rendered":"<div><img decoding=\"async\" src=\"https:\/\/assets.techrepublic.com\/uploads\/2024\/06\/google-cloud-ai-featured-jun-24.jpg\" class=\"ff-og-image-inserted\"><\/div>\n<p>Google Cloud made a flurry of AI announcements today, with new models available in Vertex AI, upgrades to the Gemini API and new languages in Google Translate enabled by AI. Developers can now take advantage of the 2 million token context window in Gemini 1.5 Pro without needing to be patient on a waitlist. Plus, you can now apply to be one of a limited number of users of Google\u2019s newest image generator, Imagen 3, which can create photorealistic images for marketing or corporate presentations.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Googles_newest_AI_are_open_for_business\"><\/span>Google\u2019s newest AI are open for business<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>New or higher-performance models of several Google AI are in wider availability in the Vertex AI platform today:<\/p>\n<ul>\n<li>Gemini 1.5 Flash, a relatively compact model with a 1 million-token context window, is generally available.<\/li>\n<li>Gemini 1.5 Pro is generally available.<\/li>\n<li>Imagen 3 is in preview. Apply <a href=\"https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSdMHAK_KJygnvV2Psga7FIzKAhAqIBS_bHYzfgf_Y2h7fsoGA\/viewform\" target=\"_blank\" rel=\"noopener noreferrer\">here<\/a>.<\/li>\n<\/ul>\n<p>\u201cGemini 1.5 Flash makes it easier for us to continue our scale-out phase of applying generative AI in high-volume tasks without the trade-offs on quality of the output or context window, even for multimodal use cases,\u201d said JC Escalante, global head of generative AI at market research firm Ipsos, in a <a href=\"https:\/\/cloud.google.com\/blog\/products\/ai-machine-learning\/vertex-ai-offers-enterprise-ready-generative-ai?e=48754805\" target=\"_blank\" rel=\"noopener noreferrer\">Google press release<\/a>.<\/p>\n<p>Vertex AI now offers or will soon offer:<\/p>\n<ul>\n<li>The lightweight Gemini variant <strong><a href=\"https:\/\/www.techrepublic.com\/article\/google-gemma-chat-ai\/\">Gemma 2<\/a><\/strong>: Generally available on Vertex AI next month in two sizes: 9 billion parameters and 27 billion parameters<\/li>\n<li>Anthropic\u2019s Claude 3.5 Sonnet, which is available now.<\/li>\n<li>Context caching, a technique used to create higher speed and lower cost for AI requests using repetitive content, is now in public preview for Gemini 1.5 Pro and Flash.<\/li>\n<li>Provisioned throughput, a feature of Vertex AI for provisioned workloads on Gemini models, is generally available now for users on the allowlist.<\/li>\n<li>Grounding for better accuracy, in which the AI can check its information against Google Search, is available now. Grounding from third parties such as Thomson Reuters is expected to roll out starting next quarter.<\/li>\n<li>Grounding with High Fidelity mode, which combines Gemini 1.5 Flash with company data, is now in experimental preview.<\/li>\n<\/ul>\n<p>Vertex AI is available in <a href=\"https:\/\/cloud.google.com\/vertex-ai\/docs\/general\/locations\" target=\"_blank\" rel=\"noopener noreferrer\">a wide variety of geographical regions<\/a>.<\/p>\n<p><strong>SEE: Here are <a href=\"https:\/\/www.techrepublic.com\/article\/future-of-search-ai-search-engines\/\">five ways to search the web with generative AI<\/a>.<\/strong><\/p>\n<aside class=\"pinbox right\">\n<h3 class=\"heading\">More Google news &amp; tips<\/h3>\n<\/aside>\n<h2><span class=\"ez-toc-section\" id=\"Gemini_API_can_now_run_code_execution_and_more\"><\/span>Gemini API can now run code execution and more<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Code execution is now possible in Gemini 1.5 Pro and Gemini 1.5 Flash, letting developers run Python within the model and experiment with letting the generative AI iterate and learn from the code. It can be accessed through the Gemini API or Google AI Studio.<\/p>\n<p>In addition, users of the <a href=\"https:\/\/developers.googleblog.com\/en\/new-features-for-the-gemini-api-and-google-ai-studio\/\" target=\"_blank\" rel=\"noopener noreferrer\">Gemini API<\/a> can now:<\/p>\n<ul>\n<li>Use the full 2 million token window on Gemini 1.5 Pro.<\/li>\n<li>Use context caching for both Gemini 1.5 Pro and 1.5 Flash.<\/li>\n<li>Experiment with Gemma 2 in Google AI Studio.<\/li>\n<\/ul>\n<h2><span class=\"ez-toc-section\" id=\"Cantonese_and_109_other_languages_added_to_Google_Translate\"><\/span>Cantonese and 109 other languages added to Google Translate<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Google has used the PaLM 2 language model to <a href=\"https:\/\/support.google.com\/translate\/answer\/15139004?visit_id=638551096349597865-4018359467&amp;p=TranslateNewLanguages2024&amp;rd=1\" target=\"_blank\" rel=\"noopener noreferrer\">add 110 languages to the public Google Translate service<\/a>; this is its largest-ever expansion of this service. A highlight is Cantonese, a language that Google has found difficult to find data in order to add it to Translate in the past because it \u201coften overlaps with Mandarin in writing.\u201d<\/p>\n<p>PaLM 2 has enabled Google to more efficiently add more languages that are similar to each other, Google Senior Software Engineer Isaac Caswell said in a <a href=\"https:\/\/blog.google\/products\/translate\/google-translate-new-languages-2024\/\" target=\"_blank\" rel=\"noopener noreferrer\">press release about this Google Translate expansion<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google Cloud made a flurry of AI announcements today, with new models available in Vertex AI, upgrades to the Gemini API and new languages in Google Translate enabled by AI. Developers can now take advantage of the 2 million token context window in Gemini 1.5 Pro without needing to be patient on a waitlist. Plus, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[],"tags":[],"class_list":["post-97455","post","type-post","status-publish","format-standard","hentry"],"_links":{"self":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts\/97455","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=97455"}],"version-history":[{"count":0,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=\/wp\/v2\/posts\/97455\/revisions"}],"wp:attachment":[{"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=97455"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=97455"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cloudnewshub.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=97455"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}