AI
- 2023年10月23日
- AI
Google is focusing at Duolingo with new English tutoring tool
Google is making a significant move in the language learning space with a new feature in Google Search that aims to enhance users’ English speaking skills. Initially, this feature is rolling out to Android users in Argentina, Colombia, India, Indonesia, Mexico, and Venezuela, with plans to expand to more countries and languages in the future. This new tool offers interactive speaking practice and personalized feedback for learners translating to or from English, making it a valuable addition to Google’s language learning resources.
The personalized nature of this feature is a standout aspect. Google’s approach includes providing semantic feedback to assess the relevance and comprehensibility of a learner’s response to a given question. Additionally, it identifies areas where grammar improvements can be made and offers example answers at different language complexity levels. During practice sessions, users can also access contextual translations for any words they don’t understand, creating a holistic learning experience.

To develop this feature, Google invested heavily in AI and machine learning. The Google Translate team created the Deep Aligner model for suggesting translations, and other research groups adapted grammar correction models for speech transcriptions, especially for users with accented speech. Google Research teams designed models for semantic feedback and sentence complexity estimation. To ensure a well-rounded language learning experience, Google collaborated with linguists, teachers, and ESL/EFL pedagogical experts, who contributed a mix of human-expert content, AI-assisted content, and in-house human-reviewed material.
While Google’s precise intentions with this feature remain unclear, it has the potential to boost user engagement. Although the blog post doesn’t explicitly indicate that Google is targeting established language learning apps like Duolingo, it is an intriguing move in a field with substantial profit potential. Google has previously ventured into language learning and education tools, and the success and direction of these efforts may depend on user adoption and popularity.
You can also check out the blog from Google here.
Yuuma
yuuma at 2023年10月23日 10:00:00
- 2023年10月16日
- AI
Google is releasing generative AI capabilities
Google is introducing generative AI capabilities to its popular Google Photos app through the release of the Pixel 8 and Pixel 8 Pro smartphones. Initially revealed at Google’s I/O developer conference in May, this feature enables more advanced photo edits, such as filling in gaps, repositioning subjects, and adjusting the foreground or background of images. Previously, achieving these effects required external tools like Google’s Magic Eraser or professional software like Photoshop, involving more manual effort.

Using generative AI, Google Photos allows users to perform complex edits like resizing or repositioning subjects with ease. Users can tap on the object they wish to edit, drag it to move or pinch to resize, and make contextual adjustments to lighting and background. Magic Editor also offers multiple output options for user preference. However, Google acknowledges that the feature is in its early stages and may not always produce the desired results but hopes to improve it with user feedback and technological advancements.
The real-world testing of Magic Editor is imminent, and Google Photos users, especially those on the newer Pixel devices, will have the opportunity to try it out. With 1.7 billion photo edits made by Google Photos users monthly, the potential for learning and improvement is significant. This feature is part of a broader set of AI-powered photo-editing tools for the Pixel 8 and 8 Pro, including Best Take, Zoom Enhance, and enhancements to Magic Eraser, which will be available on Pixel 8 devices starting October 12.
Yuuma
yuuma at 2023年10月16日 10:00:00
- 2023年10月04日
- AI
Bing Chat に DALL-E 3 が搭載
Edgeブラウザ上などで利用できるBing ChatにDALL-E 3が搭載されたようです。
もともDALL-E 2を利用した画像生成は可能でした。
DALL-E 3に変わったことでプロンプトを細かく設定することができ、
より写実的な画像を出力可能になったそうです。
Bing Chatの入力欄に「〜の絵を描いてください」と入力することで
イラストを出力してくれました。
出力された画像の例を見る限りではありますが、
画像生成AIでよく発生する指の破綻が少ないように思えます。
水曜担当:Tanaka
tanaka at 2023年10月04日 10:00:00
Free Alternatives to GPT-4
Today, I would like to share about 3 free alternatives to GPT-4. Let’s take a look.
LlaMA 2
LlaMA 2 is a cutting-edge open-source large language model developed by Meta AI. It’s available for commercial use and comes with pre-trained models, fine-tuned models, and code resources. You can find all these assets on HuggingFace. You can also get a feel for how well the model performs by trying it out on HuggingChat. By making LlaMA 2 openly accessible, Meta AI is empowering researchers and developers to create innovative applications that leverage advanced language capabilities.
PaLM 2
Google AI’s PaLM 2 is Google’s latest large language model, excelling in advanced reasoning tasks like coding, mathematics, classification, question answering, translation, multilingual proficiency, and natural language generation. It surpasses previous state-of-the-art large language models like the original PaLM due to its optimized compute-scaling approach, improved dataset mixture, and architectural enhancements. You can access PaLM 2 for free using Bard. While there’s still room for improvement compared to GPT-4 in terms of quality and performance, it offers impressive capabilities.
Claude 2
Claude 2 represents the latest version of Anthropic’s conversational AI assistant. It offers improved performance, longer responses, and can be accessed through an API as well as a new public beta website, claude.ai. Developers at Anthropic have focused on enhancing its capabilities in areas such as coding, mathematics, and logical reasoning compared to earlier Claude versions. For instance, Claude 2 recently achieved a score of 76.5% on the multiple-choice section of the Bar exam, a significant improvement from Claude 1.3’s 73.0%. You can access various Claude models on Poe and experience their performance firsthand.
Conclusion
Despite GPT-4 being unavailable to the public, there are other promising open-source large language models emerging as alternatives that are accessible to everyone. While these models may not be as massive as GPT-4, they show that cutting-edge language AI is evolving rapidly and becoming more widely available. These freely accessible models excel in fields such as mathematics, coding, and logical reasoning, making them suitable replacements for various applications.
Asahi
waithaw at 2023年10月03日 10:00:00
- 2023年10月02日
- AI
Amazon launches a generative AI service called Bedrock
Amazon has officially launched Bedrock, a generative AI service that provides access to various AI models, both from Amazon and third-party partners, via an API.
Bedrock empowers AWS customers to build applications using these generative AI models and customize them with their own data. Brands and developers can create AI agents for tasks like travel booking, inventory management, and insurance claims processing. In the near future, Bedrock will incorporate Llama 2, an open-source large language model from Meta, along with models from AI21 Labs, Anthropic, Cohere, and Stability AI.

Amazon claims that Bedrock will be the first fully managed generative AI service to offer Llama 2, including its 13-billion- and 70-billion-parameter versions. While similar to Google’s Vertex AI, Amazon argues that Bedrock’s advantage lies in its seamless integration with existing AWS services, like AWS PrivateLink, ensuring secure connections to a company’s virtual private cloud. This perceived advantage may vary depending on the specific customer and their cloud infrastructure.
Swami Sivasubramanian, VP of data and AI at AWS, highlighted the increasing interest in generative AI, driven by data availability, scalable computing, and machine learning advancements. He emphasized that Bedrock’s launch democratizes generative AI, making it accessible to businesses of all sizes and their employees, from developers to data analysts.
Additionally, Amazon introduced its Titan Embeddings model, a first-party model that converts text into numerical representations known as embeddings, supporting around 25 languages and text segments of up to 8,192 tokens, aligning with OpenAI’s latest embeddings model. These announcements reflect Amazon’s commitment to the rapidly growing generative AI market following its initial challenges with Bedrock’s availability.
Yuuma
yuuma at 2023年10月02日 10:00:00