Amazon TTS, Gemini 1.5 Pro, Sora for Video

AI News Stories

1. OpenAI Systems Aid Hackers

Microsoft and OpenAI report that hacking groups tied to China, Russia, North Korea, and Iran have used OpenAI systems, but primarily for mundane tasks like drafting emails rather than generating novel cyberattacks. Access was cut off once the use was detected. Key takeaways:

  • Groups linked to China, Russia, North Korea, and Iran

  • Systems used for routine tasks like drafting emails rather than developing new attacks

  • Access revoked once use identified

2. Amazon Massive Text-To-Speech Model

Amazon unveils BASE TTS, the largest text-to-speech model to date at 980 million parameters. Trained on 100,000 hours of speech data, BASE TTS delivers more natural-sounding voices and improved articulation of complex sentences. Key takeaways:

  • New record for largest text-to-speech model at 980 million parameters

  • Trained on 100,000 hours of speech data

  • More natural voices and better sentence articulation

Source: Amazon.com

3. Gemini 1.5 Pro Pushes Boundaries

Google announces Gemini 1.5 Pro, an upgraded version of its Gemini AI model capable of processing significantly more data: up to 1 million tokens of text or 1 hour of video. However, the full capabilities are only available to select developers for now. Key takeaways:

  • Can process up to 1 million tokens of text or 1 hour of video

  • Main upgrade is larger context window

  • Full version limited to private developer preview for now

4. OpenAI Unveils Striking New AI Video Generator Sora

OpenAI reveals Sora, a new AI model that generates high-quality videos up to one minute long from text prompts. The announcement comes with limited details, as OpenAI is sharing Sora privately to assess safety risks before a potential future public release. Key takeaways:

  • Sora produces detailed, photorealistic videos from text

  • Videos up to 60 seconds, a length not seen before

  • Model not released publicly yet due to safety concerns

Top 5 AI Features on Google Pixel 8 Phone

In this video, Mark examines the top AI features on the Google Pixel 8 and discusses their practical applications. The top 5 AI features are:

  • Magic Photo Eraser

  • Audio Magic Eraser

  • Gemini Assistant

  • Gboard's Proofread

  • Call Screening

AI Tools

  • Easy-Peasy AI uses GPT-4 and GPT-3.5 to create high-quality content 10x faster across platforms like social media, blogs, marketing, and more

  • An empathetic AI chatbot with advanced features like public URL fetching, file uploads, diverse prompts, and multiple personas

  • Easy-Peasy AI also offers image generation, audio transcription, and text-to-speech to enhance your projects

  • Forefront AI lets you customize AI models using pre-trained large language models tailored to your needs. Forget deprecated models and inconsistent performance

  • Access GPT-4 for free on Forefront AI. Easily explore personas, generate images, fine-tune models, and access GPT-3.5

  • Forefront AI enables easy integration and hosting - export models and run anywhere. Get serverless endpoints for chatbots

  • Jenni AI is a trusted writing assistant that helps you write, cite, and edit academic papers and essays

  • Jenni provides real-time autocomplete and accurately formats in-text citations in all major styles

  • Jenni enhances research with paraphrasing, content generation from files, PDF chatting, and formatted draft exports

  • Reclaim optimizes scheduling for teams to improve productivity, collaboration, and work-life balance with features like priority links, auto 1:1s, and no-meeting days

  • Reclaim manages meetings efficiently by auto-scheduling them at the best times based on team priorities, and it syncs with top tools

  • Reclaim prevents burnout and overload with protection from forced overtime, notifications, and back-to-backs while analyzing time usage