The Complete Guide to Google Gemini: Every Feature Explained (2026)

Gemini has more built into it than you think. This is the full tour: every feature, what's worth your time, and what you can skip.

Mar 29, 2026

If you’ve recently moved away from ChatGPT, chances are you’ve landed on Claude or Gemini. I already wrote a complete guide on making the most of Claude. Now it's Gemini's turn.

Even though Claude is still my daily driver, there are things I only go to Gemini for because it does them better than any other LLM or because they’re only available in Gemini.

Image generation. Video. Web search and deep research. And brainstorming, where Gemini pulls from a wider range of sources on its own, while Claude works best with what you give it.

And if you've been reading this newsletter for a while, you already know I'm hooked on Google's tools.

I've written about them a lot: an overview of every AI tool Google has built, tools I built in Google AI Studio, how I use NotebookLM for research and writing, how I automated a lot of my work with Google Apps Script, prompts for using Nano Banana, and pieces on using NotebookLM together with Gemini and integrating it with Claude Code.

So this was the missing piece.

A complete guide on Gemini: what every feature does, how to make the most of it, when each one is useful, and some of my favorite ways to combine its tools.

Upgrade to Premium

Here’s what we’re covering:

How to set up Google Gemini (plans, pricing, family sharing, custom instructions, importing from ChatGPT, connected apps)
Every feature inside Gemini’s chat (models, uploads, Deep Research, image generation, video creation, music, Canvas, Guided Learning, Gemini Live, Gems, Scheduled Actions)
5 ways to combine Gemini’s tools (real workflows)
Where Gemini falls short (honest take)

Quick note on pricing

Before we begin, a quick note: I recently updated the Premium subscription for AI Blew My Mind from $8/month to $12/month (and from $70/year to $100/year).

If you’ve been thinking about upgrading, I created a 30% discount code that lets you lock in the old rate forever. It’s valid until April 7 (included). Starting April 8, it’s the new price. If you’re already a paid subscriber, nothing changes. Your rate stays the same and you get all new perks automatically.

Upgrade now to lock in the price

Now let’s get into it.

How to set up Google Gemini

Before we get into what Gemini can do, let's get it set up properly. Access, your plan, getting Gemini up to speed on who you are and what you do, and bringing over your context if you're switching from another AI.

Picking your plan (and the perk nobody talks about)

Gemini has a free tier and three paid plans. (Google recently added Plus as a smaller entry point, similar to what ChatGPT did with its Go plan.)

Now here’s the part that makes Google’s pricing hard to beat:

Any paid plan lets you share it with up to five family members, where everyone gets their own individual access to the full plan. You share the storage, but that’s it.

I don't know any other AI platform that offers this. If you have a partner, kids, or parents who are curious about AI, two minutes of setup and five people are in, for the same price.

Here’s how: go to gemini.google.com → Settings → Manage subscription. That takes you to Google One. Click Manage family settings, toggle on Share Google One with family, click Manage family settings again, and invite family members by email.

Google One Manage membership page showing the Manage family settings section expanded with the Share Google One with family toggle turned on and the Manage button highlighted

How to customize Gemini to know you

Three things to set up before you go any further. All of them help Gemini understand who you are and what you do, so its answers stop being generic and start being relevant to you.

Step 1. Import your ChatGPT or Claude history into Gemini

If you’re switching to Gemini from ChatGPT, Google just launched Import Memory to Gemini, which lets you upload your full chat history from ChatGPT or Claude as a ZIP file.

Gemini Import memory to Gemini page showing two options: a two-step Add Memory flow where you copy a prompt into your other AI provider and paste the response back, and an Import Chats section for uploading a ZIP file up to 5GB with a previous import from February 2026 visible

The catch: that’s everything. Every random question, every throwaway conversation. It’s a lot of noise.

If you want something cleaner and you care about only sharing with Google what's relevant, I built the AI Migrator for exactly this.

AI Migrator by AI Blew My Mind landing page showing the headline Leave ChatGPT Without Losing Everything with Start Migrating and Learn How buttons, and a banner noting 700,000+ users are leaving ChatGPT

Instead of importing everything, it processes your conversation history and extracts a structured profile: who you are, what you're working on, your goals, your communication preferences, your projects, your patterns. Only what helps Gemini be useful to you and your work. You paste it into Gemini's instructions or memory, and it knows you from the first conversation.

Read my step-by-step guide on migrating.

One more thing worth knowing: unlike ChatGPT and Claude, Gemini doesn’t let you export your chat history at the moment. You can import in, but you can’t export out.

Step 2. Set up custom instructions in Gemini

Gemini Settings page showing the Instructions for Gemini section with custom preferences filled in, the Add button highlighted, and the left sidebar showing Connected apps, Scheduled actions, and other settings options

Go to Settings → Instructions for Gemini → click Add.

This is where you tell Gemini who you are, what you do, how you like responses. Think of it like setting preferences once so you don’t have to repeat yourself in every conversation. Tell it your role, your industry, how detailed you want answers, what tone works for you. Gemini will apply this across all your chats.

If you used the AI Migrator, your communication preferences are already extracted and you can paste them directly in here.

Step 3: Connect your apps and turn on Personal Intelligence

Gemini Settings Connected apps page showing Google Workspace toggled on with Gmail, Google Calendar, Google Docs, Google Drive, Google Keep, and Google Tasks available, and YouTube Music toggled on below. This section may appear as Personal Intelligence in some accounts.

Go to Settings → Connected Apps

Here you can give Gemini access to your Google tools. For me right now, that's YouTube Music and Google Workspace and it helps me easily ask Gemini to find an email, check my calendar, or pull up a document.

Some accounts will see a broader version called Personal Intelligence, which also connects to Google Search and Google Photos. That’s where it gets interesting: Gemini can find a bag that matches shoes you bought last week, or troubleshoot your fridge by pulling the model number from a receipt in your email.

I don’t have full access yet. It’s rolling out gradually, starting with the U.S. If you have it, you’ll find it in your settings. Fully opt-in, off by default, and Google says Gemini doesn’t train on your Gmail inbox or Photos library.

Upgrade before April 8

Every feature inside Gemini's chat

You’re set up, Gemini knows who you are, and you’re ready to go. Now let’s go through everything inside the chat, because there’s a lot more in there than what you see when you open it.

To access Gemini, go to gemini.google.com from your browser or download the mobile app.

Annotated Gemini chat interface showing the prompt bar with labeled arrows pointing to Add files, Choose tools, Pick a model, and Dictate buttons, with quick action chips for Create image, Write anything, Help me learn, Create a video, and Boost my day below

Choose the right Gemini model (Flash vs Thinking vs Pro)

When you open Gemini, you can pick which version of the model runs your request. It comes in three versions, broken down by “thinking” styles:

Gemini 3 Flash: The high-speed “workhorse”. It’s 3x faster than previous models and is the default for quick tasks in Search and Gmail.
Gemini 3 Thinking: The specialist for PhD-level logic. It uses “thinking tokens” to show its work, making it the choice for complex math, scientific research, and deep coding.
Gemini 3 Pro: The flagship for complex reasoning.

When you're picking which model to use, the rule is simple: the harder the problem, the more powerful the model you want behind it.

What you can upload and connect in Gemini

Gemini file upload dropdown menu showing five options: Upload files, Add from Drive, Photos, Import code, and NotebookLM

You can bring a lot into a Gemini conversation:

Direct file uploads. PDFs, images, documents, spreadsheets, presentations, even video files. It can watch, timestamp, and summarize footage, which most other models still can't do.
Google Drive integration. Connect your Drive and Gemini can reference your cloud-stored files directly. No downloading and re-uploading.
Google Photos integration. Pull images straight from your photo library.
Code import. Bring in entire scripts or repositories. Gemini’s massive context window means it can analyze complex codebases to find bugs or suggest refactors.
NotebookLM integration. You can now pull your notebooks directly into Gemini as sources. As many as you want. All that research and synthesized thinking you built is now part of your conversation.
@ commands for connected apps. Type @ in the chat and you’ll see the apps you’ve enabled. You can ask Gemini to find an email, summarize a document from Drive, or play a song, all from within the conversation.

Two Gemini chat examples side by side showing @ commands in action: on the left, @Gmail pulling flight details to Rome including flight number, route, and departure time; on the right, @YouTube Music returning a playlist of top 10 80s songs

Gemini's built-in tools: what each one does

Gemini Tools dropdown menu showing six built-in tools: Create image, Canvas, Deep Research, Create video, Create music, and Guided Learning

Google has built more tools directly into Gemini than any other AI platform right now. Here's what each one does and when it's worth using.

Deep Research: how Gemini's AI Research Agent Works

Gemini Deep Research activated with a prompt about the AI Competency Gap, showing the source picker with Search, Gmail, Drive, and Chat all selected as research sources

Whenever you need to go deep on a topic, activate this. Gemini browses the web, reads multiple sources, synthesizes everything, and comes back with a comprehensive, cited report. You can go make coffee while it works.

And it’s no longer limited to the web. You can now have it research across your own sources too: Gmail, Drive, even Google Meet chats. So instead of just “what does the internet say about X”, you can ask “what have we discussed about X in our team emails and documents”.

GIF showing Gemini Deep Research generating a comprehensive sourced report with structured sections, citations, and linked references

Image generation with Nano Banana (Gemini's built-in image editor)

Gemini generating a flat-design infographic about AI Hallucinations using Nano Banana Pro, with the Image tool selected in the prompt bar and the completed infographic showing causes, types, and mitigation strategies

My favorite image generation model right now. You can create images from scratch, edit existing ones with natural language (”remove the text” or “change the lighting”), or keep the same character consistent across different scenes.

I wrote this review for AI Supremacy with dozens of examples when it first came out, and in this article I shared 58 prompts for using Nano Banana 2 to create infographics.

Video Creation with Veo (up to 4K, with audio)

Gemini generating a cinematic video with Veo from a detailed prompt describing a person coding in a cozy rainy apartment, with the Video tool selected in the prompt bar and the finished clip showing moody lighting and plants

Gemini’s native video engine and, in my opinion, the best one available right now. You can generate cinematic, high-definition video clips (up to 4K) directly in the chat. It handles native audio generation too, meaning your videos come with synchronized sound effects or dialogue.

Something worth knowing: Gemini is the only major AI tool that can also analyze videos. You can drop a video into the chat and ask it to extract specific things from it. If you’re a marketer, you could upload a competitor’s ad and ask Gemini to break down the messaging, the visual structure, the pacing. If you’re a content creator, you could upload your own video and ask for feedback on structure and delivery.

Music creation with Lyria 3 Pro

Gemini generating a custom outlaw country birthday song called "Cracked Ceramic" after a user uploads a photo and asks for a birthday roast, showing the built-in music creation tool.

0:00

-2:44

You can create custom music tracks directly inside Gemini. The latest model, Lyria 3 Pro, lets you compose tracks up to 3 minutes long, and paid subscribers can also turn photos or ideas into songs with lyrics.

I don’t find much difference in quality between Google’s music generation and dedicated tools like Suno or Mureka. You have fewer customization options in Gemini, but the quality is solid. And it’s included in your subscription (or free within limits), which means one less tool to pay for separately.

Canvas: Gemini's side-by-side writing and coding workspace

Gemini Guided Learning creating an interactive quiz on AI Hallucinations from a NotebookLM source, showing a multiple-choice question with four options, a progress bar at 1 of 10, and the Canvas tool selected in the prompt bar

Your interactive workspace inside Gemini. It opens a side-by-side editor where you can work on something and refine it with AI, all in one view. Use it for:

Writing. Draft long-form content, highlight sections to rewrite, change tone, ask for edits. Close the chat, come back later, keep going.
Building. Prototype small apps, tools, and games from a single prompt. I tested this when I built 12 games and apps with it, and Gemini in Canvas delivers the most complete result on the first try compared to ChatGPT and Claude.

Guided Learning: turn Gemini into a tutor

Two Gemini Guided Learning screens side by side: on the left, Gemini breaking down AI data centers into four learning areas and asking which to explore first; on the right, a detailed lesson on cooling methods with a Socratic method question highlighted at the bottom asking the user to think through why liquid cooling is more effective

Instead of just giving you the answer, this mode turns Gemini into a tutor.

It asks you questions, walks you through quizzes with hints, and explains what you got wrong. You can create quizzes from documents you upload or just ask Gemini to quiz you on any topic.

Useful for more than studying. Preparing for a client meeting on unfamiliar territory, onboarding into a new industry, or just testing yourself after a Deep Research session.

Gemini Live: Voice conversations, camera, and screen sharing

Gemini mobile app home screen showing quick-start options for image, music, writing, learning, and video creation, with a purple arrow pointing to the Gemini Live voice conversation button and the active Live session interface beside it.

This is Gemini’s voice mode, available on the mobile app in 45+ languages and over 150 countries. Open the app, tap the Gemini Live icon, and start talking.

What makes it more than a voice assistant:

Share your camera. Point your phone at something and ask about it. A plant, a menu in a foreign language, storage ideas for a corner of your apartment, step-by-step help fixing your coffee machine.
Share your screen. Get real-time help with whatever’s on your screen. Picking photos for a post, navigating settings, getting a second opinion on something you’re about to buy.
Brainstorm out loud. Talk through ideas while walking, driving, or doing something else. You can change your mind mid-sentence, interrupt, switch topics.

It also connects to Google Maps, Calendar, Tasks, and Keep, so you can plan, organize, and stay on top of things hands-free.

Upgrade before April 8

What you can do with any Gemini answer (export, fact-check, listen)

Gemini three-dot More menu expanded showing Listen, Export to Docs, Draft in Gmail, and Report legal issue options

When you click the three dots on any Gemini answer (the “More” menu), you get a few options worth knowing about:

Export answers to Docs, Gmail, or Sheets

Everything Gemini creates can go directly to Google Docs, Gmail as a draft, or Google Sheets for tables. No copy-pasting between windows.

Fact-check any Gemini response

This one's underrated (and underused). One click and Gemini goes back and fact-checks itself. It uses web search to verify its own response and highlights what checks out and what doesn't. Any AI model hallucinates, and Google basically built a "trust but verify" button right into the interface.

Two Gemini screens side by side: on the left, the three-dot menu showing Double-check response, Listen, Export to Docs, and Draft in Gmail options; on the right, the verification results with green and orange highlights on claims and a legend explaining what each color means

Listen to Gemini's answers out loud

Don't feel like reading? You can have Gemini read its response out loud.

Click the three dots, click Listen. Good for absorbing longer responses while you're doing something else.

How to automate tasks with Gems and Scheduled Actions

Gems let you save a setup once and reuse it. Scheduled Actions let Gemini run tasks for you on autopilot. Here's how both work.

How to create and use Gems in Gemini

Gems are custom AI tools you build inside Gemini for tasks you do repeatedly.

To find them: expand the menu bar in Gemini and click Gems. That takes you to the Gem manager, where you’ll find pre-made Gems from Google, your past Gems (which you can edit, delete, or share with others), and the option to create a new one.

When you create a new Gem, you set up:

Name and description
Instructions for what the Gem should do (your brand voice, a specific process, formatting rules)
A default tool, the same tools we already covered. For example, I have a Gem for creating thumbnails that uses the Image tool by default, and the one below only for research that uses Deep Research.
Knowledge sources: upload files, add from Drive, or connect NotebookLM notebooks

Gemini Gem editor showing a custom Gem called AI news Deep Research with instructions defining a Chief Intelligence Officer role, a NotebookLM project attached as knowledge source, Deep Research set as the default tool, and a preview of the Gem's chat interface on the right

The difference between Gems and custom instructions: custom instructions apply to every conversation you have with Gemini. Gems apply only when you open that specific Gem. You can have as many as you need for different tasks.

Scheduled Actions: Automate recurring tasks in Gemini

Gemini can do things for you on a schedule, without you being in the chat.

Gemini chat showing three scheduled actions created from natural language: a daily unread email summary at 8am, a weekly Friday flight search from Bucharest to Tokyo with pricing table, and a reminder via Google Tasks to fix a bug in the AI Blew My Mind LAB tomorrow at 9am

Scheduled Actions are recurring prompts that run automatically at a set time. You describe what you want in a normal chat, and Gemini creates the action for you.

The cool part is that these work across all your connected Google tools. Ask for a flight search and it uses Google Flights. Ask for an email summary and it pulls from your Gmail. Ask for a calendar briefing and it reads your schedule.

Reminders work through Google Tasks. Ask Gemini to “set a reminder” and it creates a task with a date and time. Simple, but useful when you’re in the middle of something and don’t want to switch apps.

5 ways to combine Gemini's tools (real workflows)

1. Learn from YouTube videos. Drop a video URL in the chat, ask Gemini to extract the key takeaways and frameworks, then switch to Guided Learning to quiz yourself on the material, use Nano Banana to create an infographic that maps how everything connects, or open Canvas to build an interactive mini app that turns the content into a study tool or a flashcard game.

2. Build an end-to-end campaign. Attach a client strategy brief from Drive, connect a NotebookLM notebook with past campaign examples and top-performing hooks, and ask Gemini to write the marketing copy based on all of it. Then generate video assets with Veo right inside the same chat.

3. Turn a research rabbit hole into something usable. Run Deep Research on a topic, export the report to Docs, then come back to Gemini and ask it to turn the key findings into a client-ready summary, and export that in Gmail as a draft, ready for your polish before sending it out.

4. Monitor something on autopilot. Set up a Scheduled Action to research your industry, your competitors, or a specific topic every week. Gemini delivers a sourced report on a schedule. When something interesting comes up, continue the conversation right there and dig deeper.

5. Reverse-engineer what works in your industry. Drop a competitor's video ad into the chat and ask Gemini to break down the messaging, structure, and hooks. Then use that analysis as input for your own campaign: write the copy in Canvas, generate visuals with Nano Banana, create video with Veo.

Where Gemini falls short (honest take)

No tool review from me without the part where I tell you what doesn’t work well.

The safety filters are too aggressive. This is Gemini’s most frustrating limitation. It sometimes refuses to engage with completely reasonable requests because it flags them as sensitive. I’ve had it refuse to generate an image starting from a photo of me, or block a video for the same reason. You end up rephrasing your prompt three times to get an answer.
Long conversations lose coherence. While the context window is massive, Gemini doesn’t always use it well in extended back-and-forth sessions. It starts recommending things you’ve already tried, forgets constraints you set earlier, or loops back to failed approaches. For quick, focused tasks it’s excellent. For long iterative projects, I’ve found it more frustrating than Claude.
No chat export. This one matters if you care about owning your data. You can import into Gemini, but you can’t export your chat history out of it. ChatGPT and Claude both let you export. Gemini keeps your conversations without giving you easy access to them in a portable format.
Writing quality is a step behind. For pure writing tasks, long-form articles, nuanced copywriting, matching a specific voice, Claude is still better in my experience. It can follow your voice easily. Gemini’s writing tends to be more generic, leaning technical or academic, and needs more steering to get personality into it. For research I think it’s the best, and I enjoy using it for brainstorming, but for writing, it’s not there yet.
Workspace integrations from the chat are still limited. You can reference Gmail and Drive through Gemini, but the agentic capabilities, where it actually takes action for you across apps, are still early.

Your turn

Gemini has grown into something far beyond a chatbot. Between the research tools, the creative suite, the personal intelligence layer, and the scheduled actions, there's a lot to explore. And knowing Google, there's much more coming.

What's the one Gemini feature you're going to try first?

AI blew my mind

Discussion about this post

Ready for more?