On the upcoming WWDC, Apple is predicted to announce an on-device massive language mannequin (LLM). The following model of the iOS SDK will possible make it simpler for builders to combine AI options into their apps. Whereas we await Apple’s debut of its personal Generative AI fashions, firms like OpenAI and Google already present SDKs for iOS builders to include AI options into cell apps. On this tutorial, we are going to discover Google Gemini, previously referred to as Bard, and show how one can use its API to construct a easy SwiftUI app.
We’re set to construct a Q&A app that makes use of the Gemini API. The app encompasses a simple UI with a textual content discipline for customers to enter their questions. Behind the scenes, we are going to ship the consumer’s query to Google Gemini to retrieve the reply.
Please observe that it’s a must to use Xcode 15 (or up) to observe this tutorial.
Getting Began with Google Gemini APIs
Assuming that you simply haven’t labored with Gemini, the very very first thing is to go as much as get an API key for utilizing the Gemini APIs. To create one, you possibly can go as much as Google AI Studio and click on the Create API key button.
Utilizing Gemini APIs in Swift Apps
It is best to now have created the API key. We’ll use this in our Xcode venture. Open Xcode and create a brand new SwiftUI venture, which I’ll name GeminiDemo. To retailer the API key, create a property file named GeneratedAI-Data.plist
. On this file, create a key named API_KEY
and enter your API key as the worth.
To learn the API key from the property file, create one other Swift file named APIKey.swift
. Add the next code to this file:
guard let filePath = Bundle.major.path(forResource: “GenerativeAI-Data”, ofType: “plist”)
else {
fatalError(“Could not discover file ‘GenerativeAI-Data.plist’.”)
}
let plist = NSDictionary(contentsOfFile: filePath)
guard let worth = plist?.object(forKey: “API_KEY”) as? String else {
fatalError(“Could not discover key ‘API_KEY’ in ‘GenerativeAI-Data.plist’.”)
}
if worth.begins(with: “_”) {
fatalError(
“Comply with the directions at https://ai.google.dev/tutorials/setup to get an API key.”
)
}
return worth
}
}
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
enum APIKey { // Fetch the API key from `GenerativeAI-Data.plist` static var `default`: String {
guard let filePath = Bundle.major.path(forResource: “GenerativeAI-Data”, ofType: “plist”) else { fatalError(“Could not discover file ‘GenerativeAI-Data.plist’.”) }
let plist = NSDictionary(contentsOfFile: filePath)
guard let worth = plist?.object(forKey: “API_KEY”) as? String else { fatalError(“Could not discover key ‘API_KEY’ in ‘GenerativeAI-Data.plist’.”) }
if worth.begins(with: “_”) { fatalError( “Comply with the directions at https://ai.google.dev/tutorials/setup to get an API key.” ) }
return worth } } |
Should you resolve to make use of a unique identify for the property file as a substitute of the unique ‘GenerativeAI-Data.plist’, you will want to change the code in your ‘APIKey.swift’ file. This modification is important as a result of the code references the particular filename when fetching the API key. So, any change within the property file identify must be mirrored within the code to make sure the profitable retrieval of the API key.
Including the SDK Utilizing Swift Bundle
The Google Gemini SDK is well accessible as a Swift Bundle, making it easy so as to add to your Xcode venture. To do that, right-click the venture folder within the venture navigator and choose Add Bundle Dependencies. Within the dialog, enter the next package deal URL:
https://github.com/google/generative-ai-swift |
You may then click on on the Add Bundle button to obtain and incorporate the GoogleGenerativeAI
package deal into the venture.
Constructing the App UI
Let’s begin with the UI. It’s simple, with solely a textual content discipline for consumer enter and a label to show responses from Google Gemini.
Open ContentView.swift
and declare the next properties:
@State non-public var isThinking = false
@State non-public var textInput = “” @State non-public var response: LocalizedStringKey = “Hey! How can I aid you at this time?”
@State non-public var isThinking = false |
The textInput
variable is used to seize consumer enter from the textual content discipline. The response
variable shows the API’s returned response. Given the API’s response time, we embrace an isThinking
variable to watch the standing and present animated results.
For the physique
variable, substitute it with the next code to create the consumer interface:
ScrollView {
VStack {
Textual content(response)
.font(.system(.title, design: .rounded, weight: .medium))
.opacity(isThinking ? 0.2 : 1.0)
}
}
.contentMargins(.horizontal, 15, for: .scrollContent)
Spacer()
HStack {
TextField(“Kind your message right here”, textual content: $textInput)
.textFieldStyle(.plain)
.padding()
.background(Shade(.systemGray6))
.clipShape(RoundedRectangle(cornerRadius: 20))
}
.padding(.horizontal)
}
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
VStack(alignment: .main) {
ScrollView { VStack { Textual content(response) .font(.system(.title, design: .rounded, weight: .medium)) .opacity(isThinking ? 0.2 : 1.0) } } .contentMargins(.horizontal, 15, for: .scrollContent)
Spacer()
HStack {
TextField(“Kind your message right here”, textual content: $textInput) .textFieldStyle(.plain) .padding() .background(Shade(.systemGray6)) .clipShape(RoundedRectangle(cornerRadius: 20))
} .padding(.horizontal) } |
The code is sort of simple, particularly when you have some expertise with SwiftUI. After making the modifications, you must see the next consumer interface within the preview.
Integrating with Google Gemini
Earlier than you need to use the Google Gemini APIs, you first have to import the GoogleGenerativeAI
module:
import GoogleGenerativeAI |
Subsequent, declare a mannequin
variable and initialize the Generative mannequin like this:
let mannequin = GenerativeModel(identify: “gemini-pro”, apiKey: APIKey.default) |
Right here, we make the most of the gemini-pro
mannequin, which is particularly designed to generate textual content from textual content enter.
To ship the textual content to Google Gemini, let’s create a brand new perform known as sendMessage()
:
withAnimation(.easeInOut(length: 0.6).repeatForever(autoreverses: true)) {
isThinking.toggle()
}
Job {
do {
let generatedResponse = attempt await mannequin.generateContent(textInput)
guard let textual content = generatedResponse.textual content else {
textInput = “Sorry, Gemini bought some issues.nPlease attempt once more later.”
return
}
textInput = “”
response = LocalizedStringKey(textual content)
isThinking.toggle()
} catch {
response = “One thing went unsuitable!n(error.localizedDescription)”
}
}
}
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 |
func sendMessage() { response = “Pondering…”
withAnimation(.easeInOut(length: 0.6).repeatForever(autoreverses: true)) { isThinking.toggle() }
Job { do { let generatedResponse = attempt await mannequin.generateContent(textInput)
guard let textual content = generatedResponse.textual content else { textInput = “Sorry, Gemini bought some issues.nPlease attempt once more later.” return }
textInput = “” response = LocalizedStringKey(textual content)
isThinking.toggle() } catch { response = “One thing went unsuitable!n(error.localizedDescription)“ } } } |
As you possibly can see from the code above, you solely have to name the generateContent technique of the mannequin to enter textual content and obtain the generated response. The result’s in Markdown format, so we use LocalizedStringKey
to wrap the returned textual content.
To name the sendMessage()
perform, replace the TextField
view and fix the onSubmit
modifier to it:
TextField(“Kind your message right here”, textual content: $textInput) .textFieldStyle(.plain) .padding() .background(Shade(.systemGray6)) .clipShape(RoundedRectangle(cornerRadius: 20)) .onSubmit { sendMessage() } |
On this state of affairs, when the consumer finishes inputting the textual content and presses the return key, the sendMessage()
perform is known as to submit the textual content to Google Gemini.
That’s it! Now you can run the app in a simulator or execute it straight within the preview to check the AI function.
Abstract
This tutorial reveals how one can combine Google Gemini AI right into a SwiftUI app. It solely requires just a few traces of code to allow your app with Generative AI options. On this demo, we use the gemini-pro
mannequin to generate textual content from text-only enter.
Nonetheless, the capabilities of Gemini AI usually are not simply restricted to text-based enter. Gemini additionally gives a multimodal mannequin named gemini-pro-vision
that permits builders to enter each textual content and pictures. We encourage you to take full benefit of this tutorial by modifying the offered code and experimenting with it.
If in case you have any questions concerning the tutorial, please let me know by leaving a remark under.