This is a post specifically for tonight attendees at the AI Superpowers Unlocked panel taking place tonight (Wednesday, May 15, 2024). I’ll expand this post later.
I decided to try out ChatGPT’s newly-released GPT-4o model (the lowercase “o” stands for “omni”) by uploading images and asking it to describe them to me.
After this quick test, my impression is simply “wow.”
Image 1: Me and Steven Bosco playing together at the Anaconda party at PyCon US 2022
Me: What’s happening in this picture?
GPT-4o: The picture shows two musicians performing on stage. One musician is playing an accordion, and the other is playing an acoustic guitar and singing into a microphone. The guitarist is wearing a cowboy hat. In the background, there is a banner with the name “Steven Bosco” and a QR code for tips via Venmo and CashApp. The setting appears to be an indoor venue with a rustic, cozy ambiance. There is a small audience in the foreground, and the atmosphere looks lively and engaging.
Not bad. I was impressed that ChatGPT was able to determine that the banner read “Steven Bosco” even though my head clearly blocks the “B” — it would have had to determine the “Bosco” from the tip sign. I was also impressed by its description of the “feel” of the room.
Image 2: A comic I drew as a storyboard for an app about authentication and authorization
Me: What is being depicted in this image?
GPT-4o: The image is a two-panel cartoon.
Panel 1:
The setting is a ship’s engine room filled with various controls and equipment.
There is a large machine, a computer, and a control panel with levers.
The text reads: “A ship’s engine room is full of important controls! If you don’t manage access to these controls, things can go very wrong, very quickly!”
Panel 2:
The scene shows a cat wearing a chef’s hat and holding a spoon, standing outside a door labeled “ENGINE ROOM.”
Next to the door is a sign that reads: “CHEF ACCESS DENIED.”
The cat is saying, “But I wanted to play with the big red button!”
The text reads: “For safety’s sake, the ship’s fine-grained authorization allows only engineers and the command crew inside the engine room.”
The cartoon humorously illustrates the importance of restricting access to critical areas on a ship, such as the engine room, to prevent unauthorized personnel from tampering with important controls.
In case you were wondering, I wrote about the comic in this article.
I wanted to see how well GPT-4o does with handwriting. I’m told that my penmanship is unusually readable; results with other people’s handwriting are bound to vary.
I was impressed by how it identified the image not just as a comic, but one with two panels. Even more impressive: that ChatGPT got the general gist of the comic, right down to its use of humor to illustrate its point.
Image 3: My notes from my computer networking class at Crazy Go Nuts University
Me: What’s this image about?
GPT-4o: The image is a detailed handwritten study guide about the OSI (Open Systems Interconnection) model, often referred to as the OSI seven-layer model or OSI seven-layer cake. The OSI model is used to standardize networking functions to support interoperability among different products and software.
The guide includes the following:
Left Side:
An explanation of OSI layers and their function.
“Provide service to the layer higher.”
“Relies on next lower layer to perform.”
A diagram showing the communication between two systems, with peer protocols between corresponding layers.
A reminder: “Peer layers must provide the same functions, but they need not provide them in the same way.”
Right Side:
A detailed description of each of the seven OSI layers:
Physical Layer: Deals with the physical connection between devices, transmission of raw bit streams over a physical medium.
Data Link Layer: Ensures error-free transmission between two directly connected nodes, managing frame synchronization, error control, and flow control.
Network Layer: Manages data routing, forwarding, addressing, and packet switching.
Transport Layer: Provides reliable data transfer services to the upper layers, including flow control, error correction, and segmentation.
Session Layer: Manages sessions between applications, providing services like establishment, maintenance, and termination of connections.
Presentation Layer: Translates data between the application layer and the network, handling data encryption, decryption, and compression.
Application Layer: Provides network services directly to end-user applications, including email, file transfer, and network management.
A final note: “Remember: Standards must define the functions & services to be provided by a layer but say nothing about how that’s to be done.”
This guide is likely intended for students or professionals studying networking concepts, providing a clear and concise overview of the OSI model and its layers.
The challenge with this one was the low resolution of the photo. I took it with my late ’90s-era camera with its potato-fidelity 1024 * 768 resolution (and that was its “high-quality” mode — normal mode was 640 * 480!).
Image 4: Another comic I drew as a storyboard for that app about authentication and authorization
Me: Describe this image, please.
GPT-4o: This image is a four-panel cartoon strip that humorously depicts a situation involving an alien, a disguise kit, and a spaceship. Here’s a description of each panel:
Panel 1:
A character with antennae and a space uniform is holding a box labeled “PLATYPUS DISGUISE KIT.”
The character is thinking, “Let’s see if this works!”
Panel 2:
The same character, now disguised with a platypus head, is returning to a spaceship.
The character says, “It is I, Captain Platypus, returning to the ship!”
Another character on the ship welcomes him, saying, “Welcome back, Captain! Come aboard.”
In the background, someone is seen thinking, “Aw, yeah—free ship!”
Panel 3:
The disguised character is now onboard the ship.
The text reads: “Once he boards the ship…”
The disguised character says, “Nice ship. I think I’ll take it.”
Another character on the ship is startled and says, “eep!”
Panel 4:
The real captain, presumably Captain Platypus, returns and is shocked by the situation.
He exclaims, “I was gone only five minutes… WHAT HAPPENED?”
Another character, who appears to be tied up, responds, “I think we need an anomaly detection system.”
The cartoon humorously highlights the importance of security and the potential consequences of not having proper identification and detection systems in place on a spaceship.
One of the challenges with this image is that it’s a four-panel comic, but there are no lines separating the panels — just white space. Following only the circled numbers specifying the order in which the comic should be read, GPT-4o “got” the comic, including the fact that the character in panel 1 was impersonating Captain Platypus.
GPT-4o got one thing wrong — it thought that the “Aw, yeah — free ship!” line is from panel 2; it’s the alien stealing the ship in panel 4.
In addition to updating the text of the article, I also created some explainer graphics to liven it up and save the reader from being hit with just a wall of text. Those graphics are what you see in this article — enjoy!
If you were to time travel and visit Crazy Go Nuts University during my student days, you’d find that the thing I was known for wasn’t programming or playing the accordion, but drawing comics.
The web came around at the very end of my long and colorful academic career, so my comics mostly appeared in student newspapers — primarily Golden Words, a satire newspaper in the same vein as the original print version of The Onion, as well as the main student newspaper, The Queen’s Journal.
I make the occasional comic every now and again these days, and when Dan Arias, a former coworker at Auth0, found out about it, he asked me to draw some comics as a way to “storyboard” some screens for an app for the 2023 Oktane conference.
The comics were supposed to showcase some features of Auth0’s customer identity management system, and if possible, do so in a humorous way. They also had to use some animal mascots that had been created for the project: a platypus, a rabbit, a capybara, and a boar.
I recently found the sketchbook with the comics I made for the app. They never went into the app — they were just storyboards for the app’s artist, Sofía Prósper Díaz-Mor, to use as guides, and the final versions that appeared in the app looked fantastic.
Still, there’s a rough charm to my doodles, so I thought I’d post them here. Perhaps it’s time for me to make more posts as comics…
Fine-grained authorization and the big red button
The app had a space theme, so all the comics featured our animal characters — once again, a platypus, a rabbit, a capybara, and a boar — as characters having science fiction adventures that also featured some aspect of digital identity.
This comic was about fine-grained authorization, which is a fancy way of saying “very specific control over who’s allowed to do what in a system”…
Authentication needs anomaly detection
This comic was the storyboard for a story about anomaly detection, which attempts to detect logins that have a suspicious quality to them. I did this by having an alien disguise themself as the ship’s commanding officer, Captain Platypus, and board the ship…
Single sign-on and the planet of a thousand apps
“The planet of a thousand apps” was the setting for this comic about single sign-on. The idea was every activity on the planet was controlled by its own app, which meant that you either had to log into a different app to do anything, or you could use single sign-on…
The power of the passkey
To illustrate the security advantages of passkeys, I came up with this comic. It shows that with a passkey, you don’t have to memorize a password, and even if a hacker manages to break into the server, all it has is the passkey’s public key, which (as its name implies) is known to everyone…
Decentralized identity: A new hope
“Make Star Wars without getting us into legal trouble,” they said, and this is the resulting comic. It features our rabbit character as “Bun Solo” and our capybara as “Capybacca.” In this rough sketch comic, they destroy the centralized identity database, the Data Star, freeing the citizens of the galaxy to use decentralized identities. In the second page, I show the uses for them…
More to come…
Watch this space — I think it’s time to do more comic-style blog posts here on Global Nerdy!
Here’s the “official unofficial” list of tech, entrepreneur, and nerd events for Tampa Bay and surrounding areas for Monday, May 13 through Sunday, May 19, 2024.
How do I put this list together? It’s largely automated. I have a collection of Python scripts in a Jupyter Notebook that scrapes Meetup and Eventbrite for events in categories that I consider to be “tech,” “entrepreneur,” and “nerd.” The result is a checklist that I review. I make judgment calls and uncheck any items that I don’t think fit on this list.
In addition to events that my scripts find, I also manually add events when their organizers contact me with their details.
What goes into this list? I prefer to cast a wide net, so the list includes events that would be of interest to techies, nerds, and entrepreneurs. It includes (but isn’t limited to) events that fall under any of these categories:
Programming, DevOps, systems administration, and testing
Tech project management / agile processes
Video, board, and role-playing games
Book, philosophy, and discussion clubs
Tech, business, and entrepreneur networking events
Toastmasters and other events related to improving your presentation and public speaking skills, because nerds really need to up their presentation game
Sci-fi, fantasy, and other genre fandoms
Self-improvement, especially of the sort that appeals to techies
I’m being asked by a growing number of people this one question: “I want to get started learning and building AI apps with Python. Should I go with the Python.org distribution, Anaconda, or something else?”
For anyone who wants to learn Python and AI development, my answer is invariably:
Their one installer gets you a LOT of stuff: Not just Python, but also the R programming language, the pip and conda package managers, Jupyter Notebook and Jupyter Lab, and a ridiculous number of installed and optimized Python packages ready to go.
Anaconda is made by people into scientific computing, data science, and machine learning, for people into scientific computing, data science, and machine learning. You have to do way let setup after installing it to get started on your AI journey.
Anaconda is proven. Nearly 40 million people use it, and so do 90% of the Fortune 500.
There’s also Anaconda Cloud, which features an online notebook service (an online interactive coding environment that also makes it easy to share code), articles and online courses, and a big user community.
Anaconda delivers the same great Python programming experience on Linux, macOS, and Windows. If you move between operating systems like I do, or teach Python courses, this is incredibly handy.
Once again — if you’re learning Python for AI development:
ℹ️ Disclaimer: This is NOT a paid promo for Anaconda. I met some of their people at PyCon, but the company has no idea who I am or that I’m doing this, and they certainly don’t know how to put money in my bank account. I’m just a satisfied user who uses it for my own projects and for teaching Python and AI programming.