Preeti

Understanding Ontologies Through Pizza: A Beginner’s Guide (Part 1)

Preeti / November 1, 2025

Understanding Ontologies Through Pizza: A Beginner’s Guide (Part 1) Why This Matters? Ever wondered how AI systems understand that a Margherita is a type of pizza, not just a random word? Or how recommendation engines know which wines pair with which foods? The answer lies in ontologies—the structured knowledge frameworks that help machines understand our world the way humans do. In this series, we’ll build that understanding from scratch using everyone’s favorite example: pizza 🍕 . What Exactly Is an Ontology? Think of an ontology as a sophisticated family tree—but instead of people, it organizes concepts and their relationships. Formal definition: An ontology is an explicit, formal specification of concepts in a domain and the relationships among them. It’s the difference between a computer storing “Margherita” as random text versus understanding it as a specific type of vegetarian pizza with tomato sauce, mozzarella, and basil. Real-World Impact Ontologies power: Medical systems (SNOMED, UMLS) that help doctors share patient information E-commerce platforms (like Amazon’s product categorization) Semantic Web technologies that make information machine-readable Enterprise knowledge graphs that connect business data The global ontology market is projected to reach $2.1 billion by 2028, driven by AI and semantic search needs. The Pizza Ontology: Your Learning Companion We’re using the Pizza Ontology—originally developed for teaching knowledge representation—because it’s: Universally relatable (everyone knows pizza) Sufficiently complex (enough variations to demonstrate key concepts) Beginner-friendly (no specialized domain knowledge required) By the end of this series, you’ll understand how to model any domain from healthcare to finance. Core Building Blocks: Classes, Properties, and Instances 1. Classes: The Categories Classes represent types or categories of things. They answer: “What kinds of things exist in this domain?” In our Pizza Ontology: Pizza ├── VegetarianPizza ├── MeatPizza └── CheeseTopping ├── MozzarellaTopping └── ParmesanTopping Key insight: Classes form hierarchies using “is-a” relationships. A MargheritaPizza is a VegetarianPizza, which is a Pizza. 2. Properties (Slots): The Relationships Properties describe characteristics and connections. They answer: “What can we say about these things?” Pizza properties include: hasTopping (connects pizza to toppings) hasBase (thin crust, thick crust, etc.) hasCalorieContent (nutritional info) 3. Instances: The Real Things Instances are actual examples of classes: Dominos_Margherita_Large (a specific pizza) MozzarellaTopping (a specific cheese type) Class vs Property: The #1 Beginner Mistake ❌ Common error: Making everything a class Should “spicy” be a class or a property? ✅ Correct approach: SpicyTopping is a class (a type of topping) hasSpiceLevel is a property (describes intensity: mild/medium/hot) Rule of thumb: If something has independent existence and you can list examples, it’s likely a class. If it describes or relates other things, it’s a property. Why Structure Matters: Constraints and Validation Ontologies don’t just organize—they enforce meaning: Example: Disjoint Classes MeatTopping ⊥ VegetarianTopping This means no topping can be both meat and vegetarian simultaneously. Try to create “vegetarian pepperoni” and the system catches the contradiction. Example: Cardinality Constraints A pizza must have exactly 1 base A pizza can have 0 or more toppings A Four Cheese pizza must have at least 4 cheese toppings These rules prevent nonsensical data like “baseless pizza” or mislabeled products. Competency Questions: Your Design North Star Before building any ontology, define what questions it should answer: For Pizza Ontology: Which pizzas are strictly vegetarian? Which pizzas have at least two cheeses? What toppings never appear together? Which pizzas are suitable for lactose-intolerant customers? What’s the average calorie content of meat pizzas vs vegetarian pizzas? Your ontology’s structure flows from these questions—not the other way around. Protégé: Your Ontology Workshop We’ll use Protégé, the industry-standard open-source editor developed at Stanford. Think of it as: A visual class hierarchy builder A constraint validator A reasoning engine tester Why Protégé? Free and widely adopted in academia and industry Works with OWL (Web Ontology Language) standard Has powerful reasoners (HermiT, Pellet) that check consistency The Power of Reasoning: Making Machines Think Here’s where ontologies become magical: You define: VegetarianPizza = Pizza AND hasTopping ONLY VegetarianTopping MargheritaPizza = Pizza AND hasTopping {Mozzarella, Tomato, Basil} The reasoner infers: “Since Margherita only has vegetarian toppings, it must be a VegetarianPizza”—even if you never explicitly said so. This automated classification scales to thousands of concepts, catching errors humans would miss. Preview: What’s Next in This Series Part 2: Hands-on: Building the Pizza Ontology in Protégé Part 3: Advanced Modeling: Restrictions, Properties, and Design Patterns Part 4: From Ontology to Knowledge Graph: Loading into Neo4j Part 5: Querying and Reasoning: Making Your Ontology Work Key Takeaways ✅ Ontologies = Shared vocabulary + Formal structure + Machine-interpretable definitions ✅ Three core elements: Classes (types), Properties (relationships), Instances (examples) ✅ Design with purpose: Start with competency questions, not arbitrary categories ✅ Constraints enable validation: Your ontology should catch mistakes, not just organize data ✅ Reasoning unlocks power: Well-designed ontologies make implicit knowledge explicit Your Turn: Quick Exercise Before moving to Part 2, try this thought experiment: Model a “sandwich ontology”: What are the main classes? (Sandwich, Bread, Filling, etc.) What properties connect them? (hasBread, hasFilling, isVegetarian) Write 3 competency questions your ontology should answer Identify one pair of disjoint classes (Share your answers in the comments—I’ll feature the best designs!) Resources 📄 Ontology Development 101 – Stanford Guide🛠️ Download Protégé🍕 Pizza Ontology on GitHub Next: [Part 2 – Building Your First Pizza Ontology in Protégé →] Found this helpful? Follow for Parts 2-5 where we’ll build, query, and deploy a working knowledge graph! Tags: #Ontology #KnowledgeGraphs #SemanticWeb #AI #MachineLearning #DataScience #OWL #Protégé #KnowledgeRepresentation About This Series This is Part 1 of a 5-part series demystifying ontologies through practical examples. Whether you’re a data scientist, software engineer, or domain expert, you’ll learn to build structured knowledge that machines can reason over.

Blog

Importance of Data Science – Copy

Preeti / July 30, 2024

What is data science? Data science is the study of data to extract meaningful insights for business. It is a multidisciplinary approach that combines principles and practices from the fields of mathematics, statistics, artificial intelligence, and computer engineering to analyze large amounts of data. This analysis helps data scientists to ask and answer questions like what happened, why it happened, what will happen, and what can be done with the results. Why is data science important? Data science is important because it combines tools, methods, and technology to generate meaning from data. Modern organizations are inundated with data; there is a proliferation of devices that can automatically collect and store information. Online systems and payment portals capture more data in the fields of e-commerce, medicine, finance, and every other aspect of human life. We have text, audio, video, and image data available in vast quantities. History of data science While the term data science is not new, the meanings and connotations have changed over time. The word first appeared in the ’60s as an alternative name for statistics. In the late ’90s, computer science professionals formalized the term. A proposed definition for data science saw it as a separate field with three aspects: data design, collection, and analysis. It still took another decade for the term to be used outside of academia. Future of data science Artificial intelligence and machine learning innovations have made data processing faster and more efficient. Industry demand has created an ecosystem of courses, degrees, and job positions within the field of data science. Because of the cross-functional skillset and expertise required, data science shows strong projected growth over the coming decades. What is data science used for? Data science is used to study data in four main ways: 1. Descriptive analysis Descriptive analysis examines data to gain insights into what happened or what is happening in the data environment. It is characterized by data visualizations such as pie charts, bar charts, line graphs, tables, or generated narratives. For example, a flight booking service may record data like the number of tickets booked each day. Descriptive analysis will reveal booking spikes, booking slumps, and high-performing months for this service. 2. Diagnostic analysis Diagnostic analysis is a deep-dive or detailed data examination to understand why something happened. It is characterized by techniques such as drill-down, data discovery, data mining, and correlations. Multiple data operations and transformations may be performed on a given data set to discover unique patterns in each of these techniques.For example, the flight service might drill down on a particularly high-performing month to better understand the booking spike. This may lead to the discovery that many customers visit a particular city to attend a monthly sporting event. 3. Predictive analysis Predictive analysis uses historical data to make accurate forecasts about data patterns that may occur in the future. It is characterized by techniques such as machine learning, forecasting, pattern matching, and predictive modeling. In each of these techniques, computers are trained to reverse engineer causality connections in the data.For example, the flight service team might use data science to predict flight booking patterns for the coming year at the start of each year. The computer program or algorithm may look at past data and predict booking spikes for certain destinations in May. Having anticipated their customer’s future travel requirements, the company could start targeted advertising for those cities from February. 4. Prescriptive analysis Prescriptive analytics takes predictive data to the next level. It not only predicts what is likely to happen but also suggests an optimum response to that outcome. It can analyze the potential implications of different choices and recommend the best course of action. It uses graph analysis, simulation, complex event processing, neural networks, and recommendation engines from machine learning. Back to the flight booking example, prescriptive analysis could look at historical marketing campaigns to maximize the advantage of the upcoming booking spike. A data scientist could project booking outcomes for different levels of marketing spend on various marketing channels. These data forecasts would give the flight booking company greater confidence in their marketing decisions.

Author name: Preeti

Understanding Ontologies Through Pizza: A Beginner’s Guide (Part 1)

Importance of Data Science – Copy