Skip to main content

Raster vs. Vector

Vector is Corrector.

Object data model:

  • generalization
  • represent real things as locations and attributes
  • objects = conceptualizations of real items
  • concrete or abstract
{
"type": "Feature",
"geometry": {
"type": "Point",
"coordinates": [77.230004, 28.6947]
},
"properties": {
"name": "Majnu ka Tilla",
"type": "food joints",
"rating": 4.5,
"state": "Delhi",
"country": "India"
}
}
Restaurant NameCoordinatesFood TypeCost (1-5)HoursCapacityRating
Ama Restaurant[77.2305, 28.6948]Tibetan, Chinese211AM-10PM404.3
Dolma House[77.2307, 28.6945]Tibetan210AM-9PM254.4
AMA Cafe[77.2306, 28.6947]Cafe, Breakfast38AM-9PM304.6
Rigo Restaurant[77.2304, 28.6946]Korean, Japanese312PM-10PM354.2
Busan Korean[77.2305, 28.6949]Korean412PM-11PM454.5

Rester is faster, but vector is corrector.

Vector caclculations:

  • re-projection is precise
  • reliably stores:
  • point overvations
  • edges
  • boundaries
  • attributes stay reliable if joined properly

Raster is Faster.

Raster data model:

raster data model

Raster alignment:

  • location, grid of values
  • raster become difficult when comparing locations (10m cell size vs 15m cell size)
  • need to decide which cells should overlap

Raster re-projection:

  • introduces errors
  • lossy data
  • less precise than vector

Multiband raster:

  • rasters can be stacked to work with multiple rasters as if they're one.
  • each "band" is a single raster.
  • ex: represent data from different sensors captured at the same time.

multi-band rasters

multi-band raster in ArcGIS

Geodatabase Data Types

Summary Table of Concepts in Data Types and Management

ConceptDescriptionExamples/DetailsKey Considerations
Data Types (General)Categories defining how data is stored, grouped, and processed. Determine valid operations and storage efficiency.Numbers, text strings, dates/times.Crucial for operations (e.g., math on numbers, not text). Affects tools, queries, and programming.
Binary InterpretationData is stored as binary (0s/1s). Interpretation depends on data type.Binary 1010100 = 84 (number) or T (text).Incorrect data type leads to misinterpretation (e.g., 84 vs. T).
Integer TypesWhole numbers (no decimals). Subtypes vary by storage size and range.Short Integer: -32,768 to 32,767 (16 bits). Long Integer: ±2 billion (32 bits).Use short integers for small ranges (e.g., 0-90° slope) to save space.
Decimal TypesReal numbers with decimal precision. Floating-point numbers allow variable decimal placement.Float (Single): ±10³⁸ (32 bits). Double: ±10³⁰⁸ (64 bits).Use doubles for extreme precision/size. More storage and computational cost than integers.
Text StringsStores text or numbers treated as text (non-mathematical).ZIP codes, categories (e.g., "A1", "100-MainSt").ArcGIS file geodatabases lack length limits. Other systems may require length specs.
Null ValuesRepresents undefined/unknown data (distinct from zero).Unpopulated fields, missing measurements.Math with nulls returns nulls (e.g., 5 + null = null). Requires data cleanup.
Field NamingRules for naming fields to ensure clarity and compatibility.Use camelCase or underscores; avoid spaces (e.g., SlopeDegrees or slope_deg).Consistency and descriptiveness aid usability. Spaces may break database commands.

Key Takeaways:

  • Storage Efficiency: Choose data types based on required range/precision (e.g., short vs. long integers).
  • Operations: Data types enable/disallow operations (e.g., math on numbers, text concatenation).
  • Null Handling: Nulls require special handling in calculations and analysis.
  • Naming Conventions: Improve readability and system compatibility.

Vector attribute datasets

Data Joins & Relational Concepts: Real-World Examples


Core Concepts with Global Examples

1. Primary & Foreign Keys

Concept:

  • Primary Key (PK): A unique identifier for records in a table (e.g., auto-generated IDs).
  • Foreign Key (FK): A field in one table that references the primary key of another table.

Real-World Example:

  • Landmarks Table (Primary Key: landmark_id):

    landmark_idnameheight_m
    101Eiffel Tower330
  • Maintenance_Logs Table (Foreign Key: landmark_id):

    log_idlandmark_iddate
    2011012023-09-01

2. Types of Joins

Concept: Linking tables based on shared attributes or location.

Scenario: Linking ride locations to wait times.

  • Rides Table (PK: ride_id):

    ride_idname
    501Space Mountain
  • Wait_Times Table (FK: ride_id):

    time_idride_idwait_minutes
    60150145

3. ArcGIS Relates vs. Joins

Concept:

  • Joins merge attributes into one table (1:1 or 1:many).
  • Relates link tables dynamically without duplication (1:many, many:many).

Real-World Use Case:
Great Barrier Reef Monitoring

  • Reef_Polygons (1 record per reef) ↔ Coral_Health (multiple health scores per reef).
Reef_IDReef_Name
701Agincourt Reef
Health_IDReef_IDHealth_Score
8017018.5

Why Use a Relate?
Avoids duplicating reef geometry for each health score entry.


📊 Summary Table: Join Types & Use Cases

ConceptExampleGlobal Use Case
Primary Keylandmark_id for Eiffel TowerUnique IDs for landmarks, users, products.
Foreign Keyrestaurant_id in Yelp reviewsLinking reviews to businesses.
Spatial JoinTokyo subway stations ↔ neighborhoodsUrban planning, disaster response.
RelateGreat Barrier Reef health dataEnvironmental monitoring, research.

Practical Exercise: Design a Tourism Database

Scenario: Machu Picchu Visitor Management
Task: Design tables to track visitors, tickets, and archaeological sites.