VEDA Ingest UI
Web interface for creating and managing dataset ingestion workflows
The VEDA Ingest UI is a web-based application that simplifies the process of creating dataset ingestion requests for the VEDA platform. It provides both form-based and manual JSON editing capabilities to help users create properly formatted dataset definitions and submit them to the VEDA data pipeline.
The VEDA Ingest UI is available at ingest.openveda.cloud
Overview
The Ingest UI serves as the primary interface for the VEDA ingestion pipeline. It automates the process of:
- Creating STAC-compliant dataset definitions
- Validating dataset metadata and configurations
- Opening pull requests in the veda-data repository
- Publishing datasets to the staging catalog for review
- Facilitating the transition from staging to production catalogs
Key Features
Dynamic Form Generation
- Schema-driven forms: Automatically generates input forms based on JSON schemas
- Field validation: Real-time validation with helpful error messages
Manual JSON Editing
- Direct JSON manipulation: Full control over dataset definition structure
- Syntax validation: Real-time JSON syntax checking and validation
- Schema compliance: Ensures adherence to VEDA dataset definition requirements
Collection Management
- Create new collections: Generate STAC collections from scratch
- Edit existing collections: Modify open pull requests and update configurations
GitHub Integration
- Automatic PR creation: Creates pull requests in veda-data repository with consistent naming conventions
- Commit tracking: Maintains commit history for configuration changes
- Review workflow: Integrates with VEDA team review process
User Workflow
Authentication
- Navigate to ingest.openveda.cloud
- Click “Sign in with Keycloak”
- Select your institutional identity provider through CILogon
- Complete authentication and return to the Ingest UI
Creating a Dataset Definition
Option 1: Form-Based Creation
- Navigate to the “Create Collection” section
- Fill out the guided form with your dataset information:
- Collection ID: Unique identifier (lowercase, alphanumeric with hyphens)
- Title: Human-readable dataset title
- Description: Detailed dataset description
- License: Data usage license (default: CC0-1.0)
- Spatial Extent: Geographic bounds of the dataset
- Temporal Extent: Time range covered by the dataset
- Discovery Items: Configuration for data discovery and processing
- Validate inputs with real-time feedback
Option 2: Manual JSON Editing
- Navigate to the “Manual JSON Edit” section
- Paste or type your dataset definition JSON
- Use the built-in validation to check syntax and schema compliance
Dataset Publication
- Submit to Staging: Click “Submit” to publish to the staging catalog
- GitHub PR Creation: Automatic pull request creation in veda-data repository
- Staging Review: Review your dataset at staging.openveda.cloud
- Production Approval: VEDA team reviews and approves for production deployment
Form Configuration
The Ingest UI uses JSON Schema and UI Schema files to generate dynamic forms:
- JSON Schema: Defines data structure and validation rules
- UI Schema: Controls form layout, field ordering, and presentation
- 24-Column Grid: Responsive layout system for optimal form organization
Customizing Form Fields
Form fields are configured through the UI Schema’s ui:grid property:
"ui:grid": [
{
"collection": 4,
"title": 4,
"license": 4,
"description": 12
}
]Each row must total 24 columns for proper layout alignment.
GitHub Workflow
- Repository: Creates PRs in veda-data
- Naming Convention:
'Ingest Request for [collectionName]' - Branch Strategy:
feat/[sanitizedCollectionName] - File Organization:
ingestion-data/staging/dataset-config/[collectionName].json
Troubleshooting
Common Issues
Collection Name Conflicts
Error: Collection already exists Solution: - Choose a unique collection identifier - Check existing collections in the STAC catalog - Use descriptive, specific naming conventions
Getting Help
- Documentation: Review dataset ingestion guides
- Examples: Study the GEOGLAM ingestion example
- GitHub Issues: Open issues in veda-data repository
- Feature Demonstrations: Visit the live test reports for visual feature overviews