Introduction to StructHub

Introduction

StructHub is a comprehensive knowledge management and Retrieval-Augmented Generation (RAG) platform that centralizes your organizational knowledge across multiple data sources. With advanced AI capabilities, seamless integrations, and powerful search functionality, StructHub transforms how you interact with your data.

Whether you’re managing enterprise documents, building AI applications, or creating intelligent knowledge bases, StructHub provides the tools and infrastructure to unlock the full potential of your unstructured data.

Core Features

πŸ” Universal Document Processing

  • 30+ File Formats: Process PDFs, Word documents, PowerPoint presentations, Excel spreadsheets, images, and more
  • 20+ Languages: Comprehensive multilingual support including English, Spanish, French, German, Hindi, Chinese, Japanese, and more
  • Advanced OCR: Automatic optical character recognition for scanned documents and images
  • Smart Text Extraction: Page-by-page extraction with contextual metadata preservation
  • LLM-Ready Output: Structured output optimized for Large Language Models

🤖 AI-Powered Chat Interface

  • Real-time RAG Chat: Chat with your documents using advanced retrieval-augmented generation
  • Gemini 2.5 Model: Powered by Google’s advanced Gemini 2.5 model
  • Smart Citations: Automatic source attribution with page numbers and document references
  • Follow-up Questions: AI-generated contextual follow-up questions
  • Conversation Management: Organize chats by projects and threads
  • Export Capabilities: Export conversations to PDF for sharing and archiving

📊 Project-Based Organization

  • Multi-Project Support: Organize knowledge bases by projects or teams
  • Team Collaboration: Add team members with role-based permissions
  • Credit Management: Granular credit allocation and usage tracking per project
  • Advanced Settings: Customize LLM models, search parameters, and token limits
  • Metadata Tagging: Automatic and manual metadata extraction and tagging

Data Source Integrations

StructHub connects to your existing data infrastructure with comprehensive integrations:

☁️ Cloud Storage Platforms

  • Google Drive: Full folder sync with OAuth 2.0 authentication
  • Microsoft OneDrive: Seamless integration with the Microsoft ecosystem
  • SharePoint: Enterprise-grade SharePoint sites and document libraries
  • Amazon S3: Secure bucket-level integration with folder path support
  • Azure Blob Storage: Container-level access with Azure AD authentication
  • Google Cloud Storage: GCP bucket integration with service account authentication

🔧 Enterprise Systems

  • Confluence: Wiki and documentation integration with space-level sync
  • ServiceNow: Knowledge articles, service catalog, ITSM incidents, and APM applications
  • File Upload: Direct file upload with drag-and-drop interface

🔄 Automated Sync Options

  • Flexible Scheduling: Hourly, daily, weekly, or monthly sync intervals
  • Incremental Updates: Only sync changed or new content
  • Real-time Processing: Process new files as they’re added
  • Sync Status Monitoring: Track sync progress and handle errors gracefully
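
As an illustration of the incremental-update idea above, here is a minimal sketch that picks out only new or changed files by comparing content hashes. It is not StructHub's actual sync logic: the function names and the hash-based change detection are assumptions made for this example.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """Return a stable SHA-256 digest of a file's content."""
    return hashlib.sha256(content).hexdigest()


def plan_incremental_sync(remote_files: dict[str, bytes],
                          synced: dict[str, str]) -> list[str]:
    """Return the sorted paths that are new or changed since the last sync.

    remote_files maps path -> current content.
    synced maps path -> digest recorded at the previous sync (hypothetical store).
    """
    return sorted(
        path
        for path, content in remote_files.items()
        if synced.get(path) != fingerprint(content)
    )


# State recorded by the previous sync run.
previous = {"a.pdf": fingerprint(b"v1"), "b.docx": fingerprint(b"same")}
# Current state of the data source: a.pdf changed, c.txt is new.
current = {"a.pdf": b"v2", "b.docx": b"same", "c.txt": b"new"}

print(plan_incremental_sync(current, previous))  # ['a.pdf', 'c.txt']
```

Unchanged files such as `b.docx` are skipped, which is what keeps a scheduled sync cheap when most content is stable.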

AI Chat & RAG Capabilities

💬 Advanced Chat Features

  • Multi-Source Search: Search across all connected data sources simultaneously
  • Configurable Search Depth: Adjust TopK values and search parameters
  • Web Search Integration: Combine internal knowledge with real-time web search
  • Streaming Responses: Real-time response generation with thinking indicators
  • Source Attribution: Complete transparency with document sources and page numbers

🧠 RAG Engine

  • Vector Database: Powered by Pinecone for high-performance semantic search
  • Hybrid Search: Combines semantic similarity with keyword matching
  • Context Preservation: Maintains conversation context across multiple exchanges
  • Smart Chunking: Intelligent document segmentation for optimal retrieval
  • Relevance Scoring: Advanced scoring algorithms for result ranking
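
To make the hybrid search idea concrete, the sketch below blends a similarity score with a keyword-overlap score and returns the top results. It is a toy under stated assumptions: a bag-of-words vector stands in for a real embedding model, no vector database is involved, and the `alpha` weighting is a hypothetical choice, not StructHub's scoring algorithm.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def keyword_score(query: str, doc: str) -> float:
    """Fraction of distinct query terms that appear verbatim in the document."""
    q_terms = set(query.lower().split())
    d_terms = set(doc.lower().split())
    return len(q_terms & d_terms) / len(q_terms) if q_terms else 0.0


def hybrid_search(query: str, docs: list[str],
                  top_k: int = 50, alpha: float = 0.5) -> list[tuple[float, str]]:
    """Rank docs by alpha * similarity + (1 - alpha) * keyword score, best first."""
    q_vec = embed(query)
    scored = [
        (alpha * cosine(q_vec, embed(doc))
         + (1 - alpha) * keyword_score(query, doc), doc)
        for doc in docs
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


docs = [
    "invoice payment terms",
    "holiday schedule",
    "payment due dates for invoice",
]
for score, doc in hybrid_search("invoice payment", docs, top_k=2):
    print(f"{score:.3f}  {doc}")
```

Raising `alpha` favors the vector score, while lowering it favors exact term matches. This is the usual trade-off a hybrid ranker exposes.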

🎯 Customization Options

  • Gemini 2.5 Model: Powered by Google’s latest Gemini 2.5 model
  • Token Limits: Configure response length (10K-800K tokens)
  • Search Parameters: Fine-tune TopK values (default: 50, max: 200)
  • Web Search Toggle: Enable/disable web search integration
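
The limits listed above can be captured in a small validated settings object. This is a sketch only: the class and field names are hypothetical, "10K" and "800K" are read as 10,000 and 800,000, and the minimum TopK of 1 and the default values for `max_tokens` and `web_search` are assumptions, since the source gives only the TopK default and the ranges.

```python
from dataclasses import dataclass

TOP_K_DEFAULT = 50       # default stated in the list above
TOP_K_MAX = 200          # maximum stated in the list above
MIN_TOKENS = 10_000      # "10K" lower bound, read as 10,000 (assumption)
MAX_TOKENS = 800_000     # "800K" upper bound, read as 800,000 (assumption)


@dataclass(frozen=True)
class ChatSettings:
    """Hypothetical per-project chat settings with range checks."""
    top_k: int = TOP_K_DEFAULT
    max_tokens: int = MIN_TOKENS   # assumed default; only the range is documented
    web_search: bool = False       # assumed default

    def __post_init__(self) -> None:
        # A lower bound of 1 for top_k is an assumption.
        if not 1 <= self.top_k <= TOP_K_MAX:
            raise ValueError(f"top_k must be between 1 and {TOP_K_MAX}")
        if not MIN_TOKENS <= self.max_tokens <= MAX_TOKENS:
            raise ValueError(
                f"max_tokens must be between {MIN_TOKENS} and {MAX_TOKENS}"
            )


settings = ChatSettings(top_k=100, max_tokens=50_000, web_search=True)
print(settings)
```

Validating at construction time means an out-of-range value fails early instead of surfacing as a rejected request later.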

Project Management

👥 Team Collaboration

  • Multi-User Support: Add team members to projects
  • Role-Based Access: Owner, Admin, and Member roles with appropriate permissions
  • Project Sharing: Share projects and knowledge bases across teams
  • Activity Tracking: Monitor team usage and contribution
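
A role-based check over the three roles named above might look like the following sketch. Only the role names come from the list. The permission names and which role holds which permission are hypothetical and chosen purely for illustration.

```python
from enum import Enum


class Role(Enum):
    OWNER = "owner"
    ADMIN = "admin"
    MEMBER = "member"


# Hypothetical permission sets; the real mapping is not documented here.
PERMISSIONS: dict[Role, frozenset[str]] = {
    Role.MEMBER: frozenset({"chat", "search"}),
    Role.ADMIN: frozenset({"chat", "search", "manage_sources", "invite_members"}),
    Role.OWNER: frozenset({"chat", "search", "manage_sources",
                           "invite_members", "delete_project"}),
}


def can(role: Role, action: str) -> bool:
    """Return True if the given role is allowed to perform the action."""
    return action in PERMISSIONS[role]


print(can(Role.ADMIN, "invite_members"))   # True
print(can(Role.MEMBER, "delete_project"))  # False
```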

📈 Analytics & Monitoring

  • Usage Analytics: Track credit consumption and API usage
  • Performance Metrics: Monitor query response times and success rates
  • Data Source Health: Monitor sync status and data freshness
  • Cost Management: Project-level cost tracking and budgeting

🔑 API Key Management

  • Project-Specific Keys: Generate API keys tied to specific projects
  • Multiple Keys: Create multiple keys for different use cases
  • Usage Monitoring: Track API key usage and rate limits
  • Key Rotation: Easy key regeneration for security

Advanced Features

πŸ” Security & Encryption

  • End-to-End Encryption: Optional encryption for sensitive documents
  • User-Provided Keys: Bring your own encryption keys
  • System-Generated Keys: Automatic key generation and management
  • Data Isolation: Complete data separation between organizations

🏷️ Metadata Management

  • Automatic Tagging: AI-powered metadata extraction
  • Custom Metadata: Define custom metadata fields per project
  • Metadata Prompts: Configure AI prompts for metadata generation
  • Searchable Metadata: Use metadata for enhanced search and filtering

API Capabilities

πŸ” Knowledge Base Search API

  • Semantic Search: Advanced vector-based search capabilities
  • Configurable Results: Adjust result count and relevance scoring
  • Source Attribution: Complete metadata and source information
  • Real-time Processing: Fast response times for production use

🌐 RESTful Architecture

  • Well-Documented APIs: Comprehensive API documentation
  • Standard HTTP Methods: RESTful design principles
  • JSON Responses: Structured, machine-readable responses
  • Rate Limiting: Built-in rate limiting for fair usage
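
For a sense of what calling a JSON search API with a project-specific key could look like, the sketch below builds (but does not send) a request using Python's standard library. Everything specific is a placeholder: the `api.example.com` host, the `/v1/search` path, the `query` and `top_k` field names, and the Bearer-token header are assumptions, so consult the API Reference for the real endpoint and authentication scheme.

```python
import json
import urllib.request


def build_search_request(api_key: str, query: str, top_k: int = 50,
                         base_url: str = "https://api.example.com") -> urllib.request.Request:
    """Build a POST request for a hypothetical knowledge base search endpoint."""
    # Field names are placeholders, not the documented request schema.
    payload = json.dumps({"query": query, "top_k": top_k}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/search",          # placeholder path
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


request = build_search_request("YOUR_API_KEY", "What is our refund policy?")
print(request.get_method(), request.full_url)
# Sending it would be: urllib.request.urlopen(request), then json.load() on the response.
```

Because the API is rate limited, a production client would also want to back off and retry when the server signals that the limit has been reached.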

Security & Privacy

πŸ›‘οΈ Data Protection

  • Data Encryption: At-rest and in-transit encryption
  • Access Controls: Role-based access control (RBAC)
  • Audit Logging: Comprehensive audit trails

Pricing & Free Tier

🆓 Generous Free Tier

  • 2,000 Credits Monthly: Substantial free usage every month
  • Credit Rollover: Unused credits roll over to the next month
  • Full Feature Access: Access to all core features
  • No Time Limits: Use the free tier indefinitely
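
The rollover arithmetic can be worked through with a short sketch. It assumes that unused credits carry forward without a cap or expiry, which the list above does not spell out, and the function name is hypothetical.

```python
MONTHLY_FREE_CREDITS = 2_000  # free-tier grant stated above


def balances_after_each_month(usage_per_month: list[int]) -> list[int]:
    """Return the credits left at the end of each month, carrying unused credits forward.

    Assumes rollover has no cap or expiry (not confirmed by the source).
    """
    balance = 0
    history = []
    for used in usage_per_month:
        balance += MONTHLY_FREE_CREDITS   # new monthly grant
        if used > balance:
            raise ValueError("usage exceeds available credits")
        balance -= used
        history.append(balance)
    return history


# Month 1: 2000 - 1500 = 500. Month 2: 500 + 2000 - 2300 = 200. Month 3: 200 + 2000 - 0 = 2200.
print(balances_after_each_month([1500, 2300, 0]))  # [500, 200, 2200]
```

Under this assumption, the second month can exceed the 2,000-credit grant because 500 credits were carried over from the first.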

💰 Transparent Pricing

  • Credit-Based System: Pay only for what you use
  • Flexible Plans: Scale from individual to enterprise
  • Volume Discounts: Better rates for higher usage

Getting Started

🚀 Quick Start

  1. Sign Up: Create your free account with email verification
  2. Create Project: Set up your first knowledge base project
  3. Connect Data Sources: Link your cloud storage or upload files
  4. Start Chatting: Begin asking questions about your documents
  5. Invite Team: Add team members and collaborate

📚 Resources

  • Documentation: Comprehensive guides and tutorials
  • API Reference: Complete API documentation
  • Professional Support: Enterprise support options

Contact

For questions, assistance, or enterprise inquiries:


Ready to transform your knowledge management? Start with our free tier and experience the power of AI-driven document intelligence. No credit card required – just sign up and start building your intelligent knowledge base today!