Introduction to StructHub
Table of Contents
- Introduction
- Core Features
- Data Source Integrations
- AI Chat & RAG Capabilities
- Project Management
- Advanced Features
- API Capabilities
- Security & Privacy
- Pricing & Free Tier
- Getting Started
- Contact
Introduction
StructHub is a knowledge management and Retrieval-Augmented Generation (RAG) platform that centralizes your organizational knowledge across multiple data sources. It combines document processing, integrations with cloud storage and enterprise systems, and AI chat with cited answers, so you can search and query your data in one place.
Whether you're managing enterprise documents, building AI applications, or creating intelligent knowledge bases, StructHub provides the tools and infrastructure to unlock the full potential of your unstructured data.
Core Features
Universal Document Processing
- 30+ File Formats: Process PDFs, Word documents, PowerPoint presentations, Excel spreadsheets, images, and more
- 20+ Languages: Comprehensive multilingual support including English, Spanish, French, German, Hindi, Chinese, Japanese, and more
- Advanced OCR: Automatic optical character recognition for scanned documents and images
- Smart Text Extraction: Page-by-page extraction with contextual metadata preservation
- LLM-Ready Output: Structured output optimized for Large Language Models
AI-Powered Chat Interface
- Real-time RAG Chat: Chat with your documents using advanced retrieval-augmented generation
- Gemini 2.5 Model: Powered by Google's advanced Gemini 2.5 model
- Smart Citations: Automatic source attribution with page numbers and document references
- Follow-up Questions: AI-generated contextual follow-up questions
- Conversation Management: Organize chats by projects and threads
- Export Capabilities: Export conversations to PDF for sharing and archiving
Project-Based Organization
- Multi-Project Support: Organize knowledge bases by projects or teams
- Team Collaboration: Add team members with role-based permissions
- Credit Management: Granular credit allocation and usage tracking per project
- Advanced Settings: Customize LLM models, search parameters, and token limits
- Metadata Tagging: Automatic and manual metadata extraction and tagging
Data Source Integrations
StructHub connects to your existing data infrastructure with comprehensive integrations:
Cloud Storage Platforms
- Google Drive: Full folder sync with OAuth 2.0 authentication
- Microsoft OneDrive: Seamless integration with Microsoft ecosystem
- SharePoint: Enterprise-grade SharePoint sites and document libraries
- Amazon S3: Secure bucket-level integration with folder path support
- Azure Blob Storage: Container-level access with Azure AD authentication
- Google Cloud Storage: GCP bucket integration with service account authentication
Enterprise Systems
- Confluence: Wiki and documentation integration with space-level sync
- ServiceNow: Knowledge articles, service catalog, ITSM incidents, and APM applications
- File Upload: Direct file upload with drag-and-drop interface
Automated Sync Options
- Flexible Scheduling: Hourly, daily, weekly, or monthly sync intervals
- Incremental Updates: Only sync changed or new content
- Real-time Processing: Process new files as they're added
- Sync Status Monitoring: Track sync progress and handle errors gracefully
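As an illustration of the incremental update idea above, the following minimal sketch compares content fingerprints recorded at the last sync with the files that exist now. It is not StructHub's implementation: the function names, the use of SHA-256, and the in-memory dictionaries are all assumptions made for the example.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """Return a stable SHA-256 hex digest for a file's bytes."""
    return hashlib.sha256(content).hexdigest()


def plan_incremental_sync(
    previous: dict[str, str], current: dict[str, bytes]
) -> dict[str, list[str]]:
    """Classify files so that only new or changed content is reprocessed.

    previous maps path -> fingerprint recorded at the last sync (hypothetical store).
    current maps path -> file bytes as they exist in the data source now.
    """
    plan: dict[str, list[str]] = {"new": [], "changed": [], "unchanged": [], "deleted": []}
    for path, content in sorted(current.items()):
        digest = fingerprint(content)
        if path not in previous:
            plan["new"].append(path)
        elif previous[path] != digest:
            plan["changed"].append(path)
        else:
            plan["unchanged"].append(path)
    # Paths seen at the last sync but missing now were removed at the source.
    plan["deleted"] = sorted(set(previous) - set(current))
    return plan
```

Only the paths under "new" and "changed" would need to be processed again; a real connector would more likely rely on modification timestamps or change tokens supplied by the storage provider.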
AI Chat & RAG Capabilities
Advanced Chat Features
- Multi-Source Search: Search across all connected data sources simultaneously
- Configurable Search Depth: Adjust TopK values and search parameters
- Web Search Integration: Combine internal knowledge with real-time web search
- Streaming Responses: Real-time response generation with thinking indicators
- Source Attribution: Complete transparency with document sources and page numbers
RAG Engine
- Vector Database: Powered by Pinecone for high-performance semantic search
- Hybrid Search: Combines semantic similarity with keyword matching
- Context Preservation: Maintains conversation context across multiple exchanges
- Smart Chunking: Intelligent document segmentation for optimal retrieval
- Relevance Scoring: Advanced scoring algorithms for result ranking
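To make the hybrid search idea above concrete, here is a small sketch that blends a semantic score (cosine similarity between embedding vectors) with a keyword score (fraction of query terms found in the text). The weighting scheme, the `alpha` parameter, and all function names are assumptions for illustration; StructHub's actual scoring is not described in this document.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def keyword_overlap(query: str, text: str) -> float:
    """Fraction of distinct query terms that also appear in the text."""
    q_terms = set(query.lower().split())
    t_terms = set(text.lower().split())
    return len(q_terms & t_terms) / len(q_terms) if q_terms else 0.0


def hybrid_rank(query, query_vec, chunks, alpha=0.7, top_k=50):
    """Rank (text, vector) chunks by a weighted mix of both scores.

    alpha weights the semantic score and (1 - alpha) the keyword score.
    Returns at most top_k (score, text) pairs, best first.
    """
    scored = []
    for text, vec in chunks:
        semantic = cosine_similarity(query_vec, vec)
        lexical = keyword_overlap(query, text)
        scored.append((alpha * semantic + (1 - alpha) * lexical, text))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]
```

In this sketch, raising `alpha` favors chunks that are semantically close even when they share no exact terms with the query, while lowering it favors literal keyword matches.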
Customization Options
- Gemini 2.5 Model: Powered by Google's latest Gemini 2.5 model
- Token Limits: Configure response length (10K-800K tokens)
- Search Parameters: Fine-tune TopK values (default: 50, max: 200)
- Web Search Toggle: Enable/disable web search integration
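The limits listed above can be captured in a small settings object that rejects out-of-range values. This is a sketch only: the class and field names are hypothetical, "10K-800K" is read as 10,000 to 800,000 tokens, and the minimum TopK of 1 and the default token limit and web search values are assumptions that the document does not state.

```python
from dataclasses import dataclass

DEFAULT_TOP_K = 50      # stated default
MAX_TOP_K = 200         # stated maximum
MIN_TOKENS = 10_000     # "10K", assuming K means 1,000
MAX_TOKENS = 800_000    # "800K", assuming K means 1,000


@dataclass(frozen=True)
class ChatSettings:
    """Hypothetical container for per-project chat settings."""

    top_k: int = DEFAULT_TOP_K
    max_tokens: int = MIN_TOKENS  # assumed default, not stated in the source
    web_search: bool = False      # assumed default, not stated in the source

    def __post_init__(self) -> None:
        # A lower bound of 1 for TopK is an assumption.
        if not 1 <= self.top_k <= MAX_TOP_K:
            raise ValueError(f"top_k must be between 1 and {MAX_TOP_K}")
        if not MIN_TOKENS <= self.max_tokens <= MAX_TOKENS:
            raise ValueError(
                f"max_tokens must be between {MIN_TOKENS} and {MAX_TOKENS}"
            )
```

For example, `ChatSettings(top_k=100, web_search=True)` is accepted, while `ChatSettings(top_k=500)` raises a `ValueError`.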
Project Management
Team Collaboration
- Multi-User Support: Add team members to projects
- Role-Based Access: Owner, Admin, and Member roles with appropriate permissions
- Project Sharing: Share projects and knowledge bases across teams
- Activity Tracking: Monitor team usage and contribution
Analytics & Monitoring
- Usage Analytics: Track credit consumption and API usage
- Performance Metrics: Monitor query response times and success rates
- Data Source Health: Monitor sync status and data freshness
- Cost Management: Project-level cost tracking and budgeting
API Key Management
- Project-Specific Keys: Generate API keys tied to specific projects
- Multiple Keys: Create multiple keys for different use cases
- Usage Monitoring: Track API key usage and rate limits
- Key Rotation: Easy key regeneration for security
Advanced Features
Security & Encryption
- End-to-End Encryption: Optional encryption for sensitive documents
- User-Provided Keys: Bring your own encryption keys
- System-Generated Keys: Automatic key generation and management
- Data Isolation: Complete data separation between organizations
Metadata Management
- Automatic Tagging: AI-powered metadata extraction
- Custom Metadata: Define custom metadata fields per project
- Metadata Prompts: Configure AI prompts for metadata generation
- Searchable Metadata: Use metadata for enhanced search and filtering
API Capabilities
Knowledge Base Search API
- Semantic Search: Advanced vector-based search capabilities
- Configurable Results: Adjust result count and relevance scoring
- Source Attribution: Complete metadata and source information
- Real-time Processing: Fast response times for production use
RESTful Architecture
- Well-Documented APIs: Comprehensive API documentation
- Standard HTTP Methods: RESTful design principles
- JSON Responses: Structured, machine-readable responses
- Rate Limiting: Built-in rate limiting for fair usage
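A client for a JSON-over-HTTP API with rate limiting typically builds an authenticated POST request and backs off when the server answers with HTTP 429. The sketch below shows that general pattern using only the Python standard library. Everything specific in it is hypothetical: the base URL is a placeholder, and the `/search` path, the Bearer token scheme, and the `query` and `top_k` field names are assumptions, not StructHub's documented API. Consult the API reference for the real endpoint and parameters.

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "https://api.example.com"  # placeholder, not a real StructHub URL


def build_search_request(api_key: str, query: str, top_k: int = 50) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for a search call.

    The path, header scheme, and body fields are illustrative assumptions.
    """
    payload = json.dumps({"query": query, "top_k": top_k}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/search",
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def retry_delay(
    attempt: int,
    retry_after: Optional[str] = None,
    base: float = 1.0,
    cap: float = 30.0,
) -> float:
    """Seconds to wait after a rate-limited (HTTP 429) response.

    Honors a numeric Retry-After header value when present; otherwise uses
    exponential backoff (base * 2**attempt), capped at `cap` seconds.
    """
    if retry_after is not None and retry_after.isdigit():
        return min(float(retry_after), cap)
    return min(base * (2 ** attempt), cap)
```

A caller would send the request with `urllib.request.urlopen`, catch `urllib.error.HTTPError` with code 429, sleep for `retry_delay(attempt, error.headers.get("Retry-After"))`, and try again up to a fixed number of attempts.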
Security & Privacy
Data Protection
- Data Encryption: At-rest and in-transit encryption
- Access Controls: Role-based access control (RBAC)
- Audit Logging: Comprehensive audit trails
Pricing & Free Tier
Generous Free Tier
- 2,000 Credits Monthly: Substantial free usage every month
- Credit Rollover: Unused credits roll over to next month
- Full Feature Access: Access to all core features
- No Time Limits: Use the free tier indefinitely
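The effect of credit rollover can be worked through with simple arithmetic. The sketch below assumes that each month adds 2,000 credits to the balance and that unused credits carry over without a cap or expiry; the document does not say whether such limits exist, so treat those as assumptions. The function name is hypothetical.

```python
MONTHLY_FREE_CREDITS = 2_000  # stated monthly free allowance


def balances_with_rollover(
    monthly_usage: list[int], grant: int = MONTHLY_FREE_CREDITS
) -> list[int]:
    """Return the credits remaining at the end of each month.

    Assumes unused credits roll over in full, with no cap or expiry.
    """
    balance = 0
    remaining = []
    for used in monthly_usage:
        balance += grant  # new monthly allowance is added to any rollover
        if used > balance:
            raise ValueError("usage exceeds available credits")
        balance -= used
        remaining.append(balance)
    return remaining
```

Under these assumptions, using 500 credits in the first month leaves 1,500 to roll over, so 3,500 are available in the second month.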
Transparent Pricing
- Credit-Based System: Pay only for what you use
- Flexible Plans: Scale from individual to enterprise
- Volume Discounts: Better rates for higher usage
Getting Started
Quick Start
- Sign Up: Create your free account with email verification
- Create Project: Set up your first knowledge base project
- Connect Data Sources: Link your cloud storage or upload files
- Start Chatting: Begin asking questions about your documents
- Invite Team: Add team members and collaborate
Resources
- Documentation: Comprehensive guides and tutorials
- API Reference: Complete API documentation
- Professional Support: Enterprise support options
Contact
For questions, assistance, or enterprise inquiries:
- Email: support@structhub.io
- Documentation: docs.structhub.io
Ready to transform your knowledge management? Start with our free tier and experience the power of AI-driven document intelligence. No credit card required: just sign up and start building your intelligent knowledge base today!