CloudCore Datasets Overview
Dataset 1: CloudCore Sales Performance Data
File: cloudcore-sales-data.csv
Records: 138 rows
Time Period: 2023-2024 (8 quarters)
Purpose: Trend analysis and regional performance comparison using Datawrapper
Download: cloudcore-sales-data.csv
Business Context
CloudCore has been experiencing mixed performance across its product portfolio and regions. Management needs to understand which products and regions are driving growth versus decline to make informed strategic decisions about resource allocation and market focus.
Field Definitions
Field Name | Data Type | Description | Business Significance | Example Values |
---|---|---|---|---|
Region | Text | Sales territory (North, South, East, West, Central, Metro) | Geographic performance analysis | North, Metro |
Product | Text | CloudCore service offering | Product portfolio analysis | CloudSync, DataVault |
Quarter | Text | Business quarter (Q1-Q4) | Seasonal trend identification | Q1, Q2, Q3, Q4 |
Year | Integer | Calendar year | Year-over-year comparison | 2023, 2024 |
Revenue_AUD | Currency | Quarterly revenue in Australian dollars | Financial performance metric | 89500, 156000 |
Units_Sold | Integer | Number of service subscriptions sold | Volume performance metric | 450, 780 |
Sales_Rep | Text | Regional sales representative | Performance by salesperson | Sarah Chen, Marcus Wong |
Customer_Segment | Text | Target market category | Market segment analysis | Enterprise, SME |
Product Portfolio
- DataVault: Premium data storage and analytics service (highest revenue)
- CloudSync: Core cloud synchronisation platform (mid-tier)
- SecureLink: Security-focused connectivity solution (SME-focused)
- Analytics Pro: Advanced analytics tools (struggling product)
Key Patterns for Analysis
- Regional Trends: North/East showing decline, South/West showing growth
- Product Performance: DataVault strong across all regions, Analytics Pro underperforming
- Seasonal Patterns: Q4 typically strongest, Q1 typically weakest
- Market Segments: Enterprise customers generate higher revenue per unit
Dataset 2: CloudCore Customer Satisfaction Data
File: cloudcore-customer-data.csv
Records: 200 rows
Purpose: Customer segmentation and satisfaction analysis using ML Playground clustering
Download: cloudcore-customer-data.csv
Business Context
CloudCore’s customer satisfaction scores vary significantly across different customer segments. The company needs to understand which customer characteristics correlate with satisfaction levels to improve service delivery and reduce churn risk.
Field Definitions
Field Name | Data Type | Description | Business Significance | Example Values |
---|---|---|---|---|
Customer_ID | Text | Unique customer identifier | Individual customer tracking | CC001, CC002 |
Age | Integer | Customer age in years | Demographic segmentation | 34, 42, 29 |
Gender | Text | Customer gender identity | Demographic analysis | Female, Male |
Industry | Text | Customer’s business sector | Industry-based patterns | Healthcare, Finance |
Company_Size | Text | Customer organisation size | Business size segmentation | Large, Medium, Small |
Tenure_Months | Integer | Months as CloudCore customer | Loyalty/experience correlation | 18, 36, 8 |
Primary_Product | Text | Main CloudCore service used | Product adoption patterns | DataVault, CloudSync |
Monthly_Usage_Hours | Integer | Average monthly service usage | Engagement level indicator | 145, 89, 67 |
Support_Tickets_6M | Integer | Support tickets in last 6 months | Service quality indicator | 2, 1, 4 |
Satisfaction_Score | Decimal | Customer satisfaction (1-10 scale) | Key outcome measure | 8.2, 7.8, 6.1 |
Renewal_Likelihood | Text | Probability of contract renewal | Business risk assessment | High, Medium, Low |
Region | Text | Customer geographic location | Regional service patterns | Metro, North, South |
Contract_Value_AUD | Currency | Annual contract value | Customer economic value | 2400, 1800, 850 |
Industry Segments
- Finance: Typically high satisfaction, stable usage patterns
- Healthcare: Medium satisfaction, moderate support needs
- Manufacturing: High satisfaction, consistent usage
- Technology: Variable satisfaction, high usage
- Education: Lower satisfaction, budget constraints
- Retail: Medium satisfaction, seasonal usage patterns
Key Patterns for Analysis
- Satisfaction Clusters: Finance/Manufacturing (high 8-9), Healthcare/Technology (medium 6-7), Education (low 3-5)
- Usage Correlation: Heavy DataVault users generally more satisfied
- Support Impact: Higher ticket volumes correlate with lower satisfaction
- Tenure Effects: Longer tenure generally correlates with higher satisfaction
- Size Patterns: Large organisations generally more satisfied than small
Dataset 3: CloudCore Support Ticket Data
File: cloudcore-support-data.csv
Records: 100 rows
Time Period: July-October 2024
Purpose: Pattern identification and service quality analysis
Download: cloudcore-support-data.csv
Business Context
CloudCore’s support team handles various types of customer issues with varying resolution times and outcomes. Understanding patterns in support requests helps identify systemic problems and improve service delivery efficiency.
Field Definitions
Field Name | Data Type | Description | Business Significance | Example Values |
---|---|---|---|---|
Ticket_ID | Text | Unique support ticket identifier | Individual case tracking | TK001, TK002 |
Customer_ID | Text | Customer raising the ticket | Links to customer data | CC009, CC034 |
Date_Created | Date | Ticket creation date | Timeline analysis | 2024-07-15 |
Category | Text | Primary issue category | Issue type patterns | Technical, Billing |
Subcategory | Text | Specific issue type | Detailed problem analysis | Login Issues, Invoice Discrepancy |
Priority | Text | Urgency level assigned | Resource allocation patterns | High, Medium, Low |
Resolution_Hours | Decimal | Time to resolve in hours | Efficiency metric | 2.5, 24.0, 8.5 |
Customer_Segment | Text | Customer business size | Segment-based service patterns | Small, Medium, Large |
Product | Text | CloudCore service affected | Product-specific issues | Analytics Pro, CloudSync |
Industry | Text | Customer industry sector | Industry-specific patterns | Education, Manufacturing |
Outcome | Text | Final ticket resolution | Success rate tracking | Resolved, Escalated |
Satisfaction_Rating | Integer | Customer rating of support (1-5) | Service quality measure | 3, 4, 5 |
Follow_Up_Required | Text | Whether additional action needed | Service completion indicator | Yes, No |
Issue Categories
- Technical (60%): System functionality, performance, integration issues
- Billing (20%): Invoice, payment, contract-related queries
- Training (15%): User education, feature explanation requests
- Account (5%): Access, permissions, administrative changes
Key Patterns for Analysis
- Resolution Time Patterns: Simple account issues resolve fastest, complex technical issues take longest
- Product-Specific Issues: Analytics Pro generates most escalations
- Industry Patterns: Education sector experiences most issues, Finance sector least
- Priority Correlation: High priority tickets don’t always resolve fastest
- Satisfaction Drivers: Resolution time and outcome strongly correlate with satisfaction ratings
Dataset 4: CloudCore Cost Analysis Data
File: cost_analysis_2024.csv
Records: 6 rows
Purpose: Infrastructure cost analysis and depreciation planning
Download: cost_analysis_2024.csv
Business Context
CloudCore’s infrastructure investment requires careful cost management and depreciation planning. This dataset provides itemised costs for major infrastructure components to support financial planning and budgeting exercises.
Field Definitions
Field Name | Data Type | Description | Business Significance | Example Values |
---|---|---|---|---|
Item | Text | Infrastructure component name | Asset identification | Server, Workstation |
Unit Cost | Currency | Cost per individual item (AUD) | Per-unit investment | 4000, 1500 |
Quantity | Integer | Number of units purchased | Scale of investment | 2, 10 |
Total Cost | Currency | Total expenditure for item type | Budget impact | 8000, 15000 |
Depreciation (Years) | Integer | Expected useful life for accounting | Asset lifecycle planning | 5, 3, 10 |
Key Patterns for Analysis
- High-Value Items: Software Suite ($12,000) has longest depreciation period (10 years)
- Volume Purchases: Workstations represent highest total investment ($15,000 for 10 units)
- Lifecycle Planning: Hardware items depreciate faster (3-5 years) than software (10 years)
- Cost Distribution: Total infrastructure investment of $44,500 across 5 categories
Learning Activities Connection
Activity 1: Sales Trend Analysis (Datawrapper)
Students will explore the sales dataset to identify: - Regional performance trends over time - Product portfolio strengths and weaknesses - Seasonal patterns in revenue and units sold - Sales representative effectiveness
Key Questions: Which regions need attention? Which products should CloudCore prioritise?
Activity 2: Customer Segmentation (ML Playground)
Students will use clustering to discover: - Natural customer segments based on satisfaction and usage - Characteristics of high-value, satisfied customers - Risk factors for customer churn - Industry-specific service patterns
Key Questions: What makes customers satisfied? How can CloudCore reduce churn risk?
Activity 3: Support Pattern Analysis (Pattern Recognition)
Students will identify patterns in: - Issue types by customer segment and product - Resolution time factors - Escalation triggers - Service quality indicators
Key Questions: Where are CloudCore’s service gaps? How can support efficiency improve?
Data Quality & Limitations
Strengths
- Realistic business relationships and patterns
- Sufficient volume for meaningful analysis
- Clear correlations for learning objectives
- Diverse variables for multiple analysis approaches
Intentional Limitations
- Simplified compared to real-world complexity
- Limited time range (designed for 2-hour workshop)
- Clean data (minimal missing values or errors)
- Clear patterns (designed for learning, not research)
Ethical Considerations
- All data is fictional and anonymised
- No real customer information included
- Patterns reflect educational objectives, not actual bias
- Safe for classroom discussion and analysis
File Formats & Compatibility
CSV Structure
- UTF-8 encoding for international character support
- Comma-separated values with header row
- No special characters that might cause import issues
- Consistent date format (YYYY-MM-DD)
Tool Compatibility
- Datawrapper: Optimised for easy import and visualisation
- ML Playground: Structured for classification and clustering algorithms
- Spreadsheet Software: Opens cleanly in Excel, Google Sheets
- Database Import: Can be imported into most database systems
Last Updated: 21 August 2025
Created for: ISYS6014 Week 6 - Data Analysis Fundamentals
Contact: Course teaching team for questions or clarifications