Top 10 Pain Points for Data Scientists working in the real world
20/05/2022: - Access to relevant data Relevant data may not be directly available to the analyst (may need org permission, support infrastructure in place, different process for "one off" access vs. need to regularly refresh data) - Data availability Relevant data may still need to be identified and collected (same as above re. need for infrastructure in place before starting with the analysis job) - Data Integration Data from different sources need to be integrated into a normalised form, specific issues like record merge, record deduplication, missing attributes need to be tackled. Lack of documentation...