Big Data Analytics in Government and Large Enterprises

Program Overview

Since it was first defined in 2001, Big Data has made major advances into consumer lives and virtually every organization. According to IDG, a global market intelligence leader, “From 2005 to 2020, the digital universe will grow by a factor of 300 to 40 trillion gigabytes”, approximately the same as 625 Billion iPhone units.
Big Data benefits are becoming increasingly visible due to this growth and the growing organisational acceptance and maturity in the area. According to the National Bureau for Economic Research demand forecast in a retail environment improves as more and more data becomes available. The research found that the firm’s forecast performance, the subject of this study was Amazon, improved over time as a result of gradual improvements due to the introduction of “new models and improved technologies” [1].
According to IBM by 2020 up to 85% of customer services interactions will not involve a human. Growing machine learning and NLP capabilities will allow for chatbots, phone support and self-service interfaces to deliver customer service at an acceptable level. These technologies will reduce errors, increase the speed of resolving customer issues and remove some of the bias present in customer service interactions. The UBS bank has already deployed, using IBM Watson, an avatar based on economist Daniel Kalt to interact with clients. The avatar was trained by Kalt himself and allows customers to speak to Kalt as if he were present physically [2].
While the corporate sector was one of the early adopters, Big Data is making substantial inroads in other areas. Medical science is one of these. The presence of adverse effects in a drug, after clinical trials are completed, is an important consideration for pharmaceutical companies. Pharmacovigilance is an activity with the goal of discovering and understanding harmful side-effects, called adverse events (AE). Natural Language Processing (NLP), heavily reliant on Big Data, was used to enhance the existing pharmacovigilance process by using analysing user comments on various health-related sites and MEDLINE abstracts in addition to the structured information already in use. The results showed that using NLP allowed unreported AEs to be identified, thus improving the existing pharmacovigilance process [3].
At the Masterclass delegates will study how Big Data can help their organisations, review real-life success stories, learn about Big Data sources and Open-source Intelligence (OSINT), examine and evaluate major technologies, start using Artificial Intelligence and prepare a business case for a Big Data project.

[1] Bajari, P., Chernozhukov, V., Hortaçsu, A., & Suzuki, J. (2018). The Impact of Big Data on Firm Performance: An Empirical Investigation. IDEAS Working Paper Series from RePEc, IDEAS Working Paper Series from RePEc, 2018.
[2] Moore, M. (2018). Upgrading the Call Center. Fortune, 178(5), 104.
[3] Yeleswarapu, S., Rao, A., Joseph, T., Saipradeep, V., & Srinivasan, R. (2014). A pipeline to extract drug-adverse event pairs from multiple data sources. BMC Medical Informatics and Decision Making, 14(1), 13.


Learning Outcomes

Upon completion of this program, participants will:

  • Comprehend the business rationale for using Big Data
  • Appreciate Big Data characteristics and types
  • Recognise various Big Data sources and the mechanisms to access them
  • Evaluate the relevance of the most popular Big Data technologies at present and:
  • Understand the applications and functionality of each
  • Estimate the benefits and weaknesses of each
  • Develop a selection methodology for the most appropriate option
  • Design and run different types of Big Data analyses and visualisations, including:
  • Creating meaningful visualisations with high impact
  • Understanding Artificial Intelligence (AI) and its applications
  • Employing AI for a specific task, such as estimating construction costs
  • List the success factors and potential roadblocks
  • Create a business case for Big Data projects, including costings and expertise requirements

Intended Audience

  • Board Directors
  • CxO Leaders
  • Chief Internal Auditors
  • Chief Risk Officers
  • Governance Professionals
  • Finance Professionals
  • HR Professionals
  • Marketing Professionals
  • Engineers
  • Managers and Executives
  • Strategy and Innovation Professionals
  • Data Scientists
  • Business Intelligence Professionals
  • Business Analysts
  • Business Consultants
  • Risk and Compliance Analysts
  • Finance and Operational Auditors
  • IS Auditors

Program Level and Pre-requisites for Attending

Foundation level, no pre-requisites

Program Duration

2 days

Number of Attendees

Maximum 17 attendees

Training Methodology

A distinguishing feature of this program is its practical aspect. At the end of each module there is a practical exercise designed to reinforce the concepts of the module and to give participants a chance to “get their hands dirty”. The exercises are either scenarios where participants work in groups or individual tasks executed within a specially prepared software environment.

A specific module of the program is devoted to creating a Big Data analytics pilot project, which the participants will be able to use as a blueprint in their teams for a real-life Big Data analytics initiative.

At the start of the course, each participant will receive a USB with a fully configured Virtual Machine that will be used for the practical work throughout the course. Participants will be allowed to keep the USB after the course and use the software in their work.

Program Outline

Day 1

  • Introduction
  • Business Rationale for Big Data
  • Big Data Characteristics
  • Big Data Case Studies:
    • Government
    • Enterprises
    • Not-for-profit
  • Participant Round-table Discussion
  • Big Data Strategic Goal
  • Group Discussion
  • Building on the Wisdom of W. Edwards Deming and Peter Drucker
  • Case Study: Using predictive analytics to identify customer behaviors
  • Big Data Sources
  • Structured Data
    • Transactional Systems
    • Metadata
    • Data Warehouses
  • Semi-structured Data
  • Unstructured Data
  • Open-source Intelligence (OSINT)
    • Introduction
    • Applications and Techniques
    • Practical Examples
  • Practical work: Accessing, downloading and pre-processing external datasets
  • Major Big Data Technologies
  • Overview and Main Characteristics
  • Benefits and Weaknesses
  • Selection Methodology

Day 2

  • Big Data Visualization and Reporting
  • Introduction to Data Storytelling
  • Essentials of Impactful Visualizations
    • Overview
    • Main Principles
    • Tools
  • Big Data Reporting
  • Big Data Analytics and Artificial Intelligence/ML
  • Overview
  • Machine Learning Categories
  • Types of Machine Learning
  • Practical Work
    • Building a Machine Learning model of a business process
    • Training a model to identify network hacking attempts based on historical data
  • Success Factors and Stumbling Blocks
  • Required Skillsets
  • Legal and Privacy Considerations
  • Delivery Approach and Stages of Experts’ Involvement
  • Group Workshop: Implementing a Big Data Initiative
  • Defining the Business Need
  • Discovering Necessary Data-sources
  • Establishing Required Resources/Expertise and Costing
  • Program Review, Q&A, Comments

Other courses you may be interested in

  • 2 days
  • 17people
  • Foundation
  • 2 days
  • 70people
  • Foundation

Want to know more?

Please fill the form below if you would like to talk to us about this course and your needs. We will get back to you within 24 hours.

First Name*
Last Name*
Tel No*
Leave a message