Python in Data Science Training Course
The training course will help the participants prepare for Web Application Development using Python Programming with Data Analytics. Such data visualization is a great tool for Top Management in decision making.
This course is available as onsite live training in France or online live training.Course Outline
Day 1
- Data Science
- Data Science Team Composition (Data Scientist, Data Engineer, Data Visualizer, Process Owner)
- Business Intelligence
- Types of Business Intelligence
- Developing Business Intelligence Tools
- Business Intelligence and the Data Visualization
- Data Visualization
- Importance of Data Visualization
- The Visual Data Presentation
- The Data Visualization Tools (infographics, dials and gauges, geographic maps, sparklines, heat maps, and detailed bar, pie and fever charts)
- Painting by Numbers and Playing with Colors in Making Visual Stories
- Activity
Day 2
- Data Visualization in Python Programming
- Data Science with Python
- Review on Python Fundamentals
- Variables and Data Types (str, numeric, sequence, mapping, set types, Boolean, binary, casting)
- Operators, Lists, Tuples. Sets, Dictionaries
- Conditional Statements
- Functions, Lambda, Arrays, Classes, Objects, Inheritance, Iterators
- Scope, Modules, Dates, JSON, RegEx, PIP
- Try / Except, Command Input, String Formatting
- File Handling
- Activity
Day 3
- Python and MySQL
- Creating Database and Table
- Manipulating Database (Insert, Select, Update, Delete, Where Statement, Order by)
- Drop Table
- Limit
- Joining Tables
- Removing List Duplicates
- Reverse a String
- Data Visualization with Python and MySQL
- Using Matplotlib (Basic Plotting)
- Dictionaries and Pandas
- Logic, Control Flow and Filtering
- Manipulating Graphs Properties (Font, Size, Color Scheme)
- Activity
Day 4
- Plotting Data in Different Graph Format
- Histogram
- Line
- Bar
- Box Plot
- Pie Chart
- Donut
- Scatter Plot
- Radar
- Area
- 2D / 3D Density Plot
- Dendogram
- Map (Bubble, Heat)
- Stacked Chart
- Venn Diagram
- Seaborn
- Activity
Day 5
- Data Visualization with Python and MySQL
- Group Work: Create a Top Management Data Visualization Presentation Using ITDI Local ULIMS Data
- Presentation of Output
Requirements
- An understanding of Data Structure.
- Experience with Programming.
Audience
- Programmers
- Data Scientist
- Engineers
Open Training Courses require 5+ participants.
Python in Data Science Training Course - Booking
Python in Data Science Training Course - Enquiry
Python in Data Science - Consultancy Enquiry
Consultancy Enquiry
Testimonials (1)
Trainer was accommodative. And actually quite encouraging for me to take up the course.
Grace Goh - DBS Bank Ltd
Course - Python in Data Science
Upcoming Courses
Related Courses
Anaconda Ecosystem for Data Scientists
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at data scientists who wish to use the Anaconda ecosystem to capture, manage, and deploy packages and data analysis workflows in a single platform.
By the end of this training, participants will be able to:
- Install and configure Anaconda components and libraries.
- Understand the core concepts, features, and benefits of Anaconda.
- Manage packages, environments, and channels using Anaconda Navigator.
- Use Conda, R, and Python packages for data science and machine learning.
- Get to know some practical use cases and techniques for managing multiple data environments.
Big Data Business Intelligence for Telecom and Communication Service Providers
35 HoursOverview
Communications service providers (CSP) are facing pressure to reduce costs and maximize average revenue per user (ARPU), while ensuring an excellent customer experience, but data volumes keep growing. Global mobile data traffic will grow at a compound annual growth rate (CAGR) of 78 percent to 2016, reaching 10.8 exabytes per month.
Meanwhile, CSPs are generating large volumes of data, including call detail records (CDR), network data and customer data. Companies that fully exploit this data gain a competitive edge. According to a recent survey by The Economist Intelligence Unit, companies that use data-directed decision-making enjoy a 5-6% boost in productivity. Yet 53% of companies leverage only half of their valuable data, and one-fourth of respondents noted that vast quantities of useful data go untapped. The data volumes are so high that manual analysis is impossible, and most legacy software systems can’t keep up, resulting in valuable data being discarded or ignored.
With Big Data & Analytics’ high-speed, scalable big data software, CSPs can mine all their data for better decision making in less time. Different Big Data products and techniques provide an end-to-end software platform for collecting, preparing, analyzing and presenting insights from big data. Application areas include network performance monitoring, fraud detection, customer churn detection and credit risk analysis. Big Data & Analytics products scale to handle terabytes of data but implementation of such tools need new kind of cloud based database system like Hadoop or massive scale parallel computing processor ( KPU etc.)
This course work on Big Data BI for Telco covers all the emerging new areas in which CSPs are investing for productivity gain and opening up new business revenue stream. The course will provide a complete 360 degree over view of Big Data BI in Telco so that decision makers and managers can have a very wide and comprehensive overview of possibilities of Big Data BI in Telco for productivity and revenue gain.
Course objectives
Main objective of the course is to introduce new Big Data business intelligence techniques in 4 sectors of Telecom Business (Marketing/Sales, Network Operation, Financial operation and Customer Relation Management). Students will be introduced to following:
- Introduction to Big Data-what is 4Vs (volume, velocity, variety and veracity) in Big Data- Generation, extraction and management from Telco perspective
- How Big Data analytic differs from legacy data analytic
- In-house justification of Big Data -Telco perspective
- Introduction to Hadoop Ecosystem- familiarity with all Hadoop tools like Hive, Pig, SPARC –when and how they are used to solve Big Data problem
- How Big Data is extracted to analyze for analytics tool-how Business Analysis’s can reduce their pain points of collection and analysis of data through integrated Hadoop dashboard approach
- Basic introduction of Insight analytics, visualization analytics and predictive analytics for Telco
- Customer Churn analytic and Big Data-how Big Data analytic can reduce customer churn and customer dissatisfaction in Telco-case studies
- Network failure and service failure analytics from Network meta-data and IPDR
- Financial analysis-fraud, wastage and ROI estimation from sales and operational data
- Customer acquisition problem-Target marketing, customer segmentation and cross-sale from sales data
- Introduction and summary of all Big Data analytic products and where they fit into Telco analytic space
- Conclusion-how to take step-by-step approach to introduce Big Data Business Intelligence in your organization
Target Audience
- Network operation, Financial Managers, CRM managers and top IT managers in Telco CIO office.
- Business Analysts in Telco
- CFO office managers/analysts
- Operational managers
- QA managers
Scaling Data Analysis with Python and Dask
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to build, scale, and analyze large datasets.
By the end of this training, participants will be able to:
- Set up the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (Numpy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Data Analysis with Python, Pandas and Numpy
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
Data Science for Big Data Analytics
35 HoursBig data is data sets that are so voluminous and complex that traditional data processing application software are inadequate to deal with them. Big data challenges include capturing data, data storage, data analysis, search, sharing, transfer, visualization, querying, updating and information privacy.
Data Science essential for Marketing/Sales professionals
21 HoursThis course is meant for Marketing Sales Professionals who are intending to get deeper into application of data science in Marketing/ Sales. The course provides
detailed coverage of different data science techniques used for “upsale”, “cross-sale”, market segmentation, branding and CLV.
Difference of Marketing and Sales - How is that sales and marketing are different?
In very simplewords, sales can be termed as a process which focuses or targets on individuals or small groups. Marketing on the other hand targets a larger group or the general public. Marketing includes research (identifying needs of the customer), development of products (producing innovative products) and promoting the product (through advertisements) and create awareness about the product among the consumers. As such marketing means generating leads or prospects. Once the product is out in the market, it is the task of the sales person to persuade the customer to buy the product. Sales means converting the leads or prospects into purchases and orders, while marketing is aimed at longer terms, sales pertain to shorter goals.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 HoursThis instructor-led, live training in (online or onsite) is aimed at developers who wish to use the FARM (FastAPI, React, and MongoDB) stack to build dynamic, high-performance, and scalable web applications.
By the end of this training, participants will be able to:
- Set up the necessary development environment that integrates FastAPI, React, and MongoDB.
- Understand the key concepts, features, and benefits of the FARM stack.
- Learn how to build REST APIs with FastAPI.
- Learn how to design interactive applications with React.
- Develop, test, and deploy applications (front end and back end) using the FARM stack.
Developing APIs with Python and FastAPI
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at developers who wish to use FastAPI with Python to build, test, and deploy RESTful APIs easier and faster.
By the end of this training, participants will be able to:
- Set up the necessary development environment to develop APIs with Python and FastAPI.
- Create APIs quicker and easier using the FastAPI library.
- Learn how to create data models and schemas based on Pydantic and OpenAPI.
- Connect APIs to a database using SQLAlchemy.
- Implement security and authentication in APIs using the FastAPI tools.
- Build container images and deploy web APIs to a cloud server.
Jupyter for Data Science Teams
7 HoursThis instructor-led, live training in France (online or onsite) introduces the idea of collaborative development in data science and demonstrates how to use Jupyter to track and participate as a team in the "life cycle of a computational idea". It walks participants through the creation of a sample data science project based on top of the Jupyter ecosystem.
By the end of this training, participants will be able to:
- Install and configure Jupyter, including the creation and integration of a team repository on Git.
- Use Jupyter features such as extensions, interactive widgets, multiuser mode and more to enable project collaboraton.
- Create, share and organize Jupyter Notebooks with team members.
- Choose from Scala, Python, R, to write and execute code against big data systems such as Apache Spark, all through the Jupyter interface.
Kaggle
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at data scientists and developers who wish to learn and build their careers in Data Science using Kaggle.
By the end of this training, participants will be able to:
- Learn about data science and machine learning.
- Explore data analytics.
- Learn about Kaggle and how it works.
MATLAB Fundamentals, Data Science & Report Generation
35 HoursIn the first part of this training, we cover the fundamentals of MATLAB and its function as both a language and a platform. Included in this discussion is an introduction to MATLAB syntax, arrays and matrices, data visualization, script development, and object-oriented principles.
In the second part, we demonstrate how to use MATLAB for data mining, machine learning and predictive analytics. To provide participants with a clear and practical perspective of MATLAB's approach and power, we draw comparisons between using MATLAB and using other tools such as spreadsheets, C, C++, and Visual Basic.
In the third part of the training, participants learn how to streamline their work by automating their data processing and report generation.
Throughout the course, participants will put into practice the ideas learned through hands-on exercises in a lab environment. By the end of the training, participants will have a thorough grasp of MATLAB's capabilities and will be able to employ it for solving real-world data science problems as well as for streamlining their work through automation.
Assessments will be conducted throughout the course to gauge progress.
Format of the Course also allows for the evaluation of participants.
- Course includes theoretical and practical exercises, including case discussions, sample code inspection, and hands-on implementation.
Note
- Practice sessions will be based on pre-arranged sample data report templates. If you have specific requirements, please contact us to arrange.
Accelerating Python Pandas Workflows with Modin
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Implement the entire Pandas API and functions.
Python Programming for Finance
35 HoursPython is a programming language that has gained huge popularity in the financial industry. Adopted by the largest investment banks and hedge funds, it is being used to build a wide range of financial applications ranging from core trading programs to risk management systems.
In this instructor-led, live training, participants will learn how to use Python to develop practical applications for solving a number of specific finance related problems.
By the end of this training, participants will be able to:
- Understand the fundamentals of the Python programming language
- Download, install and maintain the best development tools for creating financial applications in Python
- Select and utilize the most suitable Python packages and programming techniques to organize, visualize, and analyze financial data from various sources (CSV, Excel, databases, web, etc.)
- Build applications that solve problems related to asset allocation, risk analysis, investment performance and more
- Troubleshoot, integrate, deploy, and optimize a Python application
Audience
- Developers
- Analysts
- Quants
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- This training aims to provide solutions for some of the principle problems faced by finance professionals. However, if you have a particular topic, tool or technique that you wish to append or elaborate further on, please please contact us to arrange.
GPU Data Science with NVIDIA RAPIDS
14 HoursThis instructor-led, live training in France (online or onsite) is aimed at data scientists and developers who wish to use RAPIDS to build GPU-accelerated data pipelines, workflows, and visualizations, applying machine learning algorithms, such as XGBoost, cuML, etc.
By the end of this training, participants will be able to:
- Set up the necessary development environment to build data models with NVIDIA RAPIDS.
- Understand the features, components, and advantages of RAPIDS.
- Leverage GPUs to accelerate end-to-end data and analytics pipelines.
- Implement GPU-accelerated data preparation and ETL with cuDF and Apache Arrow.
- Learn how to perform machine learning tasks with XGBoost and cuML algorithms.
- Build data visualizations and execute graph analysis with cuXfilter and cuGraph.