Uncategorized
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Scala, Notebooks and Spark part 1
Section Introduction Hi and welcome to this section, in which we are going to have a primer on Python and on Scala, just one video on notebooks (Jupyter notebooks), and then we look at the Spark pool that is available in Azure Synapse. Now, by no means whatsoever is this supposed to be an extensive course on using Scala or using Python. The reason that I’m having this in place, as students are always aware based on my courses, is that I want to set a base for students to understand what I’m…
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 6
Custom Serialization Formats Now, in this chapter I just want to give a note when it comes to the custom serialization feature that you have as part of Azure Stream Analytics. So when you go on to your inputs, so if I go on to any one of our inputs here, if I scroll down and you look at the event serialization format, this means what is the format of the events that are coming into the Stream Analytics job. So here you have the options of JSON, CSV…
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 5
Lab – Reading Network Security Group Logs – Server Setup Now I want to show you another example of how we can stream data in Azure Stream Analytics. Now, for this, I’m going to make use of a Windows virtual machine that I have in Azure. This is again based on the Windows Server 2019 operating system. So again, I use the same method to create a virtual machine. We had already seen this when working with the self-hosted integration runtime environment when it came on to Azure…
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 4
Lab – Adding multiple outputs Right. So in the last chapter I had shown you an example of how to use the tumbling window windowing function to get the summary results. But this was just in the query, in the test query; we didn’t actually run the job. Now, the reason for this is I want to go on to my next script. Now here I want to show how to add multiple outputs. This is also possible. So here, this is the same, I am selecting this. This…
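To make the tumbling window idea concrete, here is a minimal Python sketch of what such a window does: it cuts the event stream into fixed, non-overlapping time slices and produces one summary row per slice. This is only an illustration, not the Stream Analytics query itself. The function name `tumbling_window_counts`, the sample events and the 10-second window size are all assumptions made for this example, and the sketch labels each window by its start time for simplicity.

```python
from collections import defaultdict
from datetime import datetime, timezone


def tumbling_window_counts(events, window_seconds):
    """Count events per fixed, non-overlapping window (hypothetical helper).

    events: iterable of (timestamp, payload) pairs with timezone-aware datetimes.
    Returns a dict mapping each window's start time to its event count.
    """
    counts = defaultdict(int)
    for timestamp, _payload in events:
        epoch = int(timestamp.timestamp())
        # Round the timestamp down to the start of the window it falls in.
        window_start = epoch - (epoch % window_seconds)
        counts[datetime.fromtimestamp(window_start, tz=timezone.utc)] += 1
    return dict(sorted(counts.items()))


# Made-up sample events, only for illustration.
sample_events = [
    (datetime(2024, 1, 1, 0, 0, 1, tzinfo=timezone.utc), "a"),
    (datetime(2024, 1, 1, 0, 0, 9, tzinfo=timezone.utc), "b"),
    (datetime(2024, 1, 1, 0, 0, 10, tzinfo=timezone.utc), "c"),
    (datetime(2024, 1, 1, 0, 0, 25, tzinfo=timezone.utc), "d"),
]

result = tumbling_window_counts(sample_events, window_seconds=10)
for window_start, count in result.items():
    print(window_start.isoformat(), count)
```

Each event lands in exactly one window, which is what separates a tumbling window from overlapping window types.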
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 3
Lab – Reading data from a JSON file – Setup Now, before we can actually connect our Stream Analytics job onto our DB hub, I want to show you how you can also process data that’s stored in your Azure Data Lake Gen2 storage account. Now, in order to change our query, the first thing that we need to do is to stop our job. So let me click on Stop and let me click on Yes. Now, in my Data Lake Gen2 account, I’ll go on to…
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 2
What is Azure Stream Analytics In this chapter, I want to give an introduction to Azure Stream Analytics. So, this is a real-time analytics and event processing service. So I’ve taken this diagram directly from the Microsoft documentation because this gives a good picture of what this service can achieve. So, on the left-hand side, we are looking at ingesting data from a variety of data sources. So you could ingest your data that’s coming in from your IoT devices, from your log files, from your SaaS…
DP-203 Data Engineering on Microsoft Azure – Design and Develop Data Processing – Azure Event Hubs and Stream Analytics part 1
Batch and Real-Time Processing Now in this section, we are going to see how to start working with streaming data. So, in the earlier section, we had looked at using Azure Data Factory as an ETL tool, an extract, transform and load tool. And this is normally used in the use cases when it comes to batch processing. So in batch processing, you normally take large amounts of data from the source, you transform it and then load it onto a destination data store. For example, you can take data…
CompTIA Project+ PK0-004 – Managing the Project Stakeholders
What’s a stakeholder? Identifying stakeholders is a project management process that you want to do as early as possible in the project. What you don’t want to happen is to be months into your project and then suddenly discover a group of stakeholders that you have not contacted before. That group is not going to be very happy with you, and they’re probably going to have some influence that can change your project. So identifying the stakeholders, you want to do as quickly and as early as possible. Well, we…
CompTIA Project+ PK0-004 – Managing the Project Risks part 2
Using quantitative risk analysis We’ve discussed qualitative risk analysis, where we qualify risk for additional analysis. Well, that additional analysis is quantitative, where we are trying to quantify the probability and the impact of risk events. So in order to do quantitative risk analysis, we have to first understand what risk events will come into this, what risk events qualify, and then we may quantify them. And what we’re quantifying is probability and impact. This helps with decision making for risk response. It really allows us to see,…
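One common way to combine probability and impact into a single number is expected monetary value, which is simply probability multiplied by impact. The excerpt does not name a specific technique, so treat the following Python sketch as an assumption about how that quantification might look. The function name, the risk names and all the figures are hypothetical.

```python
def expected_monetary_value(probability, impact):
    """Return probability * impact for one risk event (hypothetical helper).

    probability: chance of the event occurring, between 0 and 1.
    impact: monetary effect if it occurs (negative for a cost, positive for a gain).
    """
    if not 0.0 <= probability <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    return probability * impact


# Made-up risk register: (name, probability, impact in dollars).
risk_register = [
    ("Vendor delay", 0.30, -20000),
    ("Early delivery discount", 0.20, 5000),
]

# Summing the individual values gives an overall exposure for the listed risks.
total_exposure = sum(
    expected_monetary_value(probability, impact)
    for _name, probability, impact in risk_register
)
print(f"Total exposure: {total_exposure:.2f}")
```

A negative total suggests the listed risks cost more than they could gain, which is the kind of figure that can feed a risk response decision.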
CompTIA Project+ PK0-004 – Managing the Project Risks part 1
What’s a risk appetite? A term that we use often in project management, especially from a portfolio management perspective, is risk appetite. In this lecture, we’ll take a quick look at exploring attitudes towards risk. Often when we think about risk, we think of when we’re taking a risk that’s dangerous, but really risk, it’s not bad. It’s the impact that can affect us. So for example, you have investments in the stock market. You could make money or you could lose money. So that is a risk. Now, financial…