Skip to main content

Node.js: Downloading a xml file from given url and reading its data elements.



In this article, we will see how we can download an XML file from a given URL
and then access its elements.
This can be used in many cases, for example, scraping data from a site which has data in the XML file or multiple files.
The article is pretty short and most of it is just self-explanatory code.
Also nowadays, you can get a lot of freelance jobs in data scrapping as data is the new oil today so this might be helpful.

So let's begin with the code,

var fs = require('fs')
var https = require('https');//For accessing https url we need this module instead of http.
var xml2js = require('xml2js');//Required for xml parsing.
var file_name = 'data.xml'//This will be the name of file we will be generating.
var DOWNLOAD_DIR =__dirname+'/';


//This function reads data from URL and writes data into new file
//with respect to the given name and directory path.


function download(){
 var file_url='https://www.w3schools.com/xml/note.xml'
 var file =

 fs.createWriteStream(DOWNLOAD_DIR +file_name,{'flags': 'w'});
 const request = https.get(file_url, function(response) {
 response.pipe(file);
 });
}


//This function reads data from the XML file and parses it into JSON 

//format to access its elements.

function read(){
 var fileData = fs.readFileSync(file_name, 'ascii');
 var parser = new xml2js.Parser();
 parser.parseString(fileData.substring(0, fileData.length),

 function (err, result) {
   console.log(result)//Here you will get data in json format.
 }); 
}

Note: 
1)In Node.js, __dirname is always the directory in which the currently executing script resides.
So if you typed __dirname into /A1/A2/script.js, the value would be /A1/A2.

2)The pipe() function reads data from a readable stream as it becomes available and writes it to a destination writable stream. In our code variable file is the writable stream and response is the readable stream. This is also a most asked interview question for node.js developer profile.



Comments

Post a Comment

Popular posts from this blog

Coding Interview Questions Set 1

From this article, I will be sharing with you all a series of articles on coding interview questions. So please stay connected for the latest set of questions. It will be a good brainstorming exercise and will also prepare you for coding interviews and will definitely boost your confidence. So let's start, 1)Reverse of a string with only O(1) extra memory. Solution: var reverse = function(string) {     let result = ''     for(let i= string.length -1; i >= 0; i--){         result += string[i];     }     return result; }; 2)Fizz Buzz: Write a program that will accept a number n and will output number till n but for multiples of three it should output “Fizz” instead of the number and for the multiples of five output “Buzz”. For numbers which are multiples of both three and five output “FizzBuzz”. Solution: var fizzBuzz= function(n) {     const arr=[]     for(i=1; i<=n; i++){ ...

JavaScript: Debouncing | Flipkart UI Interview Question.

To understand this programming practice will go with a quick example. We all do online shopping and we do search for products on Flipkart on the search bar. So when we type on search bar i.e on every keypress we are actually giving API requests to the server, which in response gives us suggestions. Actually, it's a time-consuming task and may degrade the performance of the webpage. So to improve the performance we use debounce function which actually acts as a controller which reduces the API request to the server on keypress event. In simple terms, we are giving API request only after a certain amount of time between two keypress events. A debounce is a higher-order function. Higher-order function: A function that returns another function. Let's dive deep: HTML Code: <input type="text" onkeyup="searchProducts()"/> Consider this as a search bar for Flipkart. On any keypress, we are calling searchProducts() function. Javascri...

Node.js: Bundling your Node.js application to single executable for Windows.

In this article, we will see how to bundle Node.js application to a single executable for Windows. What's the need? Well recently, I had taken a work where I needed to convert pdf's(Of similar format) to excel sheet. So I was reading the pdf's from a folder in desktop and I was storing the output excel sheet into a separate folder on the desktop. I used Node.js for the program. Now the client wanted it to install the program on 25 windows machine and his budget was really low. So it was also not possible for me to install node.js for 25 machines and then install the required dependency for each one. One of the solution: While I was searching for an easy solution I found this amazing npm module pkg . This module can make your node.js app work like plug and play type. No need to install Node.js on the client machine or any other dependency.  It helps to make a commercial or trial version of your node.js application without exposing the source code. I found ...