Scenario-based Learning - A New Learning Perspective. Node.js

Why it is Important?

Every time, we learn something. we go and find something to implement it to get a good grasp of it. From, a beginner perspective of the technology. it is great.

But, there is a gap between a beginner who learns the technology to a person who works in the production level application. we really don't know what kind of a problem the particular technology solves in real industrial applications until we work for a company/ a freelancing project

What I am going to do is, I will be sharing all the problem scenarios that I faced in the production. So, the beginner for the particular technology can replicate the scenario on his own and learn from it.

Basically, he/she is going to gain my experience through learning on his own. So, In the future, if he faces the same problem scenario, he/she can tackle it in an efficient way.

Node.js Experience

This Blog series is from my Node.js Experience. Basically, I am a React,Node.js and MongoDB Developer.

Soon, I will share problem scenarios for React as well. I will start with a simple scenario this week. In upcoming articles, I will share more complex scenarios where you can learn from it.

Scenario based Learning A New Learning Perspective

Problem Scenario

Recently, I faced a situation where I need to read a large sized file from internet and write it in my server.

To do this in Node.js, you can think of it like just read the file and write directly into the server.

But there is a problem with this approach, Let's say that we implement something like this

1const fs = require("fs")
2
3fs.readFileSync("sample.mkv", (err, data) => {
4  if (err) throw err
5
6  fs.writeFileSync("output", data, err => {
7    if (err) throw err
8  })
9})

the problem is,

There is a limit in Node.js Buffer, we can't store more than the Buffer size.

To address this problem, we need something called Stream in Node.js.

What is Stream?

Storing a complete file in Memory is so expensive. also, we need to store the file without having this problem. To solve this, we use Stream which is nothing but processing the data in chunks.

Stream process the huge data in a chunk by chunk which store the chunk in memory one at a time.

Solution

we need to create a readable stream which reads the data from the source and writable streamwhich writes the data to the destination.

If you are new to Node.js, I would suggest you watch some tutorials and try this problem scenarios.then, it will be easy to understand what is going on.

Solution Code

1const fs = require("fs")
2const stream = require("stream")
3
4//creating a readable stream
5const readable = fs.createReadStream("sample")
6
7//creating a writable stream
8const writable = fs.createWriteStream("output")
9
10fs.stat("sample", (err, stats) => {
11  this.filesize = stats.size
12
13  this.counter = 1
14
15  //this is an event which handles the data in chunks
16  readable.on("data", chunk => {
17    let percentageCopied = ((chunk.length * this.counter) / this.fileSize) * 100
18    process.stdout.clearLine()
19    process.stdout.cursorTo(0)
20    process.stdout.write(`${Math.round(percentageCopied)}%`)
21
22    //writing the chunk into writable stream output
23    writable.write(chunk)
24    this.counter += 1
25  })
26
27  readable.on("end", e => {
28    console.log("Read is completed")
29  })
30
31  readable.on("error", e => {
32    console.log("Some error occured: ", e)
33  })
34
35  writable.on("finish", () => {
36    console.log("Successfully created the file copy!")
37  })
38})

It reads the data from the local file and writes it again from another local file. For the conceptual purpose, I have used the local file itself rather than a file from the internet.

There is also a problem with this approach, if you analyze the Memory Manager in your machine while this code runs. it will take a lot of memory.

The reason being is, Disk write will not cope up with a speed of Disk Read . Reading a disk will be faster than Writing into a disk.

Beauty is, we can solve this problem too.

Efficient Solution

Node.js Stream has a solution for the above problem which is backpressure

1const stream = require("stream")
2const fs = require("fs")
3
4let fileName = "sample"
5
6const readabale = fs.createReadStream(fileName)
7const writeable = fs.createWriteStream("output")
8
9fs.stat(fileName, (err, stats) => {
10  this.fileSize = stats.size
11  this.counter = 1
12  this.fileArray = fileName.split(".")
13
14  try {
15    this.fileoutputName =
16      "output" + "/" + this.fileArray[0] + "_Copy." + this.fileArray[1]
17  } catch (e) {
18    console.exception("File name is invalid")
19  }
20
21  process.stdout.write(`File: ${this.fileoutputName} is being created:`)
22
23  readabale.on("data", chunk => {
24    let percentage = ((chunk.length * this.counter) / this.fileSize) * 100
25    process.stdout.clearLine() // clear current text
26    process.stdout.cursorTo(0)
27    process.stdout.write(`${Math.round(percentage)}%`)
28    this.counter += 1
29  })
30
31  //Note this line : Read Stream pipes the Write Streams
32  readabale.pipe(writeable)
33
34  // In case if we have an interruption while copying
35  writeable.on("unpipe", e => {
36    process.stdout.write("Write Failed!")
37  })
38})

The only change that we did with the previous solution is to pipe the readable stream to the writable stream

it will automatically control the disk read and write speed, thus it will not choke the RAM.

this is a simple solution to implement. This same concept can be used in some other Technical problem scenarios also

Scenario #2

consider a system where we have implemented a crawler which feeds the data to the Kafka. we need to get the data from Kafka pipeline and store it to Database.

Scenario #3

A User is uploading a huge size of files, we need to store it but, we can't able to store the size after a certain file size limit. what we can do is, implement the stream which reads the data and compresses it. store it in the server.

That's it for this article, Hope you like this series. I am planning to write more articles on this series if I can get a good response from this initiative.

NEW TECH UPDATES

Search This Blog