Data Segregation using Akka Streams

I am trying to segregate JSON data using Akka Streams. I have extracted the key tags needed for the segregation from the JSON data and put them into a case class.

E.g. Bucket(id: String, category: String)

    Bucket(1, type1)
    Bucket(2, type2)
    Bucket(1, type3)

Now I want to segregate the data on the basis of id.
The output should be something like: the person with id 1 has type1 and type3. In other words, whenever several entries share the same id, I want to collect and display all of their types together.

How can I implement this logic?

You can use the groupBy operator to split the stream by ID and then reduce to combine the categories.

Can you give an example of a similar scenario using groupBy and reduce?

Also, is it suitable for real-time data streaming?
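
Here is a minimal sketch for a finite stream, based on the Bucket case class from your question. The maxSubstreams bound of 10 is arbitrary, and I am assuming Akka 2.6+, where the implicit ActorSystem provides the materializer:

    import akka.actor.ActorSystem
    import akka.stream.scaladsl.{Sink, Source}

    object SegregationExample extends App {
      implicit val system: ActorSystem = ActorSystem("segregation")

      final case class Bucket(id: String, category: String)

      val buckets = List(
        Bucket("1", "type1"),
        Bucket("2", "type2"),
        Bucket("1", "type3")
      )

      Source(buckets)
        // One substream per id; 10 is an arbitrary upper bound on distinct ids.
        .groupBy(10, _.id)
        // Collapse each substream into (id, all categories seen for that id).
        .map(b => b.id -> Set(b.category))
        .reduce((acc, next) => acc._1 -> (acc._2 ++ next._2))
        // Merge the per-id results back into a single stream.
        .mergeSubstreams
        .runWith(Sink.foreach { case (id, categories) =>
          println(s"Person with id $id has ${categories.mkString(" and ")}")
        })
    }

Note that reduce only emits when its substream completes, so this works as written only because the source here is finite.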

The problem with real-time data streaming is: how do you know when a group is complete?

The source emits:

    MyClass(122, Set("apple", "banana"))
    MyClass(123, Set("papaya"))
    MyClass(122, Set("apple", "peach"))

and the expected aggregated output is:

    122 -> apple, banana, peach
    123 -> papaya

So this is the architecture. I receive data from a source and map every element into the case class I created. In the flow, the stream should then aggregate all the information for one user id in one place.
In the example above there are three elements, and the output should be: the visitor with id 122 has apple, banana and peach, and the visitor with id 123 has papaya.
This kind of aggregation should happen in the flow and then be sent to the sink. Also, the source is infinite: we receive new data every second. The purpose is to track user activity.

So is groupBy a good fit for this?
Or is there a different approach that would solve this case more efficiently?

If the source is infinite, then what should trigger sending the aggregated element downstream? Could you add some more details about the expected behavior?
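
In the meantime, one common option, if a time-based cut-off is acceptable for your use case, is to aggregate within a fixed window instead of waiting for a substream to complete. Below is a minimal sketch, assuming a MyClass(id: Int, fruits: Set[String]) case class like the one in your diagram; the 1-second window and the 1000-element batch limit are arbitrary:

    import scala.concurrent.duration._
    import akka.NotUsed
    import akka.stream.scaladsl.Flow

    // Assumed shape of the incoming elements (matches the diagram above).
    final case class MyClass(id: Int, fruits: Set[String])

    // Collect whatever arrives within one second (or up to 1000 elements,
    // whichever comes first), aggregate that batch by id, and emit the results.
    val aggregatePerWindow: Flow[MyClass, MyClass, NotUsed] =
      Flow[MyClass]
        .groupedWithin(1000, 1.second)
        .mapConcat { batch =>
          batch
            .groupBy(_.id)
            .map { case (id, elems) => MyClass(id, elems.flatMap(_.fruits).toSet) }
            .toList
        }

The trade-off is that elements for the same id arriving in different windows produce separate aggregates, so the sink (or whatever store sits behind it) still has to be able to merge them. Whether that is acceptable depends on what you want the trigger to be.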