Actor-based architecture for cloud services orchestration: The case of social media data extraction / Cavalli, S.; Cagnoni, S.; Lombardo, G.; Poggi, A. - 2706 (2020), pp. 174-183. (Paper presented at the 21st Workshop "From Objects to Agents", WOA 2020, held in Italy in 2020).

Actor-based architecture for cloud services orchestration: The case of social media data extraction

Cavalli S.; Cagnoni S.; Lombardo G.; Poggi A.
2020-01-01

Abstract

In this paper we present a distributed system for social media scraping that aims to acquire an arbitrarily large amount of information from social networks by exploiting an actor-based solution able to efficiently orchestrate several cloud services. Our goal is to ensure correct interaction among actors by means of a master node, based on the ActoDeS architecture, which manages the communications, the interface, and the messages exchanged by client nodes. As a use case, we consider Twitter as the social media platform because of the key role it plays in modern society, as shown by Google Trends data. However, Twitter’s search API has many limitations and cannot be used to obtain millions of records covering a monthly or annual time range. Thus, we have designed a distributed solution that overcomes these constraints without violating current laws on the subject or Twitter’s policies.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11381/2886640