-
Legal Materials as Big Data: (algo)Rithms Support Legal Interpretation. A Dia...
This webinar, which took place on 6 July 2021, focused on the interplay between legal data and data science. The webinar, entitled ‘Legal Materials as Big Data: (algo)Rithms to...-
.webloc
The resource: 'Webinar Link' is not accessible as guest user. You must login to access it!
-
.webloc
-
Compressed and Learned Data Structures Seminar
In this seminar cycle, students are guided in the direct usage of a powerful C++ library implementing many state-of-the-art compressed data structures for big data. Other than...-
PDF
The resource: 'A gentle introduction to ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Learned indexes, the ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'GitHub Repository' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'GitHub Repository Instructions' is not accessible as guest user. You must login to access it!
-
PDF
-
General confidentiality and utility metrics for privacy-preserving data publi...
Anonymization for privacy-preserving data publishing, also known as statistical disclosure control (SDC), can be viewed under the lens of the permutation model. According to... -
Second SoBigData Plus Plus Awareness Panel R. I. Platforms Data Part 1
This webinar, which took place on 10 November 2020, was aimed at exploring the theme of data protection and intellectual property issues in platforms. The first speaker was...-
.webloc
The resource: 'Link to the webinar' is not accessible as guest user. You must login to access it!
-
.webloc
-
Private Reddit Remote Work Dataset
The dataset was collected exclusively from Reddit using the Python library praw [Boe and Payne, 2023]. Posts were extracted from the subreddits remotework, workfromhome, and... -
NetMe
The huge amount of biological literature, which daily increases, represents a strategic resource to automatically extract and gain knowledge concerning relations among... -
Fair detection of poisoning attacks in federated learning
Federated learning is a decentralized machine learning technique that aggregates partial models trained by a set of clients on their own private data to obtain a global model....-
PDF
The resource: 'Link to Publication' is not accessible as guest user. You must login to access it!
-
PDF
-
Fairness and Abstraction in Sociotechnical Systems
A key goal of the fair-ML community is to develop machine-learning based systems that, once introduced into a social context, can achieve social and legal outcomes such as... -
Experimenting ASPen on the DBLP dataset
It has been recently proposed ASPen: an Answer Set Programming (ASP) encoding for LACE, which is a novel declarative approach to Collective Entity Resolution in the classical... -
Structural Invariants in Individuals Language Use The Ego Network of Words
The cognitive constraints that humans exhibit in their social interactions have been extensively studied by anthropologists, who have highlighted their regularities across... -
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.-
CSV
The resource: 'CeTEMPS Dataset up to 2023' is not accessible as guest user. You must login to access it!
-
CSV
The resource: 'ARTA AirQuality up to 2023' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'ESA Sentinel 5P NO2 daily ...' is not accessible as guest user. You must login to access it!
-
HTML
The resource: 'Map of the area pollutants ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Designing for human rights in AI
In the age of Big Data, companies and governments are increasingly using algorithms to inform hiring decisions, employee management, policing, credit scoring, insurance... -
OpenAirInterface 5G Dataset: UDP Downlink Traffic Experiment with Three UEs
The dataset comprises logs generated by OpenAirInterface (OAI) at the Medium Access Control (MAC) layer. It consists of a CSV file containing data collected from experiments... -
Tutorial on Learning To Rank
Efficiency/Effectiveness Trade-offs in Learning to Rank” tutorial by Claudio Lucchese and Franco Maria Nardini at the European Conference on Machine Learning and Principles...-
DOCX
The resource: 'Instructions' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Slides-Part1' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Slides-Part2' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Slides-Part3' is not accessible as guest user. You must login to access it!
-
ipynb
The resource: 'HandsOn-1' is not accessible as guest user. You must login to access it!
-
ipynb
The resource: 'HandsOn-2' is not accessible as guest user. You must login to access it!
-
tar.gz
The resource: 'Hands-On 1/2, QuickRank ...' is not accessible as guest user. You must login to access it!
-
DOCX
-
Democratizing Algorithmic Fairness
Machine learning algorithms can now identify patterns and correlations in (big) datasets and predict outcomes based on the identified patterns and correlations. They can then... -
European Data Governance Act
The proposal for a Regulation of the European Parliament and of the Council on data governance is the first of a set of measures announced in the 2020 European strategy for...-
HTML
The resource: 'html' is not accessible as guest user. You must login to access it!
-
HTML
-
Telegram data qanonEN chats
This dataset consists of English-language chats involved in conspiracy discussions on Telegram. The data was collected using a snowball crawling technique that leverages... -
Proposal for a Regulation of the European Parliament and the Council laying d...
Our remarks focus on two main issues: 1) providing operational tools to link the ethics and the legal dimension of a Trustworthy AI avoiding risks of ethics washing; 2) the... -
OpenAirInterface 5G Dataset: TCP Downlink Traffic Experiment with Two UEs
The dataset comprises logs generated by OpenAirInterface (OAI) at the Medium Access Control (MAC) layer. It consists of a CSV file containing data collected from experiments... -
OpenAirInterface 5G Dataset: UDP Uplink Traffic Experiment with Two UEs
The dataset comprises logs generated by OpenAirInterface (OAI) at the Medium Access Control (MAC) layer. It consists of a CSV file containing data collected from experiments...
