-
Wi-Fi Dataset of wireless channel samplings
The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key... -
ZIP
-
dolly-15k-it
This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by... -
jsonl
-
Private Environmental Monitoring of Fluorescence Response
We study a novel sequential decision-making setting, namely the dissimilarity bandits. At each round, the learner pulls an arm that provides a stochastic d-dimensional... -
Private ThinkEngine
ThinkEngine is a tool for integrating declarative automated reasoning modules into 3D simulations and video games built with the Unity development engine. -
Integrating Direct Intracranial Stimulation with the Human Connectome
Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12... -
Private ltlf2asp
Linear Temporal Logic over Finite Traces (LTLf) is a popular logic to reason about finite sequences of events. In LTLf, the (bounded) satisfiability problem refers to whether... -
Private Cybersecurity NER BERT-base-cased model
This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that... -
How Can Big Data Analytics Help Understand Migrant Integration?
Adequate data are key for evidence-based policymaking. However, while a large amount of official statistics is produced across European Union member States, only a small part... -
Experimental results from the Empirical Investigation of the Completeness of ...
This is the raw data from the empirical investigation of the paper “Completeness of Datasets Documentation on ML/AI repositories: an Empirical Investigation”. This work aim of... -
Graph-Informed Neural Networks
In this repository, we publish the code necessary to implement the Graph-Informed Neural Networks (GINNs), presented for the first time in the paper: Graph-Informed Neural... -
Private Optimizing Empty Container Repositioning and Fleet Deployment via Configurabl...
We introduce a novel framework, Configurable SemiPOMDPs, to model this type of problem. Furthermore, we provide a two-stage learning algorithm, “Configure & Conquer”... -
Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...
This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022.... -
Academic mobility from a big data perspective
Understanding the careers and movements of highly skilled people plays an ever-increasing role in today’s global knowledge-based economy. Researchers and academics are sources... -
EnviroStream (Benchmark)
Stream Reasoning (SR) focuses on developing advanced approaches for applying inference to dynamic data streams; it has become increasingly relevant in various application... -
EMAKG: Enhanced Microsoft Academic Knowledge Graph
The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and... -
Where do migrants and natives belong in a community: a Twitter case study and...
Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours... -
Private Cybersecurity NER dataset
Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and... -
Iperf K8s-based Power and Resource consumption dataset
The data were collected in a Prometheus-like data format: each entry has a timestamp, a value and key-value labels containing additional information. Metrics were gathered...-
CSV
-
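The Prometheus-like layout described for the Iperf K8s dataset (a timestamp, a value, and key-value labels per entry) could be read along the following lines. This is only a minimal sketch: the column names `timestamp` and `value`, the label names `pod` and `metric`, and the sample rows are all assumptions made for illustration, not taken from the actual CSV resource.

```python
import csv
import io
from dataclasses import dataclass, field


@dataclass
class Sample:
    """One metric entry: a timestamp, a numeric value, and free-form labels."""
    timestamp: float
    value: float
    labels: dict = field(default_factory=dict)


def parse_samples(text):
    """Parse CSV text; 'timestamp' and 'value' are assumed column names,
    and every remaining column is treated as a key-value label."""
    samples = []
    for row in csv.DictReader(io.StringIO(text)):
        ts = float(row.pop("timestamp"))
        val = float(row.pop("value"))
        samples.append(Sample(ts, val, dict(row)))
    return samples


# Hypothetical rows, invented purely to exercise the parser.
example = (
    "timestamp,value,pod,metric\n"
    "1700000000,12.5,iperf-client,cpu_usage\n"
    "1700000015,13.0,iperf-client,cpu_usage\n"
)
samples = parse_samples(example)
```

Keeping the labels in a plain dictionary means entries with different label sets can share one record type, which suits a format where the labels carry the additional information.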
Stroke and sepsi
The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
