41 items found

Formats: ZIP Groups: sobigdata-eu

Filter Results
  • Dataset

    InCop Test Data Set

    The datasets were designed to monitor and predict stress levels among operators in an industrial environment through the use of smartwatches. The structure is hierarchical and...
    • ZIP
      The resource: 'dataset_fair_incop' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over L'Aquila, Amatrice and Avezzano Regions

    These datasets have been collected through ESA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. The extracted...
    • ZIP
      The resource: 'dataAqAmatriceAvezzano' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'DatiSentinelAQAmatrice1625' is not accessible as guest user. You must login to access it!
  • Dataset

    LLM-Driven Explanations for Quantum Algorithms

    This item contains the replication package of the paper Exploring LLM-Driven Explanations for Quantum Algorithms. In particular, it contains the explanations generated by a...
    • ZIP
      The resource: 'Replication Package' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over Pescara Region

    These datasets have been collected through ESA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. The extracted...
    • ZIP
      The resource: 'dailyPEdata' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Dataset for Photovoltaic Plants

    This synthetic dataset was generated using Gaussian Copula Synthesizer, based on real data from three different photovoltaic plants. The dataset is structured to preserve the...
    • ZIP
      The resource: 'Synthetic_PhotovoltaicSystems' is not accessible as guest user. You must login to access it!
  • Dataset

    Gender Equality Plans in Italian Universities

    This dataset contains the documents describing the gender equality plans extracted for each public Italian university. Documents are divided by Italian regions and have been...
    • ZIP
      The resource: 'final_gep_dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Privacy Policies Compliance

    Dataset composed of 30 privacy policies of online platforms, annotated to assess the level of comprehensiveness of information. This work focuses on the processed categories...
    • ZIP
      The resource: 'Privacy-Policies-Compliance ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over L'Aquila Region

    These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
    • CSV
      The resource: 'CeTEMPS Dataset up to 2023' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'ARTA AirQuality up to 2023' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'ESA Sentinel 5P NO2 daily ...' is not accessible as guest user. You must login to access it!
    • HTML
      The resource: 'Map of the area pollutants ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Italian Common Procurement Vocabulary (CPV)

    This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
    • ZIP
      The resource: '10007545' is not accessible as guest user. You must login to access it!
  • Dataset

    EVALITA 2020 HT

    This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...
    • ZIP
      The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
  • Experiment

    Annotazione semantica di delibere comunali

    Progetto POC per l'uso delle tecniche di text mining su documenti della pubblica amministrazione per migliorare la trasparenza e l’accesso alle informazioni da parte dei...
    • PDF
      The resource: 'Annotazione Delibere' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Codice sorgente' is not accessible as guest user. You must login to access it!
  • Dataset

    EUR-Lex MOSTA

    This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...
    • ZIP
      The resource: 'EUR-Lex MOSTA' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi Dataset of wireless channel samplings

    The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
    • ZIP
      The resource: 'SoBigData_Wi-Fi_Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
      The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Dataset

    Reddit Echo Chamber dataset

    In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...
    • ZIP
      The resource: 'Reddit Echochamber' is not accessible as guest user. You must login to access it!
  • Dataset

    Fire smoke detection dataset

    Dataset of fire, non fire, and smoke images
    • ZIP
      The resource: 'Ilenia Ficili' is not accessible as guest user. You must login to access it!
  • Dataset

    DNA 12-mers

    A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 12-mers' is not accessible as guest user. You must login to access it!