27 items found

Licenses: Academic Free License 3.0 Organisations: SoBigData Catalogue

  • Dataset

    Private Terrorists recruitment text classification dataset for PRESERVE (AI generated)

    This dataset contains labelled conversations that correspond to conversations in forums, social media or instant messaging applications. It's a dataset for binary...
  • Dataset

    Private SubCat: A Dataset of Subordinate Categories in Human Mind and LLMs for the It...

    People can categorize the same entity at multiple taxonomic levels, such as basic (bear), superordinate (animal), and subordinate (grizzly bear). While prior research has...
  • Method

    Private AE-SAD

    TensorFlow implementation of AE-SAD. This repository provides a TensorFlow implementation of the AE-SAD method for (semi-)supervised anomaly detection. Citation and Contact...
  • Dataset

    Synthetic data for recruitment

    The datasets consist of a pair of tabular datasets, 2000 curricula and 2000 job offers, generated by a trained generative causal model. The generation process followed a causal graph...
    • ZIP
      Resource: 'Synthetic data for re ...' (login required)
  • Dataset

    CoSRec

    CoSRec is the first dataset explicitly designed for joint Conversational Search and Recommendation (CSR) tasks. CoSRec comprises approximately 9,000 user-system conversations...
    • Resource: 'CoSRec GitHub' (login required)
    • Resource: 'CoSRec SBD' (login required)
  • Dataset

    Private Shifting LLMs style to fool Machine Generated Text detectors

    Datasets of synthetic news articles generated by aligning LLMs using Direct Preference Optimization to shift the machine-generated texts' (MGT) style toward human-written text...
  • Dataset

    LLM-Driven Explanations for Quantum Algorithms

    This item contains the replication package of the paper Exploring LLM-Driven Explanations for Quantum Algorithms. In particular, it contains the explanations generated by a...
    • ZIP
      Resource: 'Replication Package' (login required)
  • Dataset

    Gender Equality Plans in Italian Universities

    This dataset contains the documents describing the gender equality plans extracted for each public Italian university. Documents are divided by Italian regions and have been...
    • ZIP
      Resource: 'final_gep_dataset' (login required)
  • Dataset

    Bark Beetle Outbreak Czech Republic

    Repository containing a satellite dataset created for bark beetle outbreak detection in satellite (Sentinel-1 and Sentinel-2) images. The dataset refers to scenes observed in...
    • Resource: 'Czech Republic' (login required)
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
      Resource: 'GiveMeSomeCreditSC' (login required)
  • Dataset

    Frank Experiments

    Dataset with experimental results for the "Frank" hybrid decision-making system, with simulated users. Features: - CA. Co-evolutionary Accuracy. Accuracy reached by the user...
    • JSON
      Resource: 'Frank Experiments Dataset' (login required)
  • Dataset

    HANSEN: Spoken Text Authorship Analysis

    HANSEN encompasses meticulous curation of existing speech datasets accompanied by transcripts, alongside the creation of novel AI-generated spoken text datasets....
    • Resource: 'Datasets' (login required)
    • CSV
      Resource: 'Churn Dataset' (login required)
  • Dataset

    Medical Dataset

    The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis...
    • ZIP
      Resource: 'Medical Dataset' (login required)
  • Dataset

    Dataset Adult

    The adult dataset includes 48,842 instances with demographic information like age, workclass, marital-status, race, capital-loss, capital-gain etc. The income attribute...
    • CSV
      Resource: 'Adult' (login required)
  • Dataset

    German Credit

    In the German Credit dataset, each of the 1,000 persons is classified as a good or bad creditor according to attributes like age, sex, checking_account, credit_amount,...
    • CSV
      Resource: 'German Credit' (login required)
  • Dataset

    Compas

    The compas dataset contains the features used by the COMPAS algorithm for scoring defendants and their risk (Low, Medium and High), for over 4,000 individuals. We considered...
    • CSV
      Resource: 'https://www' (login required)
  • Experiment

    Minimizing Hitting Time between Disparate Groups with Shortcut Edges

    Experiments on real-world datasets to evaluate the effectiveness of the algorithms proposed in the paper...
    • Github
      Resource: 'experiment data and code' (login required)
  • Dataset

    Interaction bias. Experiments dataset

    Artificial Intelligence (AI) is increasingly used to build Decision Support Systems (DSS) across many domains. In our work, we conducted a series of experiments designed to...
    • JSON
      Resource: 'Dataset' (login required)
  • Method

    Visualizing the Results of Biclustering and Boolean Matrix Factorization Algo...

    This archive contains the code to visualize biclusters from the paper "Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations" by Thibault Marette, Pauli...