-
Private Terrorists recruitment text classification dataset for PRESERVE (AI generated)
This dataset contains labelled conversations that correspond to conversations in forums, social media or instant messaging applications. It's a dataset for binary... -
Private SubCat: A Dataset of Subordinate Categories in Human Mind and LLMs for the It...
People can categorize the same entity at multiple taxonomic levels, such as basic (bear), superordinate (animal), and subordinate (grizzly bear). While prior research has... -
Private AE-SAD
Tensorflow implementation of AE-SAD This repository provides a Tensorflow implementation of the AE-SAD method for (semi-)supervised anomaly detection. Citation and Contact... -
Synthetic data for recruitment
The datasets consist of a pair of tabular datasets, 2000 curricula and 2000 job offers, generated by a trained generative causal model. The generation process followed a causal graph...-
ZIP
-
CoSRec
CoSRec is the first dataset explicitly designed for joint Conversational Search and Recommendation (CSR) tasks. CoSRec comprises approximately 9,000 user-system conversations... -
Private Shifting LLMs style to fool Machine Generated Text detectors
Datasets of synthetic news articles generated by aligning LLMs using Direct Preference Optimization to shift the machine-generated texts' (MGT) style toward human-written text... -
LLM-Driven Explanations for Quantum Algorithms
This item contains the replication package of the paper Exploring LLM-Driven Explanations for Quantum Algorithms. In particular, it contains the explanations generated by a...-
ZIP
-
Gender Equality Plans in Italian Universities
This dataset contains the documents describing the gender equality plans extracted for each public Italian university. Documents are divided by Italian regions and have been...-
ZIP
-
Bark Beetle Outbreak Czech Republic
Repository containing a satellite dataset created for bark beetle outbreak detection in satellite (Sentinel-1 and Sentinel-2) images. The dataset refers to scenes observed in... -
GiveMeSomeCreditSC
The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...-
ZIP
-
Frank Experiments
Dataset with experimental results for the "Frank" hybrid decision-making system, with simulated users. Features: - CA. Co-evolutionary Accuracy. Accuracy reached by the user...-
JSON
-
HANSEN: Spoken Text Authorship Analysis
HANSEN encompasses meticulous curation of existing speech datasets accompanied by transcripts, alongside the creation of novel AI-generated spoken text datasets.... -
-
CSV
Resource: 'Churn Dataset'
-
Medical Dataset
The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis...-
ZIP
-
Dataset Adult
The Adult dataset includes 48,842 instances with demographic information like age, workclass, marital-status, race, capital-loss, capital-gain etc. The income attribute...-
CSV
-
German Credit
In the German Credit dataset, each of the 1,000 persons is classified as a good or bad creditor according to attributes like age, sex, checking_account, credit_amount,...-
CSV
-
Compas
The COMPAS dataset contains the features used by the COMPAS algorithm for scoring defendants and their risk (Low, Medium and High), for over 4,000 individuals. We considered...-
CSV
-
Minimizing Hitting Time between Disparate Groups with Shortcut Edges
Experiments on real-world datasets to evaluate the effectiveness of the algorithms proposed in the paper...-
Github
-
Interaction bias. Experiments dataset
Artificial Intelligence (AI) is increasingly used to build Decision Support Systems (DSS) across many domains. In our work, we conducted a series of experiments designed to...-
JSON
-
Visualizing the Results of Biclustering and Boolean Matrix Factorization Algo...
This archive contains the code to visualize biclusters from the paper "Visualizing Overlapping Biclusterings and Boolean Matrix Factorizations" by Thibault Marette, Pauli...
