Pair-wise dataset

Dataset Source

LncRNA2Target (LC\ HC)

URL:

http://123.59.132.21/lncrna2target

Citation:

Cheng L, Wang P, Tian R, et al. LncRNA2Target v2. 0: a comprehensive database for target genes of lncRNAs in human and mouse[J]. Nucleic acids research, 2019, 47(D1): D140-D144.

—————————————————————————-

LncTarD  (IT)

URL:

https://lnctard.bio-database.com/

Citation:

Zhao H, Shi J, Zhang Y, et al. LncTarD: A manually-curated database of experimentally-supported functional lncRNA–target regulations in human diseases[J]. Nucleic acids research, 2020, 48(D1): D118-D126.

—————————————————————————-

Case Study (Case study dataset)

Statello L, Guo C J, Chen L L, et al. Gene regulation by long non-coding RNAs and its biological functions[J]. Nature reviews Molecular cell biology, 2021, 22(2): 96-118.

—————————————————————————-

Original Dataset(After filter)

DatasetDownload LinksReference
IncTar.csvIncTar_remove_refine_889
CaseStudy.csvCase6, Case7, Case86 for IT,LC, 7 for IT, 8 for LC
lncrna_target_high_throughput V3.csvHC_45331
Incrna_target_Low_throughput V3.csvLC_remove_refine_493
These are the original dataset leveraged in the experiments. They only have the positive samples( which are labeled as 1).

Experimental Dataset (Constructed dataset, to train the models)

tips: still need features from lncRNA and gene to construct the constructed embedding dataset.

Contructed embedding dataset is the input materials for the machine learning/deep learning algorithms.

DatasetDownload Links
IncTar_1Inctar889 _1
IncTar_2IT889_2
IncTar_3IT889_3
Low_throughput_rs1.csvLC493_1
Low_throughput_rs2.csv LC_493_2
Low_throughput_rs3.csvLC493_3
LC+IT 1 LC+IT 1
LC+IT 2LC+IT 2
LC+IT 3LC+IT 3
HC 45331 1HC_45331_1
HC 45331 2HC_45331_2
HC 45331 3HC_45331_3
The dataset in this table is constructed dataset. The negative samples are generated by randomly selecting from lncRNA gene pairs. Those pairs are labeled with zero.

Constructed embedding dataset generation

Constructed embedding dataset generation samples<(Please unzip

lncRNA mix kpca 4096, Gene mix kpca 4096 into the same root folder to run this demo)