Gene PICST_87861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_87861 
SymbolRDS2 
ID4837174 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1340444 
End bp1342923 
Gene Length2480 bp 
Protein Length663 aa 
Translation table12 
GC content43% 
IMG OID640388489 
Productputative Fungal transcriptional regulatory protein 
Protein accessionXP_001383022 
Protein GI150864271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.328163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTAATATCTT GGAATTACCG TCAAGGCAGC GACCTACGAT GTACGCCACG AGTCATCAGG 
AACAGGATAT GAAGCCCGGC ATTGACCGGG ACAGTGCCGG CCATAGCAAT GTGAATTCTC
ATGAATCTCG GGACAAGAGT GATATTGGTA ATAGCAACAA TAATAGAAAT ATTAATAGTG
ATAGCAATAA CGACAACAAC AATAGCATTG GCAATAGTAA TAACAACAGC AACAATACCA
ACGGTCATAG TAGTCATACT GCTAGTACTA CCACCAATAG CAACGACACT AATAGTAAAA
TTATTCATGC TGGTAATAGT CTTAATACTA CCAACTCCAA TAACAGCGAT ACTTCTGGCA
ACACCCATTC GGACTCACTT GCCTATGCAG CTGTAGATTT CAAGTCTGAC TCCATCAATA
GTAACAGTGG CACCACAAAC AGCACGAACT ACAATCACAG TAATCCACAA TCTCATTCTC
ATTCCAACAC TAATAACAAT AGCACTAATA ATAATCACCA TTCTACCACT CTTTCTACGA
CTGCGCTGTC ATCGCCTGAG CCATCAGGAT CGTCTTCGTC GAAGAAACGT AAAAAGAAAG
TAGAAATTGC CTGCGTCTAT TGTCGTAGGT CTCATATGAT CTGTGACGAT TCTAGGCCCT
GTCAACGTTG TATCAAAAGA GGCATTGGTC ATTTGTGTTA CGACGAGCCT TCCAATTCTC
GTCAACGTAA AAAGGCAGCT GCTCTCCGTA AATCCACTAG TGATAGCTCC GCTCCGATAT
CGTTATCGAC AGAAAGAACT TCTTTTCCCA ACGTATCGTC GTCTCTGCCG ATTATACCTG
TTCCCGTAAC CTTGGAACAA CAGCTTCAAA GCCAGCTTCA AGGTCCACTA GACAAAGATC
TGCTTTCATC ACCAAATCCT GGCAGTGGTG GTATTACCAA TTCTGGAGTT GGATCTATTA
ATAATACTAG TACGACTAGT AATTCCGGAC CTGGTTCAAA AGTTCCGCAG CCTCTTCCAG
GTCCGCCTAT TCAATCTGTA AACAGCCAAC TTCAGCTTCA TAAACCGCAC ACGTCTCTCT
CCAACTCAGT CTTGTCACAA ACATTACCGT ATAACCAGGA GCCTTTCTTC TATTCTGAGC
ACGCTGGAAG TGAGTTCAGT TCTCTTAACG ATTTCCTTCT GATGATTGAC GATCCCGAGC
TTGTGAATGG AGCTCTCAAT GACGATCCTA CTGCTGGAGA CCCCATGCTA GCATTCCAGA
CTGCTGGCAA TTTCGACTCT ACTGGCGCTA ACACTGCTGC TGGTGGAGCC ACAGTCACAG
CCAGCAACTC AGCAACTAAC CTCAACAATA TCCTCTCCTT TTCACCAAAC CCATCTTTGT
TCCAAAGCAC ATTTCAGAAT GACACCCACC AACTCAACCA ACAACAACAA CAGCAGCAGC
AGCAGCAGCA GCAACAACAA CAACAACAGC AGCAGCAACT AAATCAGCAG CTACAAGAGC
AGCAACAACA GCACCAATTC AATAGCAGCA GTAGTAACCA GTTCTTAAAG CCTCTGCCTC
TTGTTCCAAT CCAGTCTGGA CAGATGACAG TTCCATCCAG TGGAAGTAAT ACTAATGGAA
ACCAGCTGGA GCACCAACCT GTGATATCAG ATTCTGCTAG AGACAAGTTC TTCTTGACAG
CAGCTGATCC AACTACAGAA ATCTCGCCAG AAGAAAGATT AAAACAGGTT ATTAAAGCAA
AGTTGGAAGC TGGTTTGCTA CAGCCCTACA ACTATGCAAA GGGTTATGCT CGATTACAGA
GTTATATGGA CAACTACATG AACATCTCGA GCAGGCAAAG GATCTTAAAG CCGTTGTCTA
TTTTCCGTCC AGCTTTCAGG GCCATTGCCA GAACTTTGAA GGATGTAGAT TTGGTACTTG
TAGAAGAGAA CTTTGAGCGC ATGTTGTTAG ACTACGATCG TGTTTTCACA TCCATGGCTA
TACCAGCATG TCTCTGGAGA CGTACAGGAG AAATATATCG TGGAAACAAA GAGTTTGCCT
CGTTGGTTGG CGTTATGACG GACGATCTCA AAGACGGCAA GCTCGCAATC TACGAGTTGA
TGAGTGAAGA AAGTGCCGTC AACTTCTGGG AGAAGTATGG TGCTATTGCC TTTGATAAGG
GCCAGAAGGC AGTATTGACG AGTTGTAATT TGAGAACAAG AGACGGGATC AAAAGGAAAA
GTTGTTGTTT TAGCTTCACC ATCAGACGTG ATCGCTACAA CATTCCCAGT TGCATAGTAG
GAAACTTCAT TCCTATTGAC CCTTAATTGA ACAAATTTTA TTCCTTGTAT TATTTTCATT
ATCGTTTTGA CTATCAATAT CATTACGGTG TTGATTTTCG TCATCATATC AGTATTGTTT
TCCTATTTGT CGTGTCTATT GTTAGATTTA TGTCTTTCAA ATGACGTTGA TAGCATATTT
ATTATAATTT ATTCTTCTTC
 
Protein sequence
MYATSHQEQD MKPGIDRDSA GHSNVNSHES RDKSDIGNSN NNRNINSDSN NDNNNSIGNS 
NNNSNNTNGH SSHTASTTTN SNDTNSKIIH AGNSLNTTNS NNSDTSGNTH SDSLAYAAVD
FKSDSINSNS GTTNSTNYNH SNPQSHSHSN TNNNSTNNNH HSTTLSTTAS SSPEPSGSSS
SKKRKKKVEI ACVYCRRSHM ICDDSRPCQR CIKRGIGHLC YDEPSNSRQR KKAAALRKST
SDSSAPISLS TERTSFPNVS SSSPIIPVPL HKPHTSLSNS VLSQTLPYNQ EPFFYSEHAG
SEFSSLNDFL SMIDDPELVN GALNDDPTAG DPMLAFQTAG NFDSTGANTA AGGATVTASN
SATNLNNILS FSPNPSLFQS TFQNDTHQLN QQQQQQQQQQ QQQQQQQQQQ LNQQLQEQQQ
QHQFNSSSSN HNTNGNQSEH QPVISDSARD KFFLTAADPT TEISPEERLK QVIKAKLEAG
LLQPYNYAKG YARLQSYMDN YMNISSRQRI LKPLSIFRPA FRAIARTLKD VDLVLVEENF
ERMLLDYDRV FTSMAIPACL WRRTGEIYRG NKEFASLVGV MTDDLKDGKL AIYELMSEES
AVNFWEKYGA IAFDKGQKAV LTSCNLRTRD GIKRKSCCFS FTIRRDRYNI PSCIVGNFIP
IDP