Gene PICST_85565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85565 
SymbolYND2 
ID4840837 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp127427 
End bp130210 
Gene Length2784 bp 
Protein Length815 aa 
Translation table12 
GC content43% 
IMG OID640392152 
ProductYeast Nucleoside Diphosphatase 
Protein accessionXP_001386624 
Protein GI150866882 
COG category[G] Carbohydrate transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5371] Golgi nucleoside diphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCAC TTGAACCTCG CAAAAAGGAC AAAAACAAGT CCAAGTTTGT GAAGCCAGGC 
CCAATCTACT CTGAAGATGG CATTCCTTAC GACTACATTA TTATTATTGA TTCGGGCTCC
AAGGGGCTGA GGGTGTTTGT ATACAACTGG TTGAATCCAG CTGCTGCTCT AAACAAAACT
CTCGACATGA GTGTTGTTCT TAAAAAGCCC AATCTCAACT TGATCAAACG GTACTCTGTG
GCCAGAGAAG AAGCGGAAAT TGAAGAATCA GATACAGATA CCGAGGATGA GTCCGAAATC
GAAATAGATG AGGAAGACGT AGAGTCGGAG ACCGAAGAGA AGGACAACTC CAAACTGAAG
GACAATGGTA AAGGTGCTGC GGTTGAAAAC AAAGACAAGA GTTCCAAGAA GAATCCCAAA
ACCATTCCTA TTAGGTTTCC CAAGATCCAG TCGAAAAAGA AATGGCATCA AAAGGTGAAA
CCTGGTATTT CGTCCTTCAA TTTGAGTCCG CAGAAAATAG GCAATTATCA TTTGAAGCAC
TTGCTCCAAT TGGCCAGCTC TGTTGTTCCC AAGTCTCAAC ACTATAGGAC CCCCATCTTC
TTGCATTCAA CCGCAGGAAT GCGGTTGTTG ACGCCTACCG AACAGACACT GATTCTCAAC
AACATCTGCT CGTATATCAC CAACAATTCA GACTTCTACA TTCCTGAATG TGCCAGTCAT
ATAAATGTAA TCGATGGGGA CTTCGAGGGT ATCTACGGCT GGTTGTCTAT AAATTCGCTC
AAAGGTGCTT TCGATAATCC CGAACAACAC GACCATGGTA AGAGTCACAG CACCTACGGA
TTGTTGGATA TGGGAGGAGC TTCAACACAA GTTGTTTTCC AGCCCAACAA CACTGAGATT
AAAGAACATC AGAATAATCT TTTCAAAATA ACTCTCGCAC AAGTTCCACA TCTAGCTCCT
GTAGATTCTG CAAATGCTGC CGACCCTGTA CAACAACGAG GAGATGAGCC TGTAGTAGGT
AAGTATTCGT TACCTCAACC AGCGGAATTC AACGTCTTCT CAGATTCGTT CTTGGGCTTT
GGAATGTACC AAGCTCACAA CCGTTACTTG GCTACCCTTC TTAGTCTGTA TTTGGAAGAG
AACAATATGA ACGAGCAGAA TAGAGTCATT AAGAAAATAA GTACTCCAGT GCCAGATCCT
TGTTTGCCAA AGGGCTACAT GTCCGTTTCA AATGTGGAAG AAATTTCTGT AGATTTCACG
GGGGAGAGCA ATTTTCAGAA ATGTCTCGAA GGCATCTTTC CGGTGTTGCT GAAGAGTTCT
GCGTCTGAAA ATATGGGGAA CTGCAAGCAG TTTAGCGACG AAATACAGGC TAGCTCTTGC
TTATTGAAAG ACTCTATTCC AGCCTTCGAC TTCGAGGTCA ATCACTTCGT TGGTGTCAGC
GGCTACTGGG AAGCTATCAG CAACTTGTTG AGCTACGAGA ACATCAATGA TATCCGTGAA
GATGTGGATG ATGCAGAAGA GAACACCAAT AAGGACAGTA AAAGTGATCT GAATAACAAG
GACGGCAAGA ATAGCGAGGA CAAGAATAGC GAAGACGACA AGATCGGTGA AGTCGAAAAC
GATGAAGACA AGGACGATGA CGACAACAAG ATTAATGAAG ACGGCAAGAG TAAGGAAGAC
GACAAGAATA AGGAAGACGA CAAGAATAAG GAAGACGGCA AGAATAAGGA AGACGACAAG
AATAAGGAAG ACGGCAAGAA TAAGGAAGAC GGCAAGGCCA ACAAGAACAA TAAAGCCAAC
AAGGGCAATA AAAACGGCAA GGGTAATAAA AGCGACAAGG GTAATAAAGG TGACAAAAGC
AACAAGAACA ACGATAAGTC GGATACCTAC GATTACAAAG TCATCTACGA ACAGACATCC
AAGATATGTT CTAAGGACTG GAGTAGTTTG TTCGAAATGA ATAAAAGAAA GCCAGAGAAC
AAACAATTGA CTGAAACTGA CTTACTGGAG TTGTGTTTCA AGTCGTCTTG GATCTTGAAT
TTCTTACATC TTGGATTGGG ATTTCCTCGT CTTGGCATCG ACGAGATGCC CACCAAGAAC
GACCAGTTCA AGTCGTTGGA GTTGGTAGAA GATATTGATG GCTCATCGTT TTCGTGGACT
TTAGGTAGAG CAATCCTTTA TTCCAACGAT GAATACGTTC AGGCATTCAA TAATTACACT
ATCAAGACAT TACAACTTTC CGAAGAGGAT GTGGAGAATT CCAAATCTGA GTACATAATG
AAGAGGCCGG GATTTTATCA TTCCGCCTCT GCTTCCGTGT ATCATTTTGG AGCCGAGCAG
AATGGCATTT CCCCACGTCC ACAGTTTATT GCACCTAGAG ACGGTATCAA GTATCCTCAC
TACGACTACG AGTACGAAGC TGAAAGTCTG GAACTGAAAT GGGACATCGA GCCACATCGC
TGGTACGGAA TCATCATCTT TTTAGGCTTA CTAGGGTTTA TAGTCTGGCT TATGGTAGGA
AGAAGTGGTC GTTCTCTCAT CCTTGACCGG GCCAAAAACA GAGTCAGAAG AATGGTTCTG
CCATTATTCG GAAGATCTAG CTACGTTGCA GTTCCATTAG ATGAAGAGCA TGAAATTAGC
GCTGCTGGTC GAAGCTCTGA TCTTGATTTA GAGAATGGCT ACGAGTTGGA CGATATCGAT
TCGACGCCAG ACTTCAAGAC TAGTAACTCG TCCTCTGATG AAGTGGAGGC GCAATTCAGA
ATCGATAGCG ATGACGAAGA TTAG
 
Protein sequence
MVSLEPRKKD KNKSKFVKPG PIYSEDGIPY DYIIIIDSGS KGSRVFVYNW LNPAAALNKT 
LDMSVVLKKP NLNLIKRYSV AREEAEIEES DTDTEDESEI EIDEEDVESE TEEKDNSKSK
DNGKGAAVEN KDKSSKKNPK TIPIRFPKIQ SKKKWHQKVK PGISSFNLSP QKIGNYHLKH
LLQLASSVVP KSQHYRTPIF LHSTAGMRLL TPTEQTSILN NICSYITNNS DFYIPECASH
INVIDGDFEG IYGWLSINSL KGAFDNPEQH DHGKSHSTYG LLDMGGASTQ VVFQPNNTEI
KEHQNNLFKI TLAQVPHLAP VDSANAADPV QQRGDEPVVG KYSLPQPAEF NVFSDSFLGF
GMYQAHNRYL ATLLSSYLEE NNMNEQNRVI KKISTPVPDP CLPKGYMSVS NVEEISVDFT
GESNFQKCLE GIFPVLSKSS ASENMGNCKQ FSDEIQASSC LLKDSIPAFD FEVNHFVGVS
GYWEAISNLL SYENINDIRE DGNKGDKSNK NNDKSDTYDY KVIYEQTSKI CSKDWSSLFE
MNKRKPENKQ LTETDLSELC FKSSWILNFL HLGLGFPRLG IDEMPTKNDQ FKSLELVEDI
DGSSFSWTLG RAILYSNDEY VQAFNNYTIK TLQLSEEDVE NSKSEYIMKR PGFYHSASAS
VYHFGAEQNG ISPRPQFIAP RDGIKYPHYD YEYEAESSES KWDIEPHRWY GIIIFLGLLG
FIVWLMVGRS GRSLILDRAK NRVRRMVSPL FGRSSYVAVP LDEEHEISAA GRSSDLDLEN
GYELDDIDST PDFKTSNSSS DEVEAQFRID SDDED