Gene PICST_31312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31312 
SymbolADH5 
ID4839054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp490204 
End bp491319 
Gene Length1116 bp 
Protein Length371 aa 
Translation table12 
GC content46% 
IMG OID640390369 
ProductNAD/NADP dependent alcohol dehydrogenase 
Protein accessionXP_001384388 
Protein GI126135728 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTTC AAGAAACTAC TATCCCAGCC ACATTCCAAG GTTTTGGAGT TGACAAGCCA 
GAAAACTGGA ACAAGCCCAA GTTGGTGGAA TACAAGAGAA AAATCGTCAA CCCACACGAT
GTTGTTGTCA AGAACATTGC CTGTGGTCTT TGTGGTTCCG ACATTCTTAC CTTGAAAGCC
GATTGGTCCC CATTGCTCAG AAACGATGCT GTCGTAGGTC ACGAAATTAT TGGCCATGTC
ATCGCCATTG GTGATAAAGT TACCCAAGTC AAGATTGGCG ACAGAGTCGG TATTGGTGCA
GCTTCCAACT CCTGTAGAGA TTGTTCCAGA TGTACCCACG ACAACGAGCA ATACTGTGCC
GACGGTGCTG GTACTTACAA CTCGGTAGAT GCTGCTGCTG AAGACTACAT CACCCAAGGT
GGTTACTCTT CCCACTCCAT TGCTAACGAA CAATTCGTAT TCCCTATTCC AGAAGCTATG
GAAACCGTAC ATGCAGCTCC TTTGATGTGT GCTGGTTTGA CTGTCTACTC TCCATTGGTA
CGTAACCTTG GTACCGATGC CAAGGGAAAG ACGGTTGGTA TCATTGGTAT TGGTGGTCTT
GGACATCTCG CCCTTCAATT TGCCAACGCC CTTGGTGCCA ATGTTGTTGC CTTTTCTAGA
ACTTCTTCAA AGAAGGAACA AGCTCTCAAG TTGGGAGCTC ATGAATTCAT TGCTACTGCT
GAAGAAAAGG ACTGGAAGAA GAAGTATGCC GACCACTTCG ACTTGATCTT GAACTGTGCT
TCTGGTATCG ACGGTTTGGT TCTTGACAAC TACTTACAAG TATTGAAGGT CGACAAGAAG
TTTGTCTCTG TGGGTTTACC ACCAACCAAG GACAACATCC AAGTGTCTCC ACACACCTTC
CTCCACCAAG GTGCATCTTT TGGTTCGTCT TTGTTAGGAT CTAAGACTGA GGCTTTGCAG
ATGTTGGAAT TGGCTACTGC AAAGGGTGTC AAGCCATGGG TTGAGGAAAT CCAAATTGGT
GAAGACGGCT GTCACGAAGC GTTGACTAGA TGTGACAAGG GTGACATTAG ATACAGATTC
GTGTTCACCG GTTTTGACAA GGCTTTCACT GCCTAA
 
Protein sequence
MTVQETTIPA TFQGFGVDKP ENWNKPKLVE YKRKIVNPHD VVVKNIACGL CGSDILTLKA 
DWSPLLRNDA VVGHEIIGHV IAIGDKVTQV KIGDRVGIGA ASNSCRDCSR CTHDNEQYCA
DGAGTYNSVD AAAEDYITQG GYSSHSIANE QFVFPIPEAM ETVHAAPLMC AGLTVYSPLV
RNLGTDAKGK TVGIIGIGGL GHLALQFANA LGANVVAFSR TSSKKEQALK LGAHEFIATA
EEKDWKKKYA DHFDLILNCA SGIDGLVLDN YLQVLKVDKK FVSVGLPPTK DNIQVSPHTF
LHQGASFGSS LLGSKTEALQ MLELATAKGV KPWVEEIQIG EDGCHEALTR CDKGDIRYRF
VFTGFDKAFT A