Gene PICST_37376 details

Gene Information

Locus tag: PICST_37376
Symbol: AAD2
ID: 4851559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2136636
End bp: 2137700
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table:
GC content: 42%
IMG OID: 640393267
Product: aryl-alcohol dehydrogenases
Protein accession: XP_001387652
Protein GI: 126274855
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.485365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0942071
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCTT CAATTGAATA CAAAAAGCTT GGTGCCTCTG GTTTGGCTAT TTCTCCTATC 
ATCGTGGGAT GCATGTCCTA CGGTAAGAAA TTTTGGGCCG ACTGGGTTAT GGAAGATGAA
GAACAGATCT TCAAAATCTT GAAGAAGTGC TACGACTCTG GTATTAGAAC TTTTGATACT
GCTGACTTGT ACTCCAATGG TCATTCGGAA GTTATCTTGG GTAAGTTCTT GAAGAAGTAC
AATATTCCAA GAGAGAAAGT TGTAATTTTA ACTAAGTGCT TCTGTCTAAT TGACACAAAT
ATCCCTGATT TAAACATCGA AACGCAATAC AATTACCCAT CCTATGAGTT TGTTCATAAC
CAGGGTTTGT CAAGAAAGCA TATTTTCGAT GCCGTCAAAG GTTCAGTTGA AAGATTGGGA
ACCTACATCG ATGTCTTGCA AATTCACAGA TTGGATGAGG AGACCCCAAA GGCTGAAATT
ATGAGAGCCT TGCACGATGT CGTTTCTAGT GGTGATGTCA GGTATATCGG TGCTTCCTCT
ATGAGAGCCG CTGACTTCGT TGAATTACAG TTCATTGCTG ATAAGAATGG CTGGACTAAG
TTCATCAGTA TGCAAAACTT CTACAACTTA ATCTACCGTG AGGAAGAAAG AGAAATGATT
CCTTTCTGTA ACGATAACTC CCTTGGTAAG GTTGGCTTGA TCCCATGGTC TCCAATTGCC
AGAGGACTTT TGGCTAGACC TCTTGGTGTA GAGTCTAACC ATAACAGATC TGCCGACACT
GACTTGGCAT TTGAGTTCTT TGGTTTGGCA AACTTGACTG AAGCCGACAA GGAGATTATC
AAGAGAGTCG AAGAAGTTGC CAAAAAGCAT GAAGTCAGTA TGGCTGTAAT CTCCTCTGCT
TGGGTCTTGA GCAAGGGTGC GTTCCCTATC ATCGGTCTCA ACTCCGAAGC AAGAGTTGAC
GATGCACTTA ACTCTCTCAC TGTTAAGTTA ACTGATGAAG AAGTCGCATA CTTGGAAGAG
CCTTACAAAC CTAAGCCAGT ATATGGTTTT CAACATTTTA AGTGA
 
Protein sequence
MSSSIEYKKL GASGLAISPI IVGCMSYGKK FWADWVMEDE EQIFKILKKC YDSGIRTFDT 
ADLYSNGHSE VILGKFLKKY NIPREKVVIL TKCFCLIDTN IPDLNIETQY NYPSYEFVHN
QGLSRKHIFD AVKGSVERLG TYIDVLQIHR LDEETPKAEI MRALHDVVSS GDVRYIGASS
MRAADFVELQ FIADKNGWTK FISMQNFYNL IYREEEREMI PFCNDNSLGK VGLIPWSPIA
RGLLARPLGV ESNHNRSADT DLAFEFFGLA NLTEADKEII KRVEEVAKKH EVSMAVISSA
WVLSKGAFPI IGLNSEARVD DALNSLTVKL TDEEVAYLEE PYKPKPVYGF QHFK
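
The length and GC figures under Gene Information can be cross-checked against the sequence blocks above. The following is a minimal Python sketch of such a check, not the database's own method. The helper names (`clean`, `gc_percent`, `expected_protein_length`) are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon.

```python
def clean(seq_block: str) -> str:
    """Strip the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# First and last lines of the gene sequence in this record, as a short demo.
first_line = "ATGTCCTCTT CAATTGAATA CAAAAAGCTT GGTGCCTCTG GTTTGGCTAT TTCTCCTATC"
last_line = "CCTTACAAAC CTAAGCCAGT ATATGGTTTT CAACATTTTA AGTGA"

print(len(clean(first_line)))                    # 60
print(len(clean(last_line)))                     # 45

# Gene length from the coordinates, assuming 1-based inclusive positions.
gene_length = 2137700 - 2136636 + 1
print(gene_length)                               # 1065
print(expected_protein_length(gene_length))      # 354
```

To check the reported GC content, paste the full gene sequence into one string and call `gc_percent(clean(full_sequence))`. The value should be close to the 42% listed above if the record is internally consistent.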