Gene PICST_28221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28221 
SymbolALD7 
ID4850998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp668822 
End bp670501 
Gene Length1680 bp 
Protein Length559 aa 
Translation table 
GC content47% 
IMG OID640392706 
Productaldehyde dehydrogenase 
Protein accessionXP_001387358 
Protein GI126273959 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.272017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCTC CTTCAGCAAA CACCCTCAAG GACTCCGTTT CGTCCGTCGA ATCGTTCACA 
AAGATCGACA AGGCCACCCC CTCGGCTTCC ACCAAGGCTT CTGAGTCCAT CTTGAAGTAC
ACTGCTTTGG ACGACATTCC TGTTGGAGTC AAGAAGCTTA CTGACTCGTT CCACAAGAAT
GGAAAGACTC ACTCCATCCA GTACAGATTG AACCAGTTGA GAAACCTCTA TTTTGTCATC
AAGGATAACC AGGATGCCAT CTGCGATGCC CTCTTCAAGG ATTTCGGCAG AGTGCCTACT
GAGTCGCAGA TCTTGGAAAT CGATGGAAGC TTGAACGAGT TGGTGCACAC CATGGCCAAC
TTGCACAACT GGTTGAAACC CGAACCTGTC AAGGACTTGC CCATTACATT GAAGCTGAAT
CCTATCTACA TTGAAAGAAT TCCCTACGGA GTTGTGTTGA TCATCTCTCC CTTCAACTAC
CCATTCTTCC TCTCATTTTC AGCCATCATT GGGGCCATTG CTGCTGGTAA CGTGGTTGTG
TTCAAGCCCT CCGAATTGAC CCCACACTTT TCGCAGTTGT TCACTAACTT GTTGTCTGAG
GCTCTTGATG ATGACCTTTT GTTCACCATA AACGGTTCGA TTCCCGAGAC CACGAAGGCC
TTGGAACAGA AGTACGACAA GATCATGTAC ACCGGTAATA ACGCTGTAGG AACCATTATC
GCCAAGAAGG CTGCCGAGAC TTTAACTCCT GTCATCTTAG AGTTGGGAGG AAAGTCACCA
GCGTTTGTCT TGGACGATCT TTCTGAAAAG GAGTTGACCG TCGTAGCCAG AAGAATCGCA
TGGGGTAGAT TTACCAATGC TGGTCAAACC TGTGTGGCCG TGGACTATGT TTTGGCTCAT
GACTCTATCA AAGAGAGATT GGTTCGCGAG ATCGTCAAGG TGGTCAAGGA GGAGTTCTAT
CCAGAACTTA ACAAAGACAA CAAGGACTTC ACCCACATTA TCCACGACCG TGCTGTATCC
AACTTATCCA AGATCATCAA GACCACCAAG GGTAAGATTG TTGTTGGAGG TGATGTAGAC
GAAGCCTCGA GGTACGTGGC TCCAACTGTC GTGGACAATG TAGACTGGGA CGACTCTACC
ATGAAGGGCG AGATCTTTGG TCCTATCTTG CCTATCTTGT CGTACAGCTC TTTGGATGAA
GCTTTGTCAA GATTGCAGAG TCGTCATGAC ACTCCTTTGG CACAGTATAT CTTCACTGGT
GGATCGACTT CCCGTGCTAA GAACCCTAAA CTTAACAAGA TCTCACAGCA GATCAGATCC
GGTGGTGCTG TCATTAACGA TGTCTTGATG CACGTAGCCT TAACTAATGC CCCATTCGGA
GGTATTGGTA GTTCCGGTTC GGGATCGTAC CATGGTTGGT TTTCTTTCCG TGCATTTACT
CATGAGAGAA CCACCATGGA ACAAAAGTTG TGGAACGACT TTGTTTTGAA GGCTAGATAC
CCTCCTTTCA CAGAAAAGAA CCAGAAGTTG GTCAGTGCTT CACAGACAGA CTACAACGGT
AAGGTCTGGT TTGATAGACA AGGGGACGTC AGAGTCAAGG GTCCATCGGG CTTGTTCAGC
ACTTGGACTT CTGTTGCTGG TGTCGCAGCC TTGATCTACT ACTTTGTTGG TAATTTGTAG
 
Protein sequence
MTPPSANTLK DSVSSVESFT KIDKATPSAS TKASESILKY TALDDIPVGV KKLTDSFHKN 
GKTHSIQYRL NQLRNLYFVI KDNQDAICDA LFKDFGRVPT ESQILEIDGS LNELVHTMAN
LHNWLKPEPV KDLPITLKLN PIYIERIPYG VVLIISPFNY PFFLSFSAII GAIAAGNVVV
FKPSELTPHF SQLFTNLLSE ALDDDLLFTI NGSIPETTKA LEQKYDKIMY TGNNAVGTII
AKKAAETLTP VILELGGKSP AFVLDDLSEK ELTVVARRIA WGRFTNAGQT CVAVDYVLAH
DSIKERLVRE IVKVVKEEFY PELNKDNKDF THIIHDRAVS NLSKIIKTTK GKIVVGGDVD
EASRYVAPTV VDNVDWDDST MKGEIFGPIL PILSYSSLDE ALSRLQSRHD TPLAQYIFTG
GSTSRAKNPK LNKISQQIRS GGAVINDVLM HVALTNAPFG GIGSSGSGSY HGWFSFRAFT
HERTTMEQKL WNDFVLKARY PPFTEKNQKL VSASQTDYNG KVWFDRQGDV RVKGPSGLFS
TWTSVAGVAA LIYYFVGNL