Gene Information |
Locus tag | PICST_28221 |
Symbol | ALD7 |
ID | 4850998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 668822 |
End bp | 670501 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 47% |
IMG OID | 640392706 |
Product | aldehyde dehydrogenase |
Protein accession | XP_001387358 |
Protein GI | 126273959 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
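The coordinate and length fields above are related by simple arithmetic, sketched here as a consistency check. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 668822
end_bp = 670501

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.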
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.272017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCCTC CTTCAGCAAA CACCCTCAAG GACTCCGTTT CGTCCGTCGA ATCGTTCACA AAGATCGACA AGGCCACCCC CTCGGCTTCC ACCAAGGCTT CTGAGTCCAT CTTGAAGTAC ACTGCTTTGG ACGACATTCC TGTTGGAGTC AAGAAGCTTA CTGACTCGTT CCACAAGAAT GGAAAGACTC ACTCCATCCA GTACAGATTG AACCAGTTGA GAAACCTCTA TTTTGTCATC AAGGATAACC AGGATGCCAT CTGCGATGCC CTCTTCAAGG ATTTCGGCAG AGTGCCTACT GAGTCGCAGA TCTTGGAAAT CGATGGAAGC TTGAACGAGT TGGTGCACAC CATGGCCAAC TTGCACAACT GGTTGAAACC CGAACCTGTC AAGGACTTGC CCATTACATT GAAGCTGAAT CCTATCTACA TTGAAAGAAT TCCCTACGGA GTTGTGTTGA TCATCTCTCC CTTCAACTAC CCATTCTTCC TCTCATTTTC AGCCATCATT GGGGCCATTG CTGCTGGTAA CGTGGTTGTG TTCAAGCCCT CCGAATTGAC CCCACACTTT TCGCAGTTGT TCACTAACTT GTTGTCTGAG GCTCTTGATG ATGACCTTTT GTTCACCATA AACGGTTCGA TTCCCGAGAC CACGAAGGCC TTGGAACAGA AGTACGACAA GATCATGTAC ACCGGTAATA ACGCTGTAGG AACCATTATC GCCAAGAAGG CTGCCGAGAC TTTAACTCCT GTCATCTTAG AGTTGGGAGG AAAGTCACCA GCGTTTGTCT TGGACGATCT TTCTGAAAAG GAGTTGACCG TCGTAGCCAG AAGAATCGCA TGGGGTAGAT TTACCAATGC TGGTCAAACC TGTGTGGCCG TGGACTATGT TTTGGCTCAT GACTCTATCA AAGAGAGATT GGTTCGCGAG ATCGTCAAGG TGGTCAAGGA GGAGTTCTAT CCAGAACTTA ACAAAGACAA CAAGGACTTC ACCCACATTA TCCACGACCG TGCTGTATCC AACTTATCCA AGATCATCAA GACCACCAAG GGTAAGATTG TTGTTGGAGG TGATGTAGAC GAAGCCTCGA GGTACGTGGC TCCAACTGTC GTGGACAATG TAGACTGGGA CGACTCTACC ATGAAGGGCG AGATCTTTGG TCCTATCTTG CCTATCTTGT CGTACAGCTC TTTGGATGAA GCTTTGTCAA GATTGCAGAG TCGTCATGAC ACTCCTTTGG CACAGTATAT CTTCACTGGT GGATCGACTT CCCGTGCTAA GAACCCTAAA CTTAACAAGA TCTCACAGCA GATCAGATCC GGTGGTGCTG TCATTAACGA TGTCTTGATG CACGTAGCCT TAACTAATGC CCCATTCGGA GGTATTGGTA GTTCCGGTTC GGGATCGTAC CATGGTTGGT TTTCTTTCCG TGCATTTACT CATGAGAGAA CCACCATGGA ACAAAAGTTG TGGAACGACT TTGTTTTGAA GGCTAGATAC CCTCCTTTCA CAGAAAAGAA CCAGAAGTTG GTCAGTGCTT CACAGACAGA CTACAACGGT AAGGTCTGGT TTGATAGACA AGGGGACGTC AGAGTCAAGG GTCCATCGGG CTTGTTCAGC ACTTGGACTT CTGTTGCTGG TGTCGCAGCC TTGATCTACT ACTTTGTTGG TAATTTGTAG
|
Protein sequence | MTPPSANTLK DSVSSVESFT KIDKATPSAS TKASESILKY TALDDIPVGV KKLTDSFHKN GKTHSIQYRL NQLRNLYFVI KDNQDAICDA LFKDFGRVPT ESQILEIDGS LNELVHTMAN LHNWLKPEPV KDLPITLKLN PIYIERIPYG VVLIISPFNY PFFLSFSAII GAIAAGNVVV FKPSELTPHF SQLFTNLLSE ALDDDLLFTI NGSIPETTKA LEQKYDKIMY TGNNAVGTII AKKAAETLTP VILELGGKSP AFVLDDLSEK ELTVVARRIA WGRFTNAGQT CVAVDYVLAH DSIKERLVRE IVKVVKEEFY PELNKDNKDF THIIHDRAVS NLSKIIKTTK GKIVVGGDVD EASRYVAPTV VDNVDWDDST MKGEIFGPIL PILSYSSLDE ALSRLQSRHD TPLAQYIFTG GSTSRAKNPK LNKISQQIRS GGAVINDVLM HVALTNAPFG GIGSSGSGSY HGWFSFRAFT HERTTMEQKL WNDFVLKARY PPFTEKNQKL VSASQTDYNG KVWFDRQGDV RVKGPSGLFS TWTSVAGVAA LIYYFVGNL
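The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they could be cleaned, translated, and checked for GC content follows. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one (NCBI translation table 1), which is an assumption because the Translation table field of this record is empty. An alternative yeast nuclear code (NCBI table 12) reads CTG as serine, so a different table passed to `translate` would change the result for any in-frame CTG codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
# Assumption: the record does not state which translation table applies.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq, table=CODON_TABLE):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = table[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACTCCTC CTTCAGCAAA CACCCTCAAG"
print(translate(first_blocks))
print(gc_fraction(first_blocks))
```

Passing the full gene sequence string to the same functions allows a comparison with the listed protein sequence and GC content fields.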