Gene Information |
Locus tag | Moth_1024 |
Symbol | |
ID | 3832644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1052627 |
End bp | 1053727 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828952 |
Product | Iron-containing alcohol dehydrogenase |
Protein accession | YP_429881 |
Protein GI | 83589872 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
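The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the reported gene length only matches under that convention) and that the last codon of the CDS is a stop codon that contributes no residue. The function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = codons - 1  # the stop codon encodes no amino acid
    return gene_length, protein_length


# Values copied from the Gene Information fields above.
gene_len, protein_len = cds_lengths(1052627, 1053727)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1101 bp) and Protein Length (366 aa) fields of the record.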
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00115837 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00000000647947 |
Fosmid hitchhiking | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCCTTCA GTTTTTATTT ACCAACAAAG GTCTTTTTTG GCGAAGGGGC TGTTAACAAT CATGGCGTCT TTCTTAAAGG CAGGGGCCGC CGGGCCCTGG TAGTCACCGG GCGCCACAGC GCTACAGCCA GCGGTGCTAT GGCCGATATT GAAGCCTTGG CCAAGAAACT AGATATAACC CTGGCGACCT TCAATCAGGT CCCTTCTAAC CCCACCCTGG AAGTAGTGGG CCGGGGTGTT GAAATGGCCC GCAGCGAGGG GGCGGATTTT ATCATCGGAA TTGGCGGTGG TTCTCCCCTG GATACGGCCA AAGCCATCGC CCTCCTGGCA ACCAACAAAG TACCGGCAAC CGCCCTTTAT GAGGCTGAGC TACCGGAGCC GCCTCTACCT GTGATTGCTA TACCGACTAC TGCCGGCACC GGGAGCGAAG TCACCCAGCA TGCCGTTTTT ACCCTGCCGG AAAAGAAAAT CAAGAAGGGC TTTAGTGACG ACCGCTGTTT TCCCCTGGCA GCATTGGTGG ATCCCCGTTA TACCGCCTCT CTCCCCCTGG AGGTTACCAT TGATACCGCC CTGGACGCCC TGAGCCATGC CATCGAGGGT TACCTGTCCA GGCGGGCGAC GCCTTTAAGC GATACTCTGG CCCTTGAGGC CATGGGCCTC TTCGCCAGGC ATAAGGAAGC CCTGGTGAGG GGAGAGTTGA CTCCTGCTAC CAGGTACGAT CTCATGTATG CTTCCACCCT TGGTGGCATG GTCATTGCCC AGACGCGTAC GACCATCCTG CATACCCTGG GTTATCCTCT AACCTTCAGC CATAATATCC CCCATGGCCG GGCCAATGGC CTCCTGCTGG CAGCCTACCT GGAGTTCGTA CAACCGGCGG AACCGGTGAA GGTCGCTCGC ATCCTGACTG TTTTAGGGAT GACCTCCCTG GCAGAGGTGC AACAGATGAT CCGGCTACTC CTGCCGGTGC CGGGAAAATA TCCGGAGAAG GAATTGGAAC GCATGGCGGA TCTGGTAACC GGGGCCAGCA GTATGGCCTG GACGGCCCGC CAGGGGACCC GGGCCGATCT GGTCCGGATA CTGCGCCAGA GTTTGGGTTA G
|
Protein sequence | MSFSFYLPTK VFFGEGAVNN HGVFLKGRGR RALVVTGRHS ATASGAMADI EALAKKLDIT LATFNQVPSN PTLEVVGRGV EMARSEGADF IIGIGGGSPL DTAKAIALLA TNKVPATALY EAELPEPPLP VIAIPTTAGT GSEVTQHAVF TLPEKKIKKG FSDDRCFPLA ALVDPRYTAS LPLEVTIDTA LDALSHAIEG YLSRRATPLS DTLALEAMGL FARHKEALVR GELTPATRYD LMYASTLGGM VIAQTRTTIL HTLGYPLTFS HNIPHGRANG LLLAAYLEFV QPAEPVKVAR ILTVLGMTSL AEVQQMIRLL LPVPGKYPEK ELERMADLVT GASSMAWTAR QGTRADLVRI LRQSLG
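The relationship between the two sequence fields above, and the GC content field, can be illustrated with a short sketch. It is not the pipeline that produced this record. It assumes the sequences are pasted in as plain strings with the spaces between 10-letter blocks left in, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts). The names translate and gc_fraction are hypothetical helpers.

```python
from itertools import product

# Codon -> amino acid map for translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop whitespace (the record groups letters in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-letter blocks of the gene sequence field above.
print(translate("ATGTCCTTCA GTTTTTATTT ACCAACAAAG"))
```

Passing the full gene sequence field to translate should reproduce the protein sequence field, and gc_fraction on the same string can be compared with the reported GC content.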