Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1911 |
Symbol | |
ID | 3830835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1979946 |
End bp | 1981145 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829844 |
Product | Iron-containing alcohol dehydrogenase |
Protein accession | YP_430754 |
Protein GI | 83590745 |
COG category | [C] Energy production and conversion |
COG ID | [COG1454] Alcohol dehydrogenase, class IV |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000928726 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGAAA CAAAAATAAA CATCAACGAA GTCCGGGAAA TCCGGGCTAA AACAACCGTC TACTTTGGAG TTGGAGCTAT TAAAAAGATT GACGACATAG CCAGGGAATT TAAGGAAAAG GGATACGATA GGATCATCGT AATAACCGGC AAGGGGGCTT ATAAAGCCAC CGGCGCGTGG GAATATATAG TTCCGGCCTT AAATAAAAAC CAGATAACCT ATATCCATTA CGACCAGGTG ACGCCCAACC CGACGGTAGA CCAGGTTGAC GAGGCAACCA AACAAGCCCG GGAATTCGGT GCCCGAGCCG TCCTGGCCAT CGGCGGGGGT AGCCCCATTG ATGCCGCTAA AAGCGTAGCC GTCTTGCTCT CCTACCCCGA CAAAAATGCC CGACAGCTCT ACCAGTTAGA ATTTACACCT GTTAAGGCCG CACCTATCAT CGCTATTAAT CTTACCCATG GTACGGGGAC GGAAGCCGAT CGCTTTGCCG TTGTCAGCAT CCCTGAAAAG GCATATAAAC CCGCTATTGC CTATGATTGC ATTTACCCCT TATATTCAAT TGACGACCCG GCCCTCATGG TAAAACTGCC GTCCGACCAG ACAGCTTATG TCTCTGTTGA TGCCCTCAAC CATGTCGTCG AAGCAGCCAC CAGCAAAGTA GCCAGCCCCT ATACTATTAT CCTGGCCAAG GAAACGGTAC GGCTCATCGC CCGATACCTG CCCCAGGCCC TGTCCCATCC GGCGGATTTG ACGGCCAGGT ATTATCTCCT CTATGCTTCC CTGATTGCCG GAATAGCCTT TGACAACGGT TTGCTCCACT TCACCCACGC CCTGGAACAC CCCCTGAGCG CCGTCAAACC GGAGCTCGCC CACGGTCTGG GGCTGGGTAT GCTGCTGCCG GCCGTAGTCA AGCAGATTTA CCCGGCAACC CCGGAGGTAC TGGCGGAGAT ACTGGAGCCC ATTGTTCCCG ATCTCAAAGG CGTTCCCGGT GAAGCAGAAA AGGCTGCCAG CGGGGTGGCA AAATGGCTTG CCGGAGCCGG TATTACCATG AAGCTAAAAG ATGCGGGCTT TCAAGCGGAA GATATCGCCA GGTTAACTGA CCTGGCCTTT ACCACCCCGA GTCTCGAGCT TCTCCTGAGT ATGGCCCCGG TAACGGCCGA CAGGGAAAGG GTTAAGGCAA TTTACCAGGA CGCCTTTTAA
|
Protein sequence | MWETKININE VREIRAKTTV YFGVGAIKKI DDIAREFKEK GYDRIIVITG KGAYKATGAW EYIVPALNKN QITYIHYDQV TPNPTVDQVD EATKQAREFG ARAVLAIGGG SPIDAAKSVA VLLSYPDKNA RQLYQLEFTP VKAAPIIAIN LTHGTGTEAD RFAVVSIPEK AYKPAIAYDC IYPLYSIDDP ALMVKLPSDQ TAYVSVDALN HVVEAATSKV ASPYTIILAK ETVRLIARYL PQALSHPADL TARYYLLYAS LIAGIAFDNG LLHFTHALEH PLSAVKPELA HGLGLGMLLP AVVKQIYPAT PEVLAEILEP IVPDLKGVPG EAEKAASGVA KWLAGAGITM KLKDAGFQAE DIARLTDLAF TTPSLELLLS MAPVTADRER VKAIYQDAF
|
| |