Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2259 |
Symbol | |
ID | 3830754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2364747 |
End bp | 2366405 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637830179 |
Product | dihydroxyacid dehydratase |
Protein accession | YP_431089 |
Protein GI | 83591080 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAGCG ATGCTATGAA AACGGGCCTG GCCCGGGCTC CCCACAGATC GTTACTTAAA GCCATGGGCC TGACCGAGAC GGAAATCGAG CGGCCGATTA TAGGTGTTGT CAATGCCCAT AATGAACTCA TTCCCGGCCA TATACATTTA AATAACCTGG TGGAAGCCGT AAAAGCAGGG GTGCGCCTGG CCGGCGGTAC GCCCCTGGAG TTTCCCACTA TCGGTGTCTG TGATGGCCTG GCCATGAACC ATGTGGGGAT GAAGTATTCC CTGGCCAGCC GGGAGCTCAT CGCCGATATG ATTGAAGTAA TGGCTATGGC CCATCCCTTT GACGCCCTGG TATTTATCCC TAACTGCGAT AAGATCGTCC CCGGGATGCT TATGGCAGCA GCCCGCCTGA ACCTCCCTGC CATTTTTATC AGCGGCGGTC CCATGCTGGC CGGCCGTTAC CAGGGCCGGG ATGTTTCCCT GAGTACTATG TTCGAGGCTG TAGGGGCTGT CCAGGCCGGG AAGATGACAG AACAGGAACT GGCCGCCCTG GAGGACTGCG CCTGCCCGGG CTGTGGTTCC TGTGCGGGTA TGTTTACCGC CAACACCATG AACTGCATGG TTGAGGCCTT AGGGATGGGC CTGCCGGGTA ACGGTACTAC CCCTGCAGTG AGCGGCTCCC GGGTACGTTT GGCTAAAGAA GCCGGTATGC AGGTGATGAA ATTACTCCAG GAAAATATCC GGCCCCTGGA TATTATGACG GCTACAGCCT TCCGCAATGC CGTGGCCGTG GATATGGCCC TGGGTGGTTC GACCAATACC TGCCTGCACC TGCCGGCCAT AGCCCATGAA GCCGGCGTAA AACTTGACCT GAATACTTTC AACGAAATCA ATCGGCGGAC GCCCCAGATC TGTAAGCTCA GCCCGGCCGG CAGCCAGCAC ATCCAGGACC TGGATGAGGC CGGCGGTATC CCGGCGGTGA TGAATGAGCT CTACCGTCAT GGCCTGATTG ACGGCAGCGC CCTTACTGTG ACCGGACGGA CAGTGGCTGA TAACGTCAGC GGTCGGGTGG TAAGCCGGCG CGAGGTTATC CGGCCTGTGG AAGACCCCTA CAGCAGGGAA GGTGGCCTGG CCGTATTGTA TGGCAACCTG GCTCCTGAGG GTGCCGTTGT AAAGAAGGGC GCCGTGCTGC CGGAGATGAT GCGGCATGAA GGGCCGGCGC GGGTATTTAA CAGCGAGGAA GAGGCTTTTG CCGCCATTAT GGGGAAGCAG ATTAAACCCG GGGATGTGGT GGTAATCCGC TACGAAGGCC CTCGTGGCGG TCCGGGTATG CAGGAAATGC TCAGTCCCAC GGCAGCCCTG GCCGGTATGG GCTTGGACAG CTCCGTGGCT CTGATCACTG ACGGCCGTTT CTCCGGTGCC AGCCGTGGCG CCTCTATCGG TCACGTCTCG CCGGAAGCAG CGGCCGGGGG GCTCATCGCC CTGGTGGAAG AAGGAGATAT CATCGCCATC GATATTGAAG CCGGCAAGCT GGAACTTAAG GTGCCGGAAG AAGAAATTGC CCGCCGCCGC CAGAATTGGC AGGCGCCGCC GCCGAAGATC ACCGGCGGTT ACCTGGGCCG CTACGCGCGC ATGGTTACTT CCGGAGCCAG GGGCGCGGTG TTGGAGTAA
|
Protein sequence | MRSDAMKTGL ARAPHRSLLK AMGLTETEIE RPIIGVVNAH NELIPGHIHL NNLVEAVKAG VRLAGGTPLE FPTIGVCDGL AMNHVGMKYS LASRELIADM IEVMAMAHPF DALVFIPNCD KIVPGMLMAA ARLNLPAIFI SGGPMLAGRY QGRDVSLSTM FEAVGAVQAG KMTEQELAAL EDCACPGCGS CAGMFTANTM NCMVEALGMG LPGNGTTPAV SGSRVRLAKE AGMQVMKLLQ ENIRPLDIMT ATAFRNAVAV DMALGGSTNT CLHLPAIAHE AGVKLDLNTF NEINRRTPQI CKLSPAGSQH IQDLDEAGGI PAVMNELYRH GLIDGSALTV TGRTVADNVS GRVVSRREVI RPVEDPYSRE GGLAVLYGNL APEGAVVKKG AVLPEMMRHE GPARVFNSEE EAFAAIMGKQ IKPGDVVVIR YEGPRGGPGM QEMLSPTAAL AGMGLDSSVA LITDGRFSGA SRGASIGHVS PEAAAGGLIA LVEEGDIIAI DIEAGKLELK VPEEEIARRR QNWQAPPPKI TGGYLGRYAR MVTSGARGAV LE
|
| |