Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0401 |
Symbol | |
ID | 3832341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 406413 |
End bp | 407636 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828338 |
Product | malate dehydrogenase |
Protein accession | YP_429278 |
Protein GI | 83589269 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00944416 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCATCC AGGAAAGAGC CCTTAACCTG CACAGGGAAT GGCAGGGTAA AATCGAAACC CGGCCGCGGG TACAGGTTAG AAATGCGGAT GATCTAACCA TGGCTTATAC CCCCGGGGTA GCCGAGCCCT GCAAAGAGAT CCACAAGGAC CCCAATTTGG TTGATGTTTA CACCCGCCGC TGGAACCTGG TGGCTGTCGT TTCCGACGGT TCGGCCGTCC TCGGCCTGGG GAACATTGGC GCCCGGGCCG CCATGCCTGT AATGGAAGGT AAAAGCGTCC TCTTCAAATC CTTTGCCGGC GTGGATGCCT TCCCTATTTG CATTGATTCC CAGGATGTCG ATGAAATAGT CCGTACCGTC CAGCTCCTGA CCCCGACTTT TGGCGGCGTG AACCTGGAAG ACATTGCCGC ACCCCGTTGC TTTGAGATCG AGCGGCGCCT CAAGGAGACT ACGGACATCC CCATCTTCCA CGACGACCAG CACGGCACCG CTGTAGTTGT CCTGAGCGCC ATCATCAACG CTTGCAAGAT TACCAAACGT GAACTGTCCG ACCTGAAGGT GGTCATCAAC GGTGCCGGCG CGGCAGGCAT TGCCTGCGGT AAACTCCTCG TGGATGTGGG CGTTAGTGAT GTAATCCTCT GCGACTCGAA GGGGATTATC TGCTCTAAGC GCGACGACCT CAACGCCATC AAAAAAGAAA TGCTCCAAAT AACCAATAAA GAGGATCGCT GTGGCACCCT GGCCGATGCC ATGGAAGGAG CCAATTGCTT CATTGGGCTT TCCGTCAAGG ATGCCGTCAC CCCCGAGATG GTCCGTTCCA TGGACAAAGA TTCCATTCTT TTCGCCATGG CCAATCCGGT GCCGGAGATC TTGCCTGACG TGGCCCGGGC TGCCGGAGCC GCTGTGGTGG GAACGGGCCG GAGCGACTTC CCCAACCAGG TGAACAACGT CCTGGGCTTC CCCGGTATTT TCCGCGGCGC CCTGGATGTC AAAGCTTCGG ATATCAATGA TGCCATGAAG ATTGCCGCCG CCCACGCCCT GGCCGATCTG GTTGGTGACA GACTCTCGGC TGACTTTGTC ATGCCCGAAG CCTTCGATCC CAGGGTAGCG CCGGCGGTAG CCATGGCCGT AGCCAGGGCA GCCATAGAAA GCGGTGTGGC CCGCGATCCC AAGGATCCTG AATGGGTAAA ACGTCACACT GAGGAACTCA TCGCTAGACA ATAG
|
Protein sequence | MSIQERALNL HREWQGKIET RPRVQVRNAD DLTMAYTPGV AEPCKEIHKD PNLVDVYTRR WNLVAVVSDG SAVLGLGNIG ARAAMPVMEG KSVLFKSFAG VDAFPICIDS QDVDEIVRTV QLLTPTFGGV NLEDIAAPRC FEIERRLKET TDIPIFHDDQ HGTAVVVLSA IINACKITKR ELSDLKVVIN GAGAAGIACG KLLVDVGVSD VILCDSKGII CSKRDDLNAI KKEMLQITNK EDRCGTLADA MEGANCFIGL SVKDAVTPEM VRSMDKDSIL FAMANPVPEI LPDVARAAGA AVVGTGRSDF PNQVNNVLGF PGIFRGALDV KASDINDAMK IAAAHALADL VGDRLSADFV MPEAFDPRVA PAVAMAVARA AIESGVARDP KDPEWVKRHT EELIARQ
|
| |