Gene Information |
Locus tag | Moth_2414 |
Symbol | |
ID | 3832165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2536030 |
End bp | 2537340 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637830333 |
Product | putative manganese-dependent inorganic pyrophosphatase |
Protein accession | YP_431239 |
Protein GI | 83591230 |
COG category | [C] Energy production and conversion; [T] Signal transduction mechanisms |
COG ID | [COG1227] Inorganic pyrophosphatase/exopolyphosphatase; [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | |
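The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of the record.

```python
# Values are copied from the record above; function names are hypothetical.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus the stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2536030, 2537340)
print(length)                           # 1311
print(expected_protein_length(length))  # 436
```

Under these assumptions both results agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.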
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000798008 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAG AGATTCTGGT TATCGGACAC CAGCGACCGG ACACAGATTC CATCGCCGCA GCCATCGGTT ACGCTGCCCT GCGGAACAAA ACGGATGGGG GCGGTTTTCA AGCGGCGCGT TGCGGCAAGC TAAACGGGGA GACGGAATTC GTGCTGTCAT ATTTTGATGT ACCCGTACCT CCCCTGGTAA ACGACGTTCG CGCCCGGGTG AAGGATGTCC TGGACGGGGG ACTGCTCTTC ATCCAGCCGG GGGCTACAGT GCGCCAGGCC GGGATTTTTA TGCGCCAGCA CGGCGTCAAG ACCCTGGCGG TGGTCGATGA AAACCGGCAT CTTCTGGGCC TGTTTACCGT CGGTGACCTG GCCCGGCTCC TCCTGGAGGC CTGGGATACC GGTAATGTGC CCATGGATGA ACCCGTTTAT AAGGTTATGC AGAGCGATAA CCTGGTAATC TTTAACCAGG ACGATTTAAT TACCGAGGTC CGCCGCACCA TGCTGGAAAC TCGCTACCGC AACTACCCGG TAGTAGACGA CAATCACTGT CTGGTCGGCC TGATTGCCCG TTATCACCTG CTGGCCATGC GAGGAAAAAG GGTAATCCTG GTCGATCACA ATGAAAAAAG CCAGGCGGTA CCCGGGATAG AAGAAGCCGA AACTGTGGAG ATAATCGACC ACCACCGGGT GGCTGATATT GAGACGGCCG AACCCATCAT GGTGCGTAAC GAGCCCGTCG GTAGTACGGC AACCATCATC GCCAGGATGT ATAAAGAGCG GGGCCTGGAT CCAGATGCAG CCATAGCCGG GGTTTTATGC GCTGCTATTC TCTCGGATAC CCTGTTGTTT AAATCGCCGA CAACTACCCA AGTTGATAAA GAACTGGCGG CCTGGCTTGC TGATATTGCC GGGTTAGACG TCGCCAATTT TGGCCGCGAA ATGTTCCGGG CCGGGTCTTC CTTGAGGGGC CGCTCGGGCC GGGAAATAAT TCTGGAGGAC TTCAAGAGCT TCAATTTTGG CAGCAACCGG GTCGGCATCG GTCAGATTGA GATTATTGAC CCCGACACCC TGCCCGTGGG CCGGGACGAA CTCCAGGCCG AATTGGAAAA ACTTCAGGCC GAGAAGCAGT ACGACCTGGT CGTCCTTATG GTAACCGATT TAATGCGCAA CGGTACGGAA TTACTTTTTG CCGGGCCCCA GGGCCGGGCG GTAGAACTGG CCTTTAACGT CACCCCGGGG GAGAAAAGTG TCTTCCTGCC CGGGGTCATG TCCCGTAAAA AACAGGTCGT ACCTCCCCTG CGGCGGTTGC TGCAGGGATA A
|
Protein sequence | MGKEILVIGH QRPDTDSIAA AIGYAALRNK TDGGGFQAAR CGKLNGETEF VLSYFDVPVP PLVNDVRARV KDVLDGGLLF IQPGATVRQA GIFMRQHGVK TLAVVDENRH LLGLFTVGDL ARLLLEAWDT GNVPMDEPVY KVMQSDNLVI FNQDDLITEV RRTMLETRYR NYPVVDDNHC LVGLIARYHL LAMRGKRVIL VDHNEKSQAV PGIEEAETVE IIDHHRVADI ETAEPIMVRN EPVGSTATII ARMYKERGLD PDAAIAGVLC AAILSDTLLF KSPTTTQVDK ELAAWLADIA GLDVANFGRE MFRAGSSLRG RSGREIILED FKSFNFGSNR VGIGQIEIID PDTLPVGRDE LQAELEKLQA EKQYDLVVLM VTDLMRNGTE LLFAGPQGRA VELAFNVTPG EKSVFLPGVM SRKKQVVPPL RRLLQG
|
| |
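The sequences above are printed in space-separated groups of ten characters. As a rough sketch, the helpers below strip that grouping, compute a GC fraction, and translate a coding sequence up to the first stop codon. The gene sequence appears to be given in coding orientation (it begins with ATG and ends with TAA) even though the Strand field is "-", so no reverse complement is applied here; treat that as an assumption. All function names are illustrative.

```python
# Illustrative helpers; none of these names come from the record.
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation tables 1 and 11 share these assignments; they differ
# only in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two groups of the gene sequence above.
print(translate("ATGGGTAAAG AGATTCTGGT"))  # MGKEIL
```

The six residues produced from the first two groups match the start of the listed protein sequence (MGKEIL...). Running `gc_fraction` on the full gene sequence would give a value to compare against the GC content field.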