Gene Moth_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2414 
Symbol 
ID3832165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2536030 
End bp2537340 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content56% 
IMG OID637830333 
Productputative manganese-dependent inorganic pyrophosphatase 
Protein accessionYP_431239 
Protein GI83591230 
COG category[C] Energy production and conversion
[T] Signal transduction mechanisms 
COG ID[COG1227] Inorganic pyrophosphatase/exopolyphosphatase
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000798008 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAG AGATTCTGGT TATCGGACAC CAGCGACCGG ACACAGATTC CATCGCCGCA 
GCCATCGGTT ACGCTGCCCT GCGGAACAAA ACGGATGGGG GCGGTTTTCA AGCGGCGCGT
TGCGGCAAGC TAAACGGGGA GACGGAATTC GTGCTGTCAT ATTTTGATGT ACCCGTACCT
CCCCTGGTAA ACGACGTTCG CGCCCGGGTG AAGGATGTCC TGGACGGGGG ACTGCTCTTC
ATCCAGCCGG GGGCTACAGT GCGCCAGGCC GGGATTTTTA TGCGCCAGCA CGGCGTCAAG
ACCCTGGCGG TGGTCGATGA AAACCGGCAT CTTCTGGGCC TGTTTACCGT CGGTGACCTG
GCCCGGCTCC TCCTGGAGGC CTGGGATACC GGTAATGTGC CCATGGATGA ACCCGTTTAT
AAGGTTATGC AGAGCGATAA CCTGGTAATC TTTAACCAGG ACGATTTAAT TACCGAGGTC
CGCCGCACCA TGCTGGAAAC TCGCTACCGC AACTACCCGG TAGTAGACGA CAATCACTGT
CTGGTCGGCC TGATTGCCCG TTATCACCTG CTGGCCATGC GAGGAAAAAG GGTAATCCTG
GTCGATCACA ATGAAAAAAG CCAGGCGGTA CCCGGGATAG AAGAAGCCGA AACTGTGGAG
ATAATCGACC ACCACCGGGT GGCTGATATT GAGACGGCCG AACCCATCAT GGTGCGTAAC
GAGCCCGTCG GTAGTACGGC AACCATCATC GCCAGGATGT ATAAAGAGCG GGGCCTGGAT
CCAGATGCAG CCATAGCCGG GGTTTTATGC GCTGCTATTC TCTCGGATAC CCTGTTGTTT
AAATCGCCGA CAACTACCCA AGTTGATAAA GAACTGGCGG CCTGGCTTGC TGATATTGCC
GGGTTAGACG TCGCCAATTT TGGCCGCGAA ATGTTCCGGG CCGGGTCTTC CTTGAGGGGC
CGCTCGGGCC GGGAAATAAT TCTGGAGGAC TTCAAGAGCT TCAATTTTGG CAGCAACCGG
GTCGGCATCG GTCAGATTGA GATTATTGAC CCCGACACCC TGCCCGTGGG CCGGGACGAA
CTCCAGGCCG AATTGGAAAA ACTTCAGGCC GAGAAGCAGT ACGACCTGGT CGTCCTTATG
GTAACCGATT TAATGCGCAA CGGTACGGAA TTACTTTTTG CCGGGCCCCA GGGCCGGGCG
GTAGAACTGG CCTTTAACGT CACCCCGGGG GAGAAAAGTG TCTTCCTGCC CGGGGTCATG
TCCCGTAAAA AACAGGTCGT ACCTCCCCTG CGGCGGTTGC TGCAGGGATA A
 
Protein sequence
MGKEILVIGH QRPDTDSIAA AIGYAALRNK TDGGGFQAAR CGKLNGETEF VLSYFDVPVP 
PLVNDVRARV KDVLDGGLLF IQPGATVRQA GIFMRQHGVK TLAVVDENRH LLGLFTVGDL
ARLLLEAWDT GNVPMDEPVY KVMQSDNLVI FNQDDLITEV RRTMLETRYR NYPVVDDNHC
LVGLIARYHL LAMRGKRVIL VDHNEKSQAV PGIEEAETVE IIDHHRVADI ETAEPIMVRN
EPVGSTATII ARMYKERGLD PDAAIAGVLC AAILSDTLLF KSPTTTQVDK ELAAWLADIA
GLDVANFGRE MFRAGSSLRG RSGREIILED FKSFNFGSNR VGIGQIEIID PDTLPVGRDE
LQAELEKLQA EKQYDLVVLM VTDLMRNGTE LLFAGPQGRA VELAFNVTPG EKSVFLPGVM
SRKKQVVPPL RRLLQG