Gene Moth_2150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2150 
Symbol 
ID3832999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2251289 
End bp2253445 
Gene Length2157 bp 
Protein Length718 aa 
Translation table11 
GC content57% 
IMG OID637830072 
Productmetal dependent phosphohydrolase 
Protein accessionYP_430982 
Protein GI83590973 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain
[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.070124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGAG GAAAATTAAT GCAGTACTAC GAAGAACTGA AACACATCGG CTTACAAAGA 
CTGCAAGAGA GCATGAGCCA GGCCCTGGGG CTGGCAGCTT CGGTCACCTA CCCGGACGGA
CAACTCCTCA CCAAAACCTC CAACCTATGC TCTTTCTGCG CCCTCCTTAA TGCTAATTCA
GAAGGAAGAG CCAAGTGTGG GGCTTCACGT GTAATCTTTG CCAGGGCTGC CGTGGACGCA
GGGAGAGCGA TTCTCGACAC CTGCCATGCC GGGCTGGTAC ATGTGGCAGT ACCCCTCCGG
GTAGCAGGAA AAACAGTAGC GGTACTGGTG GGCGGCAGCG TAGCACTTAA GCCGCTCACA
GAAGAGGAAG TAGCCGAACT TGCCCGGGAG ACAGGCATAG ACCAGGAAGA GCTCTGGGTA
GCGGCCCAAG GGGTACCTTT GTGGTCTGAA GAACGGCTGC GGACAGCGGC GGAGATGATA
AGGGCAGTAA CGGAAACTTT GGCCCAGCTG CTATACACCA AGCAGGAACA GCAGAAAAAG
GCAGACGAAC TCAGCGCTCT CTTTGAATTC AGCAAAACAG TTTCAGGTAG CCTGCAGGTG
GCCGAAGCTG CCCGGCAGGG ACTTCAGGCG GTGCTGGAAT TGACTGGTGC CACCAGCGGG
TCGGTGATAA TGCTGGGCGA AGCGGAACCA GGGGCGGCGA CTCTTGAGGT GGCGGCTACC
CTGGAGCCGG ACAACGAATT AAGGGTTATA CCTGCAGGGG AAATAATAGC CGCGGTTGAG
CGGGAAGCTG TCGCCGCGCA CTTTGAGAGC CGTCCCGGAG AAAGCACGCC CGAAGAAAAG
CGGCCGGCAG TTGCAGTACC TCTTACAGCT GGGGGCAAGG TGACGGGGGT ACTCACCTTA
GCAGGCAAGC CAGGGGGGCA ACGCTTCACC GGAGAGGAAG CCATCTTTTT GACCACCCTG
GGCACCATTC TGGGGCTGGC GCTGGAAAAT GCCCGGCTTT TCCGGAAGGT GCGGGAAAGG
GCAGCGATGC TTGAACGGCT AATCGAAGTA GGGCAGGTGT TATCGAGCCA CCTTGATGTG
GATCTAGTGC TTGAATCGGC CCTGGCAAGT GTAAGGGACG TGCTGGATGC ACGGTGGTGT
GCGCTGCGGG TGCTTGACGA AAATACCGGC GAACTGGTGC TGAGGGCTAG CCTGGGTATG
AACCAGAAGT TGCAGGCGAG GGTAGCCCGC GTTCGGCCGG AGGATAACTT GCTGGGTGAA
GTGTTGCAAA AAGGGGAACC TGTAGTGTTG GAGGACCTGG CTACAGACAA ATCCGGAAGG
CATCTACCTT ACTGTGCCCT GGAGATGCGG GCCCTGGTTG TGGTGCCTGT GAAAGCAGGC
GGAAAGATCC TGGGCACACT GAAGCTTTAT TCTCCTGTAC CGCGTCGCTG GTCGGAAGAG
GAAGTTGAGT ACCTGGGTAC CGTGGCAGCT CAAATCGGGC TGGCGCTGGA AAACGCCCGC
CTTTATTCAT CCCTGCGGGA GTACTACTTG AGTACCGTAC AGTCGCTGGC AGCGGCATTG
GAGGCCAAGG ACGTATACAC GAGGGGGCAT TCCATCCGGG TAGCCAAATG GGCACGCTCC
TGCGCCCGTA TGCTGGGACT TGGTGCTGAA GTGGAGGAAC AGGTTTATCT AGCCGGACTT
TTACACGACC TGGGCAAAAT TGGTGTACAA GAGGACATTC TTCTTAAACC GGGCCCCCTC
ACCCCGGAAG AAAGAAAAGA GATGCAGGGT CATCCCGAAG TAGGAGCCAG GATCCTGGAA
CCGGCCCGGT TCCCTGCGGC GGTCATTGCA GCCGTACGTC ACCATCACGA AGACTATGAG
GGTGGGGGTT ATCCGGCTGG CCTTTCAGGA GAGGAGATCC CGCTTCTAGC GCGCATTATT
CGTGTTGCTG ATGCCTACGA CGCCATGACC TCCGCCAGGC CATACAGAAA AGCGTTCGCC
CCGGAGAAAG CGCGGAATGA ATTGAAAAGG TGTGCAGGTC AGCAATTTGA CCCCCAGGTG
GTAAAGGCAT TTTTACGGAT TCCGAAAGAG GAAATGGAGA ATATTTCCAT GGGGGGGGGG
GGTACCCTAA TAGCTTTGCT GGGCGAAATA CTTTTTTTAC TGAGGCGGCT GCACTGA
 
Protein sequence
MERGKLMQYY EELKHIGLQR LQESMSQALG LAASVTYPDG QLLTKTSNLC SFCALLNANS 
EGRAKCGASR VIFARAAVDA GRAILDTCHA GLVHVAVPLR VAGKTVAVLV GGSVALKPLT
EEEVAELARE TGIDQEELWV AAQGVPLWSE ERLRTAAEMI RAVTETLAQL LYTKQEQQKK
ADELSALFEF SKTVSGSLQV AEAARQGLQA VLELTGATSG SVIMLGEAEP GAATLEVAAT
LEPDNELRVI PAGEIIAAVE REAVAAHFES RPGESTPEEK RPAVAVPLTA GGKVTGVLTL
AGKPGGQRFT GEEAIFLTTL GTILGLALEN ARLFRKVRER AAMLERLIEV GQVLSSHLDV
DLVLESALAS VRDVLDARWC ALRVLDENTG ELVLRASLGM NQKLQARVAR VRPEDNLLGE
VLQKGEPVVL EDLATDKSGR HLPYCALEMR ALVVVPVKAG GKILGTLKLY SPVPRRWSEE
EVEYLGTVAA QIGLALENAR LYSSLREYYL STVQSLAAAL EAKDVYTRGH SIRVAKWARS
CARMLGLGAE VEEQVYLAGL LHDLGKIGVQ EDILLKPGPL TPEERKEMQG HPEVGARILE
PARFPAAVIA AVRHHHEDYE GGGYPAGLSG EEIPLLARII RVADAYDAMT SARPYRKAFA
PEKARNELKR CAGQQFDPQV VKAFLRIPKE EMENISMGGG GTLIALLGEI LFLLRRLH