Gene Information |
Locus tag | Moth_2150 |
Symbol | |
ID | 3832999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2251289 |
End bp | 2253445 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830072 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_430982 |
Protein GI | 83590973 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2206] HD-GYP domain; [COG3605] Signal transduction protein containing GAF and PtsI domains |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
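The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2251289
END_BP = 2253445
GENE_LENGTH_BP = 2157
PROTEIN_LENGTH_AA = 718


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values against the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 2253445 - 2251289 + 1 = 2157 bp, and 2157 / 3 - 1 = 718 aa.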
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.070124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGAG GAAAATTAAT GCAGTACTAC GAAGAACTGA AACACATCGG CTTACAAAGA CTGCAAGAGA GCATGAGCCA GGCCCTGGGG CTGGCAGCTT CGGTCACCTA CCCGGACGGA CAACTCCTCA CCAAAACCTC CAACCTATGC TCTTTCTGCG CCCTCCTTAA TGCTAATTCA GAAGGAAGAG CCAAGTGTGG GGCTTCACGT GTAATCTTTG CCAGGGCTGC CGTGGACGCA GGGAGAGCGA TTCTCGACAC CTGCCATGCC GGGCTGGTAC ATGTGGCAGT ACCCCTCCGG GTAGCAGGAA AAACAGTAGC GGTACTGGTG GGCGGCAGCG TAGCACTTAA GCCGCTCACA GAAGAGGAAG TAGCCGAACT TGCCCGGGAG ACAGGCATAG ACCAGGAAGA GCTCTGGGTA GCGGCCCAAG GGGTACCTTT GTGGTCTGAA GAACGGCTGC GGACAGCGGC GGAGATGATA AGGGCAGTAA CGGAAACTTT GGCCCAGCTG CTATACACCA AGCAGGAACA GCAGAAAAAG GCAGACGAAC TCAGCGCTCT CTTTGAATTC AGCAAAACAG TTTCAGGTAG CCTGCAGGTG GCCGAAGCTG CCCGGCAGGG ACTTCAGGCG GTGCTGGAAT TGACTGGTGC CACCAGCGGG TCGGTGATAA TGCTGGGCGA AGCGGAACCA GGGGCGGCGA CTCTTGAGGT GGCGGCTACC CTGGAGCCGG ACAACGAATT AAGGGTTATA CCTGCAGGGG AAATAATAGC CGCGGTTGAG CGGGAAGCTG TCGCCGCGCA CTTTGAGAGC CGTCCCGGAG AAAGCACGCC CGAAGAAAAG CGGCCGGCAG TTGCAGTACC TCTTACAGCT GGGGGCAAGG TGACGGGGGT ACTCACCTTA GCAGGCAAGC CAGGGGGGCA ACGCTTCACC GGAGAGGAAG CCATCTTTTT GACCACCCTG GGCACCATTC TGGGGCTGGC GCTGGAAAAT GCCCGGCTTT TCCGGAAGGT GCGGGAAAGG GCAGCGATGC TTGAACGGCT AATCGAAGTA GGGCAGGTGT TATCGAGCCA CCTTGATGTG GATCTAGTGC TTGAATCGGC CCTGGCAAGT GTAAGGGACG TGCTGGATGC ACGGTGGTGT GCGCTGCGGG TGCTTGACGA AAATACCGGC GAACTGGTGC TGAGGGCTAG CCTGGGTATG AACCAGAAGT TGCAGGCGAG GGTAGCCCGC GTTCGGCCGG AGGATAACTT GCTGGGTGAA GTGTTGCAAA AAGGGGAACC TGTAGTGTTG GAGGACCTGG CTACAGACAA ATCCGGAAGG CATCTACCTT ACTGTGCCCT GGAGATGCGG GCCCTGGTTG TGGTGCCTGT GAAAGCAGGC GGAAAGATCC TGGGCACACT GAAGCTTTAT TCTCCTGTAC CGCGTCGCTG GTCGGAAGAG GAAGTTGAGT ACCTGGGTAC CGTGGCAGCT CAAATCGGGC TGGCGCTGGA AAACGCCCGC CTTTATTCAT CCCTGCGGGA GTACTACTTG AGTACCGTAC AGTCGCTGGC AGCGGCATTG GAGGCCAAGG ACGTATACAC GAGGGGGCAT TCCATCCGGG TAGCCAAATG GGCACGCTCC TGCGCCCGTA TGCTGGGACT TGGTGCTGAA GTGGAGGAAC AGGTTTATCT AGCCGGACTT TTACACGACC TGGGCAAAAT TGGTGTACAA GAGGACATTC TTCTTAAACC GGGCCCCCTC ACCCCGGAAG AAAGAAAAGA GATGCAGGGT CATCCCGAAG TAGGAGCCAG GATCCTGGAA 
CCGGCCCGGT TCCCTGCGGC GGTCATTGCA GCCGTACGTC ACCATCACGA AGACTATGAG GGTGGGGGTT ATCCGGCTGG CCTTTCAGGA GAGGAGATCC CGCTTCTAGC GCGCATTATT CGTGTTGCTG ATGCCTACGA CGCCATGACC TCCGCCAGGC CATACAGAAA AGCGTTCGCC CCGGAGAAAG CGCGGAATGA ATTGAAAAGG TGTGCAGGTC AGCAATTTGA CCCCCAGGTG GTAAAGGCAT TTTTACGGAT TCCGAAAGAG GAAATGGAGA ATATTTCCAT GGGGGGGGGG GGTACCCTAA TAGCTTTGCT GGGCGAAATA CTTTTTTTAC TGAGGCGGCT GCACTGA
|
Protein sequence | MERGKLMQYY EELKHIGLQR LQESMSQALG LAASVTYPDG QLLTKTSNLC SFCALLNANS EGRAKCGASR VIFARAAVDA GRAILDTCHA GLVHVAVPLR VAGKTVAVLV GGSVALKPLT EEEVAELARE TGIDQEELWV AAQGVPLWSE ERLRTAAEMI RAVTETLAQL LYTKQEQQKK ADELSALFEF SKTVSGSLQV AEAARQGLQA VLELTGATSG SVIMLGEAEP GAATLEVAAT LEPDNELRVI PAGEIIAAVE REAVAAHFES RPGESTPEEK RPAVAVPLTA GGKVTGVLTL AGKPGGQRFT GEEAIFLTTL GTILGLALEN ARLFRKVRER AAMLERLIEV GQVLSSHLDV DLVLESALAS VRDVLDARWC ALRVLDENTG ELVLRASLGM NQKLQARVAR VRPEDNLLGE VLQKGEPVVL EDLATDKSGR HLPYCALEMR ALVVVPVKAG GKILGTLKLY SPVPRRWSEE EVEYLGTVAA QIGLALENAR LYSSLREYYL STVQSLAAAL EAKDVYTRGH SIRVAKWARS CARMLGLGAE VEEQVYLAGL LHDLGKIGVQ EDILLKPGPL TPEERKEMQG HPEVGARILE PARFPAAVIA AVRHHHEDYE GGGYPAGLSG EEIPLLARII RVADAYDAMT SARPYRKAFA PEKARNELKR CAGQQFDPQV VKAFLRIPKE EMENISMGGG GTLIALLGEI LFLLRRLH