Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2131 |
Symbol | |
ID | 3833131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2229345 |
End bp | 2230340 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637830056 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_430966 |
Protein GI | 83590957 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2206] HD-GYP domain |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000504483 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.911258 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGAA GTGCAGGCCT GTACAGCGAT GGTATCATTG ATAGGTTGGC TATCAATCGC CTGAATTTAG CCACCCCGGC AGTAAAGACC CTTTTCCGGT GGGCGGCGGT TTTATCTCTA GCTATTATTA CGATTTTAAA CCATTATGCC CGGGGCCTGC CTTTCCACTT TCTCCTTGAC TTCCTCTACC TCGTCCCGGT AACCGTCGCC GCCCTCAATT CTTTACTCGA AGGCCTGGCG GTTGCCCTGG TGGCTGGTAG CCTGCGTATG CTTACGAACC CTTTAGTTTT TTCTATTATT AAACCCAGCG ATTATGTTGA TTTTCTAGTG GTAACCGGCT TTTACCTGGT CGACCCGGTG ACTATAGAGC TATTGAGACG CCTGGCATGG CAGCGGCAGC AGCTCCAGCG TAATCTGCAA CTGACGACGG CTGCCTTGCT CGAGGCCCTG CAGATGCGCG ACCAGTATAC CGGTTGGCAC TCACGCCAGG TAGCCATTTA TGCGCGCCGG ATAGCCGCCA GCTTAGGTTT ATCGCCGTAC CACCAGGAGT GCCTTTACCT GGCCGGGCTG CTCCATGACA TCGGGAAAAT CGGCGTTGAT GACGCCTGCC TGAACAAACC CGGCCTGCTG ACGCCGGAAG AATGGCAAAA CGTCCGCCGC CACCCCGGAT TGGGATACAA GATAATAAGA AAAGTAACCA GTCGAGAAGA AGTTATTGCC CGGGCAGTCC TGTATCACCA CGAGCGCTAT GACGGCCGCG GTTATCCCAG AGGGCTGAAA GGTACAAGCA TCCCCTTGGA AGCCCGCATT TTAAGTGTGG CCGACTGCTT TGACGCCATG ACCACGGACC GGGTTTACCG GCCGGCCCTG TCCCCGGCTG AGGCTGTCAA GGAACTAATG CGCTGCGCCG GCAGCCAGTT TGACCCTGGC ATAGTCGAGG TCTTCTACCG CATCCTGGCC GCGGATGGCC TGATACAGAA CCCGGAGGAG GGGTAA
|
Protein sequence | MGRSAGLYSD GIIDRLAINR LNLATPAVKT LFRWAAVLSL AIITILNHYA RGLPFHFLLD FLYLVPVTVA ALNSLLEGLA VALVAGSLRM LTNPLVFSII KPSDYVDFLV VTGFYLVDPV TIELLRRLAW QRQQLQRNLQ LTTAALLEAL QMRDQYTGWH SRQVAIYARR IAASLGLSPY HQECLYLAGL LHDIGKIGVD DACLNKPGLL TPEEWQNVRR HPGLGYKIIR KVTSREEVIA RAVLYHHERY DGRGYPRGLK GTSIPLEARI LSVADCFDAM TTDRVYRPAL SPAEAVKELM RCAGSQFDPG IVEVFYRILA ADGLIQNPEE G
|
| |