Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2495 |
Symbol | |
ID | 3831598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2600794 |
End bp | 2601996 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637830417 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_431320 |
Protein GI | 83591311 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2206] HD-GYP domain |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.211386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTGCC CGCCGGATCG ATCCTTTTGT GAGCTGGTGG TAGCACTATC AACAATCCTT GATATCGAGG AAGAAACCAA ACTCTATCAT GCCTGGCGGG TAGCCCTGGT AGCCCAGGAA CTGGCCCGGA GGGTCATCCC CGACGAGGCG ACCCTGGTGT TTTATGGTGG GCTGCTCCAC GATATAGGCG CCATGGGGCT GGATGACCAC TTGGTTCATC TGGCCCTCCA GCGTGGTAGC AGAAATAATC CCGAGGTCGT AAACCACCCC CTGCGGGGGG CGGATATGGT GGCAGCCATT CCGGGTCTTG GGGAAAAGGT TGCGGCCATG ATCCGGGACC ATCATGAGCG CTGGAATGGC TCCGGGTACC CCCGGGGTAT TGCCGGGAAT CATATCGTCA CGGGAGCTAT GCTCCTGGGA CTGGCCGATG AACTGGACCT GGTCCTGCGG GTCCATCCGG GTACTTCTTG GGCCAGGCTG AGGGAGACCC TGAACCGCAG GGTTCAGGGT GGGTTTCCGC CGGAACTACT GGCTATCCTG GATAAAATGA TGAATGGTCC CTTGTACGCC GAAATTGCCA CCAATACGGC CCTGGAATTA AAGATGTTTA AGGTGATTTT GGACCTGCCG CCTATTAATT TCCAGGTACC CGATCCGATG AAGATCACTA TCGATCTCTT CGCCCGGATA ATTGACGCCA AACATGCCTA TACTGCCGGC CATTCCCACC GGGTAGCTGC TTATGCTTTA AACCTGGCCC GCTGCCTCGG ATATAATGAT GCTAAATTGC GGCGTCTGGA GATCGCCGGC CTCCTCCACG ATTTCGGCAA AATCGCCGTT CCCCGCGCTA TTCTGGATAA ACGGGGCCGG CTCAACAGCG AAGAATTAAA AGTGGTACGC CGCCACCCGG CCTGGACGAT AGAACTCCTG GAGGGGGTGA CTAGCCTCAA GGATATTGCC CGGGACGCCG GCCTGCATCA CGAACGTTAT GATGGTAAAG GATATCCCTA TGGCCTCCAT GATGGTGAAA TTCCCTTGGG GGCCAGGATC ATCGCCGTGG CCGATGCCTT TGACGCCATG ACTTCTAACC GGCCCTATCA GCCTACCCGT ACGCCGGAAG AGGCTTTAAA AATCCTGGCA GGGGGGGCCG GCACTCAGTT TGACCCGGAG GTGACAGCAG TTGCCTCCTG CCTGCTAGCC TGA
|
Protein sequence | MGCPPDRSFC ELVVALSTIL DIEEETKLYH AWRVALVAQE LARRVIPDEA TLVFYGGLLH DIGAMGLDDH LVHLALQRGS RNNPEVVNHP LRGADMVAAI PGLGEKVAAM IRDHHERWNG SGYPRGIAGN HIVTGAMLLG LADELDLVLR VHPGTSWARL RETLNRRVQG GFPPELLAIL DKMMNGPLYA EIATNTALEL KMFKVILDLP PINFQVPDPM KITIDLFARI IDAKHAYTAG HSHRVAAYAL NLARCLGYND AKLRRLEIAG LLHDFGKIAV PRAILDKRGR LNSEELKVVR RHPAWTIELL EGVTSLKDIA RDAGLHHERY DGKGYPYGLH DGEIPLGARI IAVADAFDAM TSNRPYQPTR TPEEALKILA GGAGTQFDPE VTAVASCLLA
|
| |