Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0783 |
Symbol | |
ID | 8136098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 932499 |
End bp | 933167 |
Gene Length | 669 bp |
Protein Length | 222 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868400 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 1 |
Protein accession | YP_003020615 |
Protein GI | 253699426 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 124 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCCTA CACGGCTGCT CATTTTCGAC CTGGACGGGA CGCTGATCGA TTCGCTCCCC GACCTGACCG ACGCCACAAA CCTGATCCGC AGCAGGCACG GCCTCCCCGG GATAGGTATT CCCGAGGTGC GCAAGCTGGT AGGGCAGGGG GCGCGCAACC TGGTGGAGCG TGCGCTCCCC GGGGCCACGG CGGCACAGGT GGACGAGGCG CTGGGAGTAT TTCTCGACTA CAACCTGGCG CACATAGCCG ACAAGACCCG TCCCTATCCC GGCGTCCCCG AGACCCTGGA AAAACTGCGA ACCTTCGGCA TCCCCATGGT GGTCCTTTCC AACAAGAACG TCGCCCTCTG CAAGGAAGTC CTCGCCAAGC TTGGTATCGG GGACGCCTTC GCCGAGGTGT TCGGGGCCGA CTCCTTCCCC TACCGAAAGC CCTCGCCCGA GCCCGTGCTG GCCGTCCTGA GGCAATATGG AATTGAAGCT GCGGAATGCG TCATGGTGGG GGACAGTATC AACGACATCG CGGCAGGAGT AGGGGCAGGC GTTTTCACCG TCGGCTGCAG TTACGGTTAC GGCGAGGCGA GCGAGCTAGT CAAAGCCCAC TATCAGGTTT CCGATTTTCC CTCACTGCTC CAATTGCCGT TCTTTAACAT AAAAAGCAAT GAGCAATGA
|
Protein sequence | MGPTRLLIFD LDGTLIDSLP DLTDATNLIR SRHGLPGIGI PEVRKLVGQG ARNLVERALP GATAAQVDEA LGVFLDYNLA HIADKTRPYP GVPETLEKLR TFGIPMVVLS NKNVALCKEV LAKLGIGDAF AEVFGADSFP YRKPSPEPVL AVLRQYGIEA AECVMVGDSI NDIAAGVGAG VFTVGCSYGY GEASELVKAH YQVSDFPSLL QLPFFNIKSN EQ
|
| |