Gene Information |
Locus tag | GM21_1994 |
Symbol | |
ID | 8137328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2312785 |
End bp | 2314317 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869607 |
Product | Ppx/GppA phosphatase |
Protein accession | YP_003021804 |
Protein GI | 253700615 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
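The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for GM21_1994.
length = gene_length(2312785, 2314317)
print(length)                  # 1533
print(protein_length(length))  # 510
```

Both results agree with the Gene Length (1533 bp) and Protein Length (510 aa) fields.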
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00000000000000431495 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | TTGAAGCAGA CCAGGCTTGC CGCCATCGAC ATCGGCACCA ACTCCATCCG CAGCATAATC ATCGAGACCT CCGGAAACGG CAAATACAAG ATCCTTGACG ACGAGAAGGT GCTGGTGCGG CTGGGCGAAG GGCTGCACCA AAGCGGCGCC ATCTCCCCCG CTGCCTGCAG CCGCGCGTTG GAGGCCCTTT CCCGACAAAA GAAAATCATA GACGGCTACG GCGTCGCCTC CATCGAGGCG GTGGCCACCA GCGCGATGCG CAAGGCGAGT AACGGCGCCG CCCTGGTGCA GGCGATCAAG GACGCCACCG GCGTCGAGGT GGAAGTCATC AGCGGCGAGG AGGAGGCCGA ACTCGCGGCC CTGAGCGCCG CGCACAATTT CGAGCTGGAA GGGGTCAGGC ACCTTATCTT CGACATCGGC GGCGGGAGCA TGGAGCTGAT AGCCGCGCTC GGCTCCCATA CCGAGGAGAT GATCTCCCTG GAACTGGGAG CGGTTTTCCT CACCGAGAGC TTCCTCAAGG GAGACCCGGT GCACCCCTCC GAGCACGAAA AGCTGCGCAA GCACGTCCGC AAGACGCTGA AGCGGGCCTA TACCGGGGAA CGCAGCGGCA TGCAGTGCCT GGTAGGGTCC GGCGGAACCG TCACCTCGAT CGCCGCCATG ATCGCCGCCA CCAGGAAGGA GAAGTACGAC TCGGTGCACG GCTACGAGCT CCTCCGCTCG GAGGTGGTGC ACCTTCTGGC GATGCTGGTC AGAAAGAACG ACAAGGAGCG GCGTACCATC CCCGGGCTCA ACCCGGACCG ATCCGACATC ATCGTGGCCG GGGTCACCGT AATCGACGAA CTGATGGATT TTTTCCAGGT GAACCTGCTC AAGGTGAACG AGCGGGGGAT CAGGGAAGGG CTGATACTGA GGGGGCTGCG GCGGCAGAAC CTGCTCCCCC ACGAGAAAAG GACCCGCTCC TGGCGCAACT CGGCGCTGGA GTTCGGCCAT TCCTGCCATT TCGACCAGGG TCACGCGGAG CATGTGGCCA AACTGGCCCA GCAGGTGGCG AAGGCATTGG CGCCCAAGTT CAAGCTGGCC GAACGGGAGC TGCGGCTACT GGAGGCGGCG GCGCTTTTGC ACGACGTCGG GTATTTCATC AACTATTCCA GTCACCACAA GCACTCCTAC CACCTGATCC GCCATGCCGA CCTCTTCGGT TTCACCCCGC GCGAACGGGA GTTGATCGCC AACGTGGCGC GCTACCACCG TAAATCTATC CCCAAGAAAA AACACGACCA GTTCGTGCGG CTTCCGGCTG GCGACCAGTT GCTGGTTTCG CGCCTGGGAG GGATCCTGCG GCTTTGCGAC GGGCTGGACC GGCGCCGAAA TGGAGTGGTT AAAGAGCTTC GCTGCCGGCT TTCGCCGGAC GGCACGCTGC GCGTGACCCT GGTGGGCGAT GAGGACATGT CGGTGGAACT CTACGGTGCG AAGGCCAAGG GAGACCTGCT GCAGGAGGCT TTCCATCTGA AGCTTGCGCT GGAGGCGGGC TGA
Protein sequence | MKQTRLAAID IGTNSIRSII IETSGNGKYK ILDDEKVLVR LGEGLHQSGA ISPAACSRAL EALSRQKKII DGYGVASIEA VATSAMRKAS NGAALVQAIK DATGVEVEVI SGEEEAELAA LSAAHNFELE GVRHLIFDIG GGSMELIAAL GSHTEEMISL ELGAVFLTES FLKGDPVHPS EHEKLRKHVR KTLKRAYTGE RSGMQCLVGS GGTVTSIAAM IAATRKEKYD SVHGYELLRS EVVHLLAMLV RKNDKERRTI PGLNPDRSDI IVAGVTVIDE LMDFFQVNLL KVNERGIREG LILRGLRRQN LLPHEKRTRS WRNSALEFGH SCHFDQGHAE HVAKLAQQVA KALAPKFKLA ERELRLLEAA ALLHDVGYFI NYSSHHKHSY HLIRHADLFG FTPRERELIA NVARYHRKSI PKKKHDQFVR LPAGDQLLVS RLGGILRLCD GLDRRRNGVV KELRCRLSPD GTLRVTLVGD EDMSVELYGA KAKGDLLQEA FHLKLALEAG
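The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the record, of how the gene sequence might be cleaned and sanity-checked against the listed fields (translation table 11, GC content). The helper names are hypothetical. The start and stop codon sets are those of NCBI translation table 11, and the sample is only the first 30 bases of the gene sequence.

```python
# Start and stop codons of NCBI translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if seq is a whole number of codons with a table 11 start and stop."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE11_STARTS
        and seq[-3:] in TABLE11_STOPS
    )


# First three blocks of the gene sequence, used only as a small sample.
sample = clean_sequence("TTGAAGCAGA CCAGGCTTGC CGCCATCGAC")
print(len(sample))          # 30
print(gc_fraction(sample))  # 0.6
print(sample[:3] in TABLE11_STARTS)  # True
```

The gene begins with TTG, one of the alternative start codons in table 11, which is why the protein sequence still begins with M. Running `gc_fraction` and `looks_like_cds` on the full cleaned gene sequence would be one way to compare against the listed GC content and gene length.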