Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2548 |
Symbol | |
ID | 8137890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2976989 |
End bp | 2978149 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870157 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_003022347 |
Protein GI | 253701158 |
COG category | [R] General function prediction only |
COG ID | [COG3481] Predicted HD-superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1.98451e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAAAAGA AATGCGTTGC CGAGATAAAG GACCGCGACC TGGTGGACGC GGTCTTCCTG GTCAAGGAGA AGATCGTGGC CATGGCCAAG AACGGCAAGC CGTACCTCAC CCTGAAGCTT ATGGACAGAA GCGGCGAGGT CGACGCCAAG GTCTGGGACA ACGCGGACCA GGTGGGGGCG CTCTTCGACC GCAACGACTT CCTCGCGGTG CGCGCGAAGG CGAGCGTCTA CCTGGGAAAG ATGCAGCTGA TCGTCTCGGA GCTTAAGAAG GTCCCCGACG ACTCGGTGGA TCTGGCGGAC TTCCTTCCCG AAACCGACCG GGACGTCAAG GCGATGGTCG AGGAGCTGCA CGCCCTCGTC GCCGGCGTGA AGGACCCGGA CCTCGCGCGG CTTTTGTCCT CCTTCTTCCA CGACCCAGAG CTTTTGGCCC AGTATCGCGT CGCCCCCGCG GCCAAGGGGA TGCACCACGT CTATCTCGGG GGGCTTTTGG AGCACTCGCT CGCCGTGGCG AAGCTGGTGG ACGCCATGGT CCCGCTCTAC CCGGGGCTGA ACCGGGACCT CCTCGTCGCC GGGGCACTTT TGCACGACGT GGGGAAGGTG CGCGAGATGA CCTACCTGCG CTCCTTCGAC TACTCCGACG AGGGGAAGCT GATCGGCCAC ATCACCATCG GCGCCGAGAT GCTGCACGAG CGGATCACGG CGCTGCCGGG TTTTCCGGCC GAGCTCGCCA TGCTCTTGAA GCACATGATC CTGTCGCATC ACGGCCAGTA CGAGTACGGC TCCCCCAAGC GCCCGAAGAC GCTGGAGGCG ACCATCCTCA ACTACCTGGA CGACCTCGAC TCCAAGATCA ACGGCATCAG GACCCACATC CGCAAGGAGC CGGACAATCC CTCGCGCTGG ACCGCGTACC ACCGCCTCTA CGACCGCTAC TTCTTCAAGG AGAACTGCCT GCCTGAGGAG GAGCTGGAAA TCTCCCCCGC GGATTGCCTG GAGCCGTCCG AGCTGATGCC GCAGACGGTG GAGGCGCCGA GCCCTCTCCC GGCGAGCGTG CCGGAGCAGG AAGCGCCGCG CCGGGAGCGC CCTGAGGCAC CCCGCGGCGA CCAGGGGCGC AAGAGCTTCA GCAACAACCC TTTCGCCGCG CTTAAAAACG GCAAGGGTTA A
|
Protein sequence | MKKKCVAEIK DRDLVDAVFL VKEKIVAMAK NGKPYLTLKL MDRSGEVDAK VWDNADQVGA LFDRNDFLAV RAKASVYLGK MQLIVSELKK VPDDSVDLAD FLPETDRDVK AMVEELHALV AGVKDPDLAR LLSSFFHDPE LLAQYRVAPA AKGMHHVYLG GLLEHSLAVA KLVDAMVPLY PGLNRDLLVA GALLHDVGKV REMTYLRSFD YSDEGKLIGH ITIGAEMLHE RITALPGFPA ELAMLLKHMI LSHHGQYEYG SPKRPKTLEA TILNYLDDLD SKINGIRTHI RKEPDNPSRW TAYHRLYDRY FFKENCLPEE ELEISPADCL EPSELMPQTV EAPSPLPASV PEQEAPRRER PEAPRGDQGR KSFSNNPFAA LKNGKG
|
| |