Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1934 |
Symbol | |
ID | 8137268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2245005 |
End bp | 2246033 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869548 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_003021745 |
Protein GI | 253700556 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 101 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAGTTC TTGCTTTGGA ATCATCCTGC GACGAAACGG CAGCCGCGGT GGTCAAGGAC GGCCGCACCG TCCTCTCCAG CATCGTCGCC TCCCAGATCA GCGTCCACGC CGAATACGGC GGCGTGGTCC CCGAAATCGC ATCCCGAAAG CACCTGGAGT CCGTCTCCTT CGTGGTGGAA CAGGCGTTAG CGGAGGCCGG CGTCGGTCTC GACCGGATCG ATGGGATCGC CGTGACCCAG GGGCCCGGCC TTGCCGGGGC GCTCCTGGTG GGGATCTCCG TCGCCAAGGG GCTCGCCTTC GGCCGTTCGC TCCCGCTCGT CGGGGTGAAC CACATCGAGG GGCACCTTTT GGCCGTCTTC CTGGAGGCGC CGGTGCAGTT TCCCTTCATC GCGCTCGCCG TCTCCGGGGG GCACTCGCAC CTGTACCGCG TGGACGGGAT CGGACGCTAC CAGACTCTGG GGCAGACGGT CGACGACGCC GCAGGCGAAG CCTTCGACAA GGTGGCGAAG CTGATCGGGC TCCCTTACCC GGGGGGCGTG GCGATAGACC GGCTCGCCGT CTCGGGTGAC CCTAAGGCCA TCAAGTTCCC GCGCCCGCTT CTGCACGACG GCACCTTCAA CTTCAGCTTC TCGGGGTTGA AGACCGCGGT GCTGACCCAC GTCGGCAAGC ATCCGGAGGC GAAGGAGGCC GGGATCAACG ATCTCGCCGC CTCGTTCCAG GCCGCGGTCT GCGAGGTGCT CACCAAGAAA ACGGCGGCCG CCGTCGCCGC AACCGGGATC AAAAGGCTGG TCGTGGCCGG AGGTGTCGCC TGCAACAGCG CGCTGCGCCG CTCCATGGCC GAGTATGCCG CGGCGAACGG GGTGGAACTT TCCATTCCCT CGCCCGCCCT TTGCGCCGAC AACGCCGCCA TGATAGCGGT CCCCGGCGAC TACTACTTAG GGCTCGGGGT GACGAGCGGT TTCGATCTCG ACGCGCTTCC GGTCTGGCCC CTGGACAAGC TGGCCCTCCG GCTGAAGGAG CATTGCTGA
|
Protein sequence | MLVLALESSC DETAAAVVKD GRTVLSSIVA SQISVHAEYG GVVPEIASRK HLESVSFVVE QALAEAGVGL DRIDGIAVTQ GPGLAGALLV GISVAKGLAF GRSLPLVGVN HIEGHLLAVF LEAPVQFPFI ALAVSGGHSH LYRVDGIGRY QTLGQTVDDA AGEAFDKVAK LIGLPYPGGV AIDRLAVSGD PKAIKFPRPL LHDGTFNFSF SGLKTAVLTH VGKHPEAKEA GINDLAASFQ AAVCEVLTKK TAAAVAATGI KRLVVAGGVA CNSALRRSMA EYAAANGVEL SIPSPALCAD NAAMIAVPGD YYLGLGVTSG FDLDALPVWP LDKLALRLKE HC
|
| |