Gene Information |
Locus tag | GM21_1526 |
Symbol | |
ID | 8136855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1788683 |
End bp | 1789921 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869138 |
Product | peptidase U32 |
Protein accession | YP_003021340 |
Protein GI | 253700151 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
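
The coordinate and length fields above are consistent with one another, and the relationship can be checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1788683, 1789921)
print(length, protein_length(length))  # 1239 412
```

The results match the Gene Length (1239 bp) and Protein Length (412 aa) fields.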
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 2.31073e-27 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGGCAGA AGCCGCGCCA AGGACGGGTG AAGCCCGAGC TTTTGGCGCC TGCGGGGAAC ATGGAGAAAC TGAAGGTCGC CATCCGTTAT GGCGCCGACG CGGTGTACCT GGGGGGCAAA TCGTTTGGCC TTAGAAACCT GGCCGGCAAC TTCAGCCTCC CGGAGCTCTC GCACGCAGTC GACTACGCGC ACCGGCACGG CGTCAAGGTG TATCTGACCG TGAACGCCTT CGCCGACAAC CGCGATTTGA TCGAGCTGGA GCGCTACCTG GAGGAGATCC GCGAGATCCC CTTCGACGCC CTGATCGCCG CCGACCCCGG CGTCGTTGCG CTGATAGCCG AGCGCTGCCC CGGCCGCGAC ATCCACCTTT CCACCCAGGC CAACACCACC AACTGGCGCT CCGCCCGCTT CTGGCAGGCG CAGGGGGTGA AACGGGTGAA TCTCGCCCGT GAGATGTCGC TGGAGCAGAT TGCCGAGACG GCGGGGATGT GCAACGAAAT CGAGCTTGAA GTCTTCGTCC ACGGCGCCAT GTGCATCTCG TATTCGGGGC GCTGCCTCCT CTCCCTTGCC ATGACCGGCA GGGACGCCAA CAAGGGAGAG TGCACCCAGC CCTGCCGCTG GAACTACGCC ATCGTAGAGG AGAGCCGTCC CGGGGAGTAT TTCCCCATCC ACGAGGACGA AAGCGGGAGC TTCATCTTCA ACTCGAAGGA TCTCTGCCTG ATCGAGCAGC TTCCGGATCT GGTGGAGAGC GGCGTCCACT CCTTGAAGAT AGAGGGGAGG ATGAAAGGGA TCTACTACGC GGCCAGCGTG ATCCGCATCT ACCGCGAGGC GCTGGACAGC TACTGGGAGG ACCCGGTGAA CTACCGGTTG AACCCGGCGT GGCTGGAGGA GCTAAGCAAG ATCAGCCACC GCGGGTACAC CACCGGGTTT TTGTTGGGCA AGCCGCGCGA CGTGGACCAC GAGTACCTCT CGCGTTACGT GAGGAATTTC GAGTTTGTGG CGCTGGTCGA GGGGGAAGCT AAAGGGGGAG GCACCCTGGT TGCGGTGAGG AACAGGTTGC AGTTGGGCGA CGCGTTGGAA CTGATCGGTC AAGGCACGTG CTTCACCAGA TTCATATTGG AGTCGATGGA AGACGAGGAC GGCGTCCCGC TCCAGGTAGC CCATCCAAAC CAGCGGGTAG TACTGAAAGA ACTCACCGGT GCAGGAGAGT ACGATCTGAT CAGGAGAGAA AAAACTTGA
|
Protein sequence | MGQKPRQGRV KPELLAPAGN MEKLKVAIRY GADAVYLGGK SFGLRNLAGN FSLPELSHAV DYAHRHGVKV YLTVNAFADN RDLIELERYL EEIREIPFDA LIAADPGVVA LIAERCPGRD IHLSTQANTT NWRSARFWQA QGVKRVNLAR EMSLEQIAET AGMCNEIELE VFVHGAMCIS YSGRCLLSLA MTGRDANKGE CTQPCRWNYA IVEESRPGEY FPIHEDESGS FIFNSKDLCL IEQLPDLVES GVHSLKIEGR MKGIYYAASV IRIYREALDS YWEDPVNYRL NPAWLEELSK ISHRGYTTGF LLGKPRDVDH EYLSRYVRNF EFVALVEGEA KGGGTLVAVR NRLQLGDALE LIGQGTCFTR FILESMEDED GVPLQVAHPN QRVVLKELTG AGEYDLIRRE KT
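
The sequences above are displayed in blocks of ten characters, and the gene sequence as shown begins with GTG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to join the blocks, compute a GC fraction, and translate the start of the gene. It assumes the amino acid assignments of translation table 11 (the same as the standard code) and assumes the first codon is a valid start codon that is read as methionine. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 shares
# these assignments with the standard code and differs in its start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between ten-character blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is a start codon (for example GTG here),
    so it is reported as M regardless of its usual amino acid.
    """
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("GTGGGGCAGA AGCCGCGCCA AGGACGGGTG")
print(translate_cds(fragment))  # MGQKPRQGRV
```

The translated fragment matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.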