Gene Information |
Locus tag | GM21_2567 |
Symbol | |
ID | 8137909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2996988 |
End bp | 2998358 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870175 |
Product | carboxyl-terminal protease |
Protein accession | YP_003022365 |
Protein GI | 253701176 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| |
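The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2996988, 2998358)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

Under these assumptions the computed values agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields.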
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.00000043041 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAAAAA AGGTTTTCGC CGCATTTGCA GTGATCTTTC TCCTCCTCTC GCTCGTCTTG ATGCTCCCCC TGCTGGACCG GGAGGAGCGG GCCAAAAGAA GCGAGGCGGA GTACCTCGAG ATGTTCCGCG AGGTGGTCAA CATCGTCAAG CAAAGCTACG TGGACAAGGT CGACGACAAG AAGCTCATGG CCGGCGCCAT CAACGGGATG CTGGCGACGC TCGACCCGCA CAGCACCTAT CTCCCCGCAA CCGACTATAC CGAGATGAAG GTGCACATGG CCGGCGCCTT CGGCGGCCTG GGGATCGAGC TGGAGATGCG CAACGGCAAG CTGATGGTGA ACGCCCCCAT CGAGGATACC CCCGCCTTCC GGGCCGGGAT CCAGTCCGGC GACCATATCT GGACCATCGA CGGCAAGCCG ACCGCCGACC TCAACATCAA CCAATGCGTG AGCCGCATGC GCGGGACCCC CGGAACCTCG GTCACCCTCG GCATAATGCG CGAGGGGAAG CCGTCCCCGC TCACCTTCCG CCTGGTGCGG GCCATAATCA AGACCAAGAG CCTGAAAGGG AGGCTCCTCG AGCCGGGGTA CGGCTACATC CGGATCGGCG AATTCCAGGA GCGCACCGGC GAGGACTTCG AAAAGGCACT GAAGACGCTT GCCGCCGACA ACGGCCAGCC GCTATCGGGC CTCGTGCTGG ACCTCCGGTT CAACCCGGGG GGGCTGGTGG ACCAGGCTTA CCGCGTCGCC AACCGCTTCA TCGGCGAGGG GCTTTCCTCA GGGGTCATCG TCACCACCAA AGGGCGCGAC CCATCGATGG AACGAAGCCT GACCGCGACC GTCGGCGACA AGGAGCCCCG CTACCCGATC GTGGTGCTCA TAAACGGCGG CAGCGCCAGC GCCTCCGAGA TCGTGGCGGG CGCGCTGCAG GACCAAAAGA GAGCGGTCAT CATGGGGACC CAGAGCTTCG GCAAGGGAAG CGTCCAGTCG GTGATGACGC TCGACAACGG CGACGGCCTG AAGCTCACCA CGGCCCGCTA CTACACCCCC AGCGGGCGTT CCATCCAGGC CAAGGGGATC ACACCCGACA TCGTGGTGGA GTTTGCCAAG CCCGCCCCCC CCGCGAAAGA CAAGCAAAAG GGTGAGAAGG AACTGGAGAT CCGCGAGCAG GATCTGGACG GGCACATGGA CCAGGCTCCG GCACCGACGC GCCCGGCGAA TCCGCACCAG GCTCCCCCTC CCTCTCCGAG CCTAAAGCCG AGCGGCAAGG AGGTGAAAGA GCAGGACCTT CTGAAAGCTG ACAACCAGCT GGCCCGGGCG CTCGACCTTC TGAAGGGAGT GAACCTGCTG CAAGCGAGCG GCCGGCGTTG A
|
Protein sequence | MSKKVFAAFA VIFLLLSLVL MLPLLDREER AKRSEAEYLE MFREVVNIVK QSYVDKVDDK KLMAGAINGM LATLDPHSTY LPATDYTEMK VHMAGAFGGL GIELEMRNGK LMVNAPIEDT PAFRAGIQSG DHIWTIDGKP TADLNINQCV SRMRGTPGTS VTLGIMREGK PSPLTFRLVR AIIKTKSLKG RLLEPGYGYI RIGEFQERTG EDFEKALKTL AADNGQPLSG LVLDLRFNPG GLVDQAYRVA NRFIGEGLSS GVIVTTKGRD PSMERSLTAT VGDKEPRYPI VVLINGGSAS ASEIVAGALQ DQKRAVIMGT QSFGKGSVQS VMTLDNGDGL KLTTARYYTP SGRSIQAKGI TPDIVVEFAK PAPPAKDKQK GEKELEIREQ DLDGHMDQAP APTRPANPHQ APPPSPSLKP SGKEVKEQDL LKADNQLARA LDLLKGVNLL QASGRR
|
| |
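The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that display formatting and compute a GC percentage follows. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not expected to match the GC content field for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_blocks = clean_sequence("ATGTCAAAAA AGGTTTTCGC")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 35.0
```

Passing the full Gene sequence field through the same two functions gives a value that can be compared against the reported GC content.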