Gene GM21_2567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2567 
Symbol 
ID8137909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2996988 
End bp2998358 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content64% 
IMG OID644870175 
Productcarboxyl-terminal protease 
Protein accessionYP_003022365 
Protein GI253701176 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00000043041 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAAAAA AGGTTTTCGC CGCATTTGCA GTGATCTTTC TCCTCCTCTC GCTCGTCTTG 
ATGCTCCCCC TGCTGGACCG GGAGGAGCGG GCCAAAAGAA GCGAGGCGGA GTACCTCGAG
ATGTTCCGCG AGGTGGTCAA CATCGTCAAG CAAAGCTACG TGGACAAGGT CGACGACAAG
AAGCTCATGG CCGGCGCCAT CAACGGGATG CTGGCGACGC TCGACCCGCA CAGCACCTAT
CTCCCCGCAA CCGACTATAC CGAGATGAAG GTGCACATGG CCGGCGCCTT CGGCGGCCTG
GGGATCGAGC TGGAGATGCG CAACGGCAAG CTGATGGTGA ACGCCCCCAT CGAGGATACC
CCCGCCTTCC GGGCCGGGAT CCAGTCCGGC GACCATATCT GGACCATCGA CGGCAAGCCG
ACCGCCGACC TCAACATCAA CCAATGCGTG AGCCGCATGC GCGGGACCCC CGGAACCTCG
GTCACCCTCG GCATAATGCG CGAGGGGAAG CCGTCCCCGC TCACCTTCCG CCTGGTGCGG
GCCATAATCA AGACCAAGAG CCTGAAAGGG AGGCTCCTCG AGCCGGGGTA CGGCTACATC
CGGATCGGCG AATTCCAGGA GCGCACCGGC GAGGACTTCG AAAAGGCACT GAAGACGCTT
GCCGCCGACA ACGGCCAGCC GCTATCGGGC CTCGTGCTGG ACCTCCGGTT CAACCCGGGG
GGGCTGGTGG ACCAGGCTTA CCGCGTCGCC AACCGCTTCA TCGGCGAGGG GCTTTCCTCA
GGGGTCATCG TCACCACCAA AGGGCGCGAC CCATCGATGG AACGAAGCCT GACCGCGACC
GTCGGCGACA AGGAGCCCCG CTACCCGATC GTGGTGCTCA TAAACGGCGG CAGCGCCAGC
GCCTCCGAGA TCGTGGCGGG CGCGCTGCAG GACCAAAAGA GAGCGGTCAT CATGGGGACC
CAGAGCTTCG GCAAGGGAAG CGTCCAGTCG GTGATGACGC TCGACAACGG CGACGGCCTG
AAGCTCACCA CGGCCCGCTA CTACACCCCC AGCGGGCGTT CCATCCAGGC CAAGGGGATC
ACACCCGACA TCGTGGTGGA GTTTGCCAAG CCCGCCCCCC CCGCGAAAGA CAAGCAAAAG
GGTGAGAAGG AACTGGAGAT CCGCGAGCAG GATCTGGACG GGCACATGGA CCAGGCTCCG
GCACCGACGC GCCCGGCGAA TCCGCACCAG GCTCCCCCTC CCTCTCCGAG CCTAAAGCCG
AGCGGCAAGG AGGTGAAAGA GCAGGACCTT CTGAAAGCTG ACAACCAGCT GGCCCGGGCG
CTCGACCTTC TGAAGGGAGT GAACCTGCTG CAAGCGAGCG GCCGGCGTTG A
 
Protein sequence
MSKKVFAAFA VIFLLLSLVL MLPLLDREER AKRSEAEYLE MFREVVNIVK QSYVDKVDDK 
KLMAGAINGM LATLDPHSTY LPATDYTEMK VHMAGAFGGL GIELEMRNGK LMVNAPIEDT
PAFRAGIQSG DHIWTIDGKP TADLNINQCV SRMRGTPGTS VTLGIMREGK PSPLTFRLVR
AIIKTKSLKG RLLEPGYGYI RIGEFQERTG EDFEKALKTL AADNGQPLSG LVLDLRFNPG
GLVDQAYRVA NRFIGEGLSS GVIVTTKGRD PSMERSLTAT VGDKEPRYPI VVLINGGSAS
ASEIVAGALQ DQKRAVIMGT QSFGKGSVQS VMTLDNGDGL KLTTARYYTP SGRSIQAKGI
TPDIVVEFAK PAPPAKDKQK GEKELEIREQ DLDGHMDQAP APTRPANPHQ APPPSPSLKP
SGKEVKEQDL LKADNQLARA LDLLKGVNLL QASGRR