Gene GM21_2993 details

Gene Information

Locus tag: GM21_2993
Symbol:
ID: 8138336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3479840
End bp: 3481174
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 60%
IMG OID: 644870591
Product: carboxyl-terminal protease
Protein accession: YP_003022780
Protein GI: 253701591
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
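
The coordinate and length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 3479840
END_BP = 3481174


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1335, matching "Gene Length"
print(protein_length(length_bp))  # 444, matching "Protein Length"
```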


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 3.70163e-17
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCAAGG GCAGCAAATT GAAGAAGACA ACCCTGTTTG TCACAACCGT GTGCCTGCTG 
ACGCTCGTCA TCGGCTTCGG CATGCAACGC AGATGCGCCG CCCAGGGGGG CGGCAACGAT
TACCAGTCCA TCGAATTGTT CACGGACGTT CTGGCCATCG TCAAGAAGAG CTATGTAGAA
GAGGTCGACA CCAAGAAGCT CGTGTACGGT GCCATAAACG GCATGCTCAC CTCGCTGGAC
CCCCACAGTT CCTTCATGCC CCCTGAAACC TACAAGGAAA TGAAGATCGA CACCAAGGGC
TCCTTCGGCG GTCTCGGCAT TGAGATCACG GTCAAGGAAG GGATCCTCAC CGTCATCTCC
CCGATAGAGG ACACCCCCGC CTTCAAGGCC GGCATCAAGG CAGGCGACCA GATCCTGAAG
ATCGACGACA AGTTCACCAA GGACCTCACC ATCACCGATG CGGTGAAGAG GATGAGGGGG
GTCAAGGGGA CCAAGGTCAC CCTCACCATC ATGCGTGAAG GTTTCGACAA GACGAAGGAA
TTCGTGCTGG AGCGCGACAT CATCCAGGTC AAGAGCGTGA AGCACAAGGT GCTCGACGAC
GGCTACGGCT ATGTCAGGAT TGCTCAGTTC CAGGAGAAGA CCGACGACGA CCTGGAGAGG
GCGCTGCAGG CCCTTCAGGG CGAAAAGAAG CAGCTCAAGG GGCTGGTGCT CGACCTGCGC
AACGATCCGG GGGGGCTTTT GGACCAGGCG GTCCGGGTGA GCGAGCACTG GATCCCCGAA
GGGAAGCTGA TCGTTTATAC CGAGGGGCGC GAGAAGGATT CCCAGATGCG CTTCACCTCC
CGCAAAGGGC ACAAGCAGCC GGACTACCCG ATAGTGGTGC TGATCAACAG CGGTTCGGCG
AGCGCCTCGG AGATCGTCGC CGGCTGCCTG CAGGACCACA AGCGCGCCGT CGTCATGGGG
ACCCAGAGCT TCGGCAAGGG GAGCGTCCAG ACCATCATCC CTCTCCCCGA CAACTCGGGT
CTCAGGCTCA CCACCGCCAG GTACTTCACC CCCAGCGGCC GCTCCATCCA GGCTAAGGGG
ATCACCCCCG ATATCGTCGC CGAGAAGGTC GAACTTGCCG CCACCGGCGA GAAGCGTGAA
GGGATGCACA TCAGAGAGAA GGACCTCGAG AATCACTTCG AAGGGGACAA AAAGGAGGGG
GCCGAGGAGA AGAAGGATAA GCCCGCTCCC TACAAGACCG ACGAAATGAT CAAGAGCGAC
TCGCAGGTGC TGCGCGCGCT TGACCTCCTG AAAGGGTGGG AGATCCTGAA GACCATGGGC
AAACTCCCCT CCTGA
 
Protein sequence
MFKGSKLKKT TLFVTTVCLL TLVIGFGMQR RCAAQGGGND YQSIELFTDV LAIVKKSYVE 
EVDTKKLVYG AINGMLTSLD PHSSFMPPET YKEMKIDTKG SFGGLGIEIT VKEGILTVIS
PIEDTPAFKA GIKAGDQILK IDDKFTKDLT ITDAVKRMRG VKGTKVTLTI MREGFDKTKE
FVLERDIIQV KSVKHKVLDD GYGYVRIAQF QEKTDDDLER ALQALQGEKK QLKGLVLDLR
NDPGGLLDQA VRVSEHWIPE GKLIVYTEGR EKDSQMRFTS RKGHKQPDYP IVVLINSGSA
SASEIVAGCL QDHKRAVVMG TQSFGKGSVQ TIIPLPDNSG LRLTTARYFT PSGRSIQAKG
ITPDIVAEKV ELAATGEKRE GMHIREKDLE NHFEGDKKEG AEEKKDKPAP YKTDEMIKSD
SQVLRALDLL KGWEILKTMG KLPS
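
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, translate the DNA, and compute a GC fraction. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs only in the start codons it permits, and the sketch does not model those. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last sequence lines are used as input here.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listing.
first_line = "ATGTTCAAGG GCAGCAAATT GAAGAAGACA ACCCTGTTTG TCACAACCGT GTGCCTGCTG"
last_line = "AAACTCCCCT CCTGA"

print(translate(first_line))  # MFKGSKLKKTTLFVTTVCLL
print(translate(last_line))   # KLPS
```

Pasting the full gene sequence into `translate` should reproduce the listed protein, and `gc_fraction` on the same input can be compared with the GC content field in the record.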