Gene GM21_2988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2988 
Symbol 
ID8138331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3471156 
End bp3472922 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content63% 
IMG OID644870586 
Productoligoendopeptidase, pepF/M3 family 
Protein accessionYP_003022775 
Protein GI253701586 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.68469e-23 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAGCG ACCTCAATTG GGATACGACC CCGCTCTACC CCGCCCCTTC CGCACCGGAA 
CTTACCCGCG CCTTCGAAGA GGCGATGGGC CAGGTGGCCG CCTTTCGCGA GCGCTACCGC
GGAAAGGTGT CGGCGCTCGA CGCCGGGGGG CTCCTGGAAG CGCTGAAGCA ATACGAGTCC
CTGCAGGAGA AGCTGGCGAC GCCACAGCTC TACGCCCACC TTCTCTTCGC CGCGGACAGC
GAGAACGACG AGCACAAGCG GCTGGCGCAG AAAGCCGAGG AGTTCGGCAA CGCCATGGGA
AGGGAGCTGA TCTTCTTCGA CCTGGAACTG ATCCAGATGG AGGAGGAGCC CTTCGCGAAG
CTTGCGGACG ACCTGCTGCT GGACAACTAC CGCCACTACC TGCAGGTTTT GCGCAAGTTC
AAAAAGCACA CCCTGACCGA GCGGGAAGAG AGCCTGCTGG CGCAAAAGAG CCTGACCGGG
GTGCAGGCGT TTTGCCGCCT CTTCGACGAG GTCTCCGCCT CGCTGCGCTA CACCTTCGAG
ATGGAAGGAG AGACGCGGGA GATGACCGGC GAGGAGCTCC TTGCGCTCCT GCATCACCCG
GACGCGGGGC TTAGGGAAAG GGCCTTCGGC ACCTTCCTCA AACGCCACGA AGAAAACGGC
ATCATGTTCT CCGCCGTCTT CAACAACGTC GCCCTGGACC ACTCCCAGGA GATGGAGCTT
AGAAACTACA GCCATCCCAT GGACCCTACC AACCTCGGCA ACGAGATCCC CAACGAGGTG
GTCGAAAGCC TGATGCGGGT CTCCGAGGAG AACTACCCGC TGGCGCAGGA GTACTTCCGT
CTCAAGGCGC AGCTTTTGGG CATCCCGCGC CTGAAGAACA CCGACGTCTA CGCCCCCATC
ACCGAGAGCG ACCGGAAATA CAGCTTCGAG GAGGCGCGCG CCATGACGGT CGCGGCCTAC
CGCGGCTTCT CCGACGAATT CGCAGAACTC GCCGATTCCT TCTTCACCGG CAAGAGGGTC
GACGTCCTAC CCCGCCCCGG CAAGAGCGGA GGCGCCTTCT GCATGGGGAT GATCCCGTCG
CTCCCCCCGT ACCTGCTTTT GAACTTCACC GGAAACCTGC GCGACGTATC GACCATGGCG
CACGAGGTGG GGCACGGCAT CCACTACCTC TTGGCGCAGC GCCAGAGCAT GCTCAACTAC
CATCCTCCGC TGCCTTTGGC CGAGACCGCG TCGGTCTTCG GAGAGATGCT TTTGACCCGG
CAGCTCTTGG AGCAGGAGAC GGACGTGGAG GTGAAGAAAT CGCTTCTCTG CGCGAAGATC
GAGGACATCA TCGCCACCAC CTTCCGCCAG AACGTCCTTA CCAGGTTCGA GGAACGGATG
CACCTGGAGC GGGCGAATGG GCTTCTGACG GCAACAGAGC TCTGCGACAT GTGGTGGCAG
GAGAACGCGA AGCTCTACGG CGACGCCGTC GAGATGATAG AGCCGTACCG CTACGGCTGG
AGCTACATCT CCCACTTCAT CCACGCCCGC TTTTACTGCT ACTCCTACAC CTGCGCCGAG
CTGGTGGTGC TCTCCTTGTT CCAGCGCTAC CTGAAGGAGC GCGAGAGCTT CGTCCCCGTC
TACCGCGGCA TCCTTGCCGA CGGCGGTTCC AAGTCCCCCG GCGACACCCT CGCCCCGGGG
GGGATCGTGT TCAGCGACCC GAGCTTCTGG CAGGGAGGGT ACGACCTCCT GGGCGACCTG
ATCAAGGAGT TGAAGGCGCT GGTTTGA
 
Protein sequence
MQSDLNWDTT PLYPAPSAPE LTRAFEEAMG QVAAFRERYR GKVSALDAGG LLEALKQYES 
LQEKLATPQL YAHLLFAADS ENDEHKRLAQ KAEEFGNAMG RELIFFDLEL IQMEEEPFAK
LADDLLLDNY RHYLQVLRKF KKHTLTEREE SLLAQKSLTG VQAFCRLFDE VSASLRYTFE
MEGETREMTG EELLALLHHP DAGLRERAFG TFLKRHEENG IMFSAVFNNV ALDHSQEMEL
RNYSHPMDPT NLGNEIPNEV VESLMRVSEE NYPLAQEYFR LKAQLLGIPR LKNTDVYAPI
TESDRKYSFE EARAMTVAAY RGFSDEFAEL ADSFFTGKRV DVLPRPGKSG GAFCMGMIPS
LPPYLLLNFT GNLRDVSTMA HEVGHGIHYL LAQRQSMLNY HPPLPLAETA SVFGEMLLTR
QLLEQETDVE VKKSLLCAKI EDIIATTFRQ NVLTRFEERM HLERANGLLT ATELCDMWWQ
ENAKLYGDAV EMIEPYRYGW SYISHFIHAR FYCYSYTCAE LVVLSLFQRY LKERESFVPV
YRGILADGGS KSPGDTLAPG GIVFSDPSFW QGGYDLLGDL IKELKALV