Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2988 |
Symbol | |
ID | 8138331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3471156 |
End bp | 3472922 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870586 |
Product | oligoendopeptidase, pepF/M3 family |
Protein accession | YP_003022775 |
Protein GI | 253701586 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 1.68469e-23 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAAGCG ACCTCAATTG GGATACGACC CCGCTCTACC CCGCCCCTTC CGCACCGGAA CTTACCCGCG CCTTCGAAGA GGCGATGGGC CAGGTGGCCG CCTTTCGCGA GCGCTACCGC GGAAAGGTGT CGGCGCTCGA CGCCGGGGGG CTCCTGGAAG CGCTGAAGCA ATACGAGTCC CTGCAGGAGA AGCTGGCGAC GCCACAGCTC TACGCCCACC TTCTCTTCGC CGCGGACAGC GAGAACGACG AGCACAAGCG GCTGGCGCAG AAAGCCGAGG AGTTCGGCAA CGCCATGGGA AGGGAGCTGA TCTTCTTCGA CCTGGAACTG ATCCAGATGG AGGAGGAGCC CTTCGCGAAG CTTGCGGACG ACCTGCTGCT GGACAACTAC CGCCACTACC TGCAGGTTTT GCGCAAGTTC AAAAAGCACA CCCTGACCGA GCGGGAAGAG AGCCTGCTGG CGCAAAAGAG CCTGACCGGG GTGCAGGCGT TTTGCCGCCT CTTCGACGAG GTCTCCGCCT CGCTGCGCTA CACCTTCGAG ATGGAAGGAG AGACGCGGGA GATGACCGGC GAGGAGCTCC TTGCGCTCCT GCATCACCCG GACGCGGGGC TTAGGGAAAG GGCCTTCGGC ACCTTCCTCA AACGCCACGA AGAAAACGGC ATCATGTTCT CCGCCGTCTT CAACAACGTC GCCCTGGACC ACTCCCAGGA GATGGAGCTT AGAAACTACA GCCATCCCAT GGACCCTACC AACCTCGGCA ACGAGATCCC CAACGAGGTG GTCGAAAGCC TGATGCGGGT CTCCGAGGAG AACTACCCGC TGGCGCAGGA GTACTTCCGT CTCAAGGCGC AGCTTTTGGG CATCCCGCGC CTGAAGAACA CCGACGTCTA CGCCCCCATC ACCGAGAGCG ACCGGAAATA CAGCTTCGAG GAGGCGCGCG CCATGACGGT CGCGGCCTAC CGCGGCTTCT CCGACGAATT CGCAGAACTC GCCGATTCCT TCTTCACCGG CAAGAGGGTC GACGTCCTAC CCCGCCCCGG CAAGAGCGGA GGCGCCTTCT GCATGGGGAT GATCCCGTCG CTCCCCCCGT ACCTGCTTTT GAACTTCACC GGAAACCTGC GCGACGTATC GACCATGGCG CACGAGGTGG GGCACGGCAT CCACTACCTC TTGGCGCAGC GCCAGAGCAT GCTCAACTAC CATCCTCCGC TGCCTTTGGC CGAGACCGCG TCGGTCTTCG GAGAGATGCT TTTGACCCGG CAGCTCTTGG AGCAGGAGAC GGACGTGGAG GTGAAGAAAT CGCTTCTCTG CGCGAAGATC GAGGACATCA TCGCCACCAC CTTCCGCCAG AACGTCCTTA CCAGGTTCGA GGAACGGATG CACCTGGAGC GGGCGAATGG GCTTCTGACG GCAACAGAGC TCTGCGACAT GTGGTGGCAG GAGAACGCGA AGCTCTACGG CGACGCCGTC GAGATGATAG AGCCGTACCG CTACGGCTGG AGCTACATCT CCCACTTCAT CCACGCCCGC TTTTACTGCT ACTCCTACAC CTGCGCCGAG CTGGTGGTGC TCTCCTTGTT CCAGCGCTAC CTGAAGGAGC GCGAGAGCTT CGTCCCCGTC TACCGCGGCA TCCTTGCCGA CGGCGGTTCC AAGTCCCCCG GCGACACCCT CGCCCCGGGG GGGATCGTGT TCAGCGACCC GAGCTTCTGG CAGGGAGGGT ACGACCTCCT GGGCGACCTG ATCAAGGAGT TGAAGGCGCT GGTTTGA
|
Protein sequence | MQSDLNWDTT PLYPAPSAPE LTRAFEEAMG QVAAFRERYR GKVSALDAGG LLEALKQYES LQEKLATPQL YAHLLFAADS ENDEHKRLAQ KAEEFGNAMG RELIFFDLEL IQMEEEPFAK LADDLLLDNY RHYLQVLRKF KKHTLTEREE SLLAQKSLTG VQAFCRLFDE VSASLRYTFE MEGETREMTG EELLALLHHP DAGLRERAFG TFLKRHEENG IMFSAVFNNV ALDHSQEMEL RNYSHPMDPT NLGNEIPNEV VESLMRVSEE NYPLAQEYFR LKAQLLGIPR LKNTDVYAPI TESDRKYSFE EARAMTVAAY RGFSDEFAEL ADSFFTGKRV DVLPRPGKSG GAFCMGMIPS LPPYLLLNFT GNLRDVSTMA HEVGHGIHYL LAQRQSMLNY HPPLPLAETA SVFGEMLLTR QLLEQETDVE VKKSLLCAKI EDIIATTFRQ NVLTRFEERM HLERANGLLT ATELCDMWWQ ENAKLYGDAV EMIEPYRYGW SYISHFIHAR FYCYSYTCAE LVVLSLFQRY LKERESFVPV YRGILADGGS KSPGDTLAPG GIVFSDPSFW QGGYDLLGDL IKELKALV
|
| |