Gene GM21_3569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3569 
Symbol 
ID8138942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4148848 
End bp4150194 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content64% 
IMG OID644871189 
ProductPeptidase M23 
Protein accessionYP_003023348 
Protein GI253702159 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones129 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACCT CGGCAATCCT CGGGTTATTT GTCTTGTTTG TCGCAATCGC GGCAGGGATC 
GGGATCTATT ACTTCGTCGA CACCTCCGGG CCGGCCCTGG CGCTTTCCCG TCGCCCGGGT
CCTATAGCCT CCAGGGCAGA CCTGATCCTG ACCCTGCGGG ACCGGGCTTC AGGCCTCAAG
TCCCTCAGCG TGCAGGCGGT GCAGGGAGAA AAGAGCTTCG GCATCCTGAC CAGGGAATAC
ACCGCAGGGA CCACCGACGC GAAGGAGACC TTCCGGCTCC CGCCCCCCCC CGGACTGAAA
GAGGGCCCGG TGACGCTGCG GATATCGGCT GCCGACCGCT CCGTGTTCAG GTTCGGCTCG
GGCAACAGCA CCTCCGTCGC GCTGGAGTTC GTGGTGCAGA ACAAGCCCCC TGTGGTCTCC
GTGCTCAGCA CCGCGCACAA CGTCTCGCCG GGAGGCTCCG CCCTGGCGGC CTACACGCTG
AACCGGGACG CGGTCAAGAC CGGCGTCACC TTCGCGGACA GGTTCTATCC AGGCTATAAG
CAGCCCGAGG GTTACTACGC CTCCCTGTTC CCGTTCCCCT ACGACGTCCC CCCGGAGCGT
TTCATCCCCA AGGTCTTCGC GGTGGACCAG GCGGGCAACG AGCGGTTTAC CGGGATCTAC
TACCGGGTCC TGGCCAAATC CTTCCCCAAG GACCGCATCG AGCTGACCGA CGCCTTCCTG
GAGAAGGTCT TCACCGAATT CAAGGACCGC TACCCCCAGA TAACGAACCC GCTCGAGCTG
TACCTGAAGG TGAACCGGGA GGTGCGGCAA AGCGACGCGA AGATCCTGCA GCAGTGCAGC
CTGAAAACCT CCCCCACCCC TCTTTGGGAG GGGGACTTCA TGCGCCTCCC CAACTCCGCC
CCGCGCGGTA CCTTCAACCA GTTGCGCAGC TACTATTACC AGGGGAAAGA GGTGGACCAG
CAGCATCACC TGGGAATCGA CCTGGCCTCG CTCTCCCACG CCAAGGTCCC CGCGGCCAAC
CGAGGCAAGG TGGTATATGC CGACGACCTG GGGATCTACG GCCAGTGCAT CATCATTGAC
CACGGGATGG GGCTGCAGAG CCTGTACGGC CACCTGAGCC GGATCGGCGT GAAGGAAGGG
GACGAGGTGA AAAAAGGGGA CACCATCGGC GACACCGGGG ACACCGGGCT TGCCGGCGGG
GACCATCTGC ATTTCGGCGT GGTGGTGTCG GGCCAGGAGG TGAACCCGAT CGAATGGTGG
GACCCGTCCT GGATCAAGAA CAACGTCACG GACAAGTTGA AGGAAGCAAG GGACGCCGCG
GCTGCCGCCG CCGGGACCGC GAAGTAG
 
Protein sequence
MRTSAILGLF VLFVAIAAGI GIYYFVDTSG PALALSRRPG PIASRADLIL TLRDRASGLK 
SLSVQAVQGE KSFGILTREY TAGTTDAKET FRLPPPPGLK EGPVTLRISA ADRSVFRFGS
GNSTSVALEF VVQNKPPVVS VLSTAHNVSP GGSALAAYTL NRDAVKTGVT FADRFYPGYK
QPEGYYASLF PFPYDVPPER FIPKVFAVDQ AGNERFTGIY YRVLAKSFPK DRIELTDAFL
EKVFTEFKDR YPQITNPLEL YLKVNREVRQ SDAKILQQCS LKTSPTPLWE GDFMRLPNSA
PRGTFNQLRS YYYQGKEVDQ QHHLGIDLAS LSHAKVPAAN RGKVVYADDL GIYGQCIIID
HGMGLQSLYG HLSRIGVKEG DEVKKGDTIG DTGDTGLAGG DHLHFGVVVS GQEVNPIEWW
DPSWIKNNVT DKLKEARDAA AAAAGTAK