Gene EcSMS35_0187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0187 
SymbolrseP 
ID6146793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp205728 
End bp207080 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID641615088 
Productzinc metallopeptidase RseP 
Protein accessionYP_001742304 
Protein GI170683080 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000756829 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGTT TTCTCTGGGA TTTGGCTTCG TTCATCGTTG CACTGGGTGT ACTTATCACC 
GTGCATGAAT TTGGTCATTT CTGGGTTGCC CGGCGTTGTG GTGTTCGCGT TGAGCGTTTC
TCAATAGGGT TTGGTAAGGC GCTCTGGCGG CGAACTGATA AGCTCGGCAC CGAATATGTT
ATCGCCCTGA TCCCGTTGGG CGGTTATGTC AAAATGCTGG ATGAGCGCGC AGAACCGGTC
GTTCCGGAAC TCCGCCACCA TGCCTTCAAT AATAAATCTG TCGGCCAACG AGCGGCGATT
ATTGCCGCAG GTCCGGTTGC AAACTTCATT TTTGCTATCT TTGCCTACTG GCTGGTTTTT
ATTATTGGTG TGCCTGGCGT ACGTCCGGTG GTTGGTGAAA TAGCAGCCAA TTCGATAGCT
GCGGAAGCAC AAATTGCACC AGGTACGGAA CTAAAAGCCG TAGATGGTAT CGAAACGCCT
GATTGGGATG CCGTGCGTTT GCAGTTGGTC GATAAAATTG GCGATGAAAG CACCACCATT
ACGGTAGCGC CATTTGGCAG CGACCAACGG CGGGATGTAA AGCTCGATTT ACGTCACTGG
GCGTTTGAGC CTGATAAAGA AGATCCGGTA ACTTCGCTGG GGATTCGTCC TCGTGGGCCG
CAAATTGAAC CTGTACTGGA AAATGTGCAG CCAAACTCGG CGGCAAGCAA GGCAGGTTTG
CAAGCAGGCG ACAGGATCGT TAAAGTCGAT GGTCAGCCCT TAACGCAGTG GGTGACCTTT
GTGATGCTTG TCCGGGATAA CCCGGGTAAA TCCTTAGCGT TAGAAATCGA AAGGCAGGGG
AGTCCTTTGT CTTTGACATT AATCCCGGAG AGTAAACCGG GTAATGGTAA AGCGATTGGT
TTTGTCGGTA TTGAGCCGAA AGTCATTCCT TTGCCAGATG AGTATAAAGT TGTACGCCAG
TATGGGCCGT TCAACGCCAT TGTCGAAGCC ACGGACAAAA CGTGGCAGCT GATGAAGCTG
ACGGTCAGTA TGCTGGGAAA ATTGATCACC GGTGATGTGA AACTGAACAA CCTCAGTGGG
CCGATCTCTA TCGCCAAGGG GGCTGGGATG ACAGCGGAAC TCGGGGTTGT TTATTACCTG
CCGTTTCTTG CGCTTATTAG CGTGAACTTA GGGATAATTA ACCTGTTTCC GTTGCCCGTA
CTTGACGGGG GGCATCTGCT GTTCCTTGCG ATCGAAAAGA TCAAGGGCGG ACCGGTATCC
GAGCGGGTTC AAGACTTTTG TTATCGCATT GGCTCGATTC TGCTGGTGCT GTTAATGGGG
CTTGCACTTT TCAATGATTT CTCTCGGTTA TGA
 
Protein sequence
MLSFLWDLAS FIVALGVLIT VHEFGHFWVA RRCGVRVERF SIGFGKALWR RTDKLGTEYV 
IALIPLGGYV KMLDERAEPV VPELRHHAFN NKSVGQRAAI IAAGPVANFI FAIFAYWLVF
IIGVPGVRPV VGEIAANSIA AEAQIAPGTE LKAVDGIETP DWDAVRLQLV DKIGDESTTI
TVAPFGSDQR RDVKLDLRHW AFEPDKEDPV TSLGIRPRGP QIEPVLENVQ PNSAASKAGL
QAGDRIVKVD GQPLTQWVTF VMLVRDNPGK SLALEIERQG SPLSLTLIPE SKPGNGKAIG
FVGIEPKVIP LPDEYKVVRQ YGPFNAIVEA TDKTWQLMKL TVSMLGKLIT GDVKLNNLSG
PISIAKGAGM TAELGVVYYL PFLALISVNL GIINLFPLPV LDGGHLLFLA IEKIKGGPVS
ERVQDFCYRI GSILLVLLMG LALFNDFSRL