Gene EcHS_A0178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0178 
SymbolrseP 
ID5594330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp194703 
End bp196055 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID640919365 
Productzinc metallopeptidase RseP 
Protein accessionYP_001456959 
Protein GI157159641 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00000000111428 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGTT TTCTCTGGGA TTTGGCTTCG TTCATCGTTG CACTGGGTGT ACTTATCACC 
GTGCATGAAT TTGGTCATTT CTGGGTTGCC CGGCGTTGTG GTGTTCGCGT TGAGCGTTTC
TCAATAGGGT TTGGTAAGGC GCTCTGGCGG CGAACTGATA AGCTCGGCAC CGAATATGTT
ATCGCCCTGA TCCCGCTGGG CGGTTATGTC AAAATGCTGG ATGAGCGCGC AGAACCGGTC
GTTCCGGAAC TCCGCCACCA TGCCTTCAAT AATAAATCTG TTGGCCAACG AGCGGCGATT
ATTGCCGCAG GTCCGGTTGC AAACTTCATT TTTGCTATCT TAGCCTACTG GCTGGTTTTT
ATTATTGGTG TGCCTGGCGT ACGTCCGGTG GTTGGTGAAA TAGCAGCCAA TTCGATAGCT
GCGGAAGCAC AAATTGCACC AGGTACGGAA CTAAAAGCCG TAGATGGTAT CGAAACGCCT
GATTGGGATG CCGTGCGTTT GCAGTTGGTC GATAAAATTG GCGATGAAAG CACCACCATT
ACAGTAGCGC CATTTGGCAG CGACCAACGG CGGGATGTAA AGCTCGATTT ACGTCACTGG
GCGTTTGAGC CTGATAAAGA AGATCCGGTA TCTTCGCTGG GGATTCGTCC TCGTGGGCCG
CAAATTGAAC CTGTACTGGA AAATGTGCAG CCAAACTCGG CGGCAAGCAA GGCAGGTTTG
CAAGCAGGCG ACAGGATCGT TAAAGTCGAT GGTCAGCCCT TAACGCAGTG GGTGACCTTT
GTGATGCTTG TCCGGGATAA CCCGGGTAAA TCCTTAGCGT TAGAAATCGA AAGGCAGGGG
AGTCCCTTGT CTTTGACATT AATCCCGGAG AGTAAACCGG GTAATGGTAA AGCGATTGGT
TTTGTCGGTA TTGAGCCGAA AGTCATTCCT TTGCCAGATG AGTATAAAGT TGTACGCCAG
TATGGGCCGT TCAACGCCAT CGTCGAAGCC ACGGACAAAA CGTGGCAGCT GATGAAGCTG
ACGGTCAGTA TGCTGGGAAA ATTGATCACC GGTGATGTGA AACTGAACAA CCTCAGTGGG
CCGATCTCTA TCGCCAAGGG GGCTGGGATG ACAGCGGAAC TCGGGGTAGT TTATTACCTG
CCGTTTCTTG CGCTTATTAG CGTGAACTTA GGGATAATTA ACCTGTTTCC GTTGCCCGTA
CTTGACGGGG GGCATCTGCT GTTCCTTGCG ATCGAAAAGA TCAAGGGCGG ACCGGTATCC
GAGCGGGTTC AAGACTTTTG TTATCGCATT GGCTCGATTC TGCTGGTGCT GTTAATGGGG
CTTGCACTTT TCAATGATTT CTCTCGGTTA TGA
 
Protein sequence
MLSFLWDLAS FIVALGVLIT VHEFGHFWVA RRCGVRVERF SIGFGKALWR RTDKLGTEYV 
IALIPLGGYV KMLDERAEPV VPELRHHAFN NKSVGQRAAI IAAGPVANFI FAILAYWLVF
IIGVPGVRPV VGEIAANSIA AEAQIAPGTE LKAVDGIETP DWDAVRLQLV DKIGDESTTI
TVAPFGSDQR RDVKLDLRHW AFEPDKEDPV SSLGIRPRGP QIEPVLENVQ PNSAASKAGL
QAGDRIVKVD GQPLTQWVTF VMLVRDNPGK SLALEIERQG SPLSLTLIPE SKPGNGKAIG
FVGIEPKVIP LPDEYKVVRQ YGPFNAIVEA TDKTWQLMKL TVSMLGKLIT GDVKLNNLSG
PISIAKGAGM TAELGVVYYL PFLALISVNL GIINLFPLPV LDGGHLLFLA IEKIKGGPVS
ERVQDFCYRI GSILLVLLMG LALFNDFSRL