Gene EcolC_3484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3484 
Symbol 
ID6068364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3801895 
End bp3803247 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID641602900 
Productzinc metallopeptidase RseP 
Protein accessionYP_001726425 
Protein GI170021471 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00175231 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0731606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGTT TTCTCTGGGA TTTGGCTTCG TTCATCGTTG CACTGGGTGT ACTTATCACC 
GTGCATGAAT TTGGTCATTT CTGGGTTGCC CGGCGTTGTG GTGTTCGCGT TGAGCGTTTC
TCAATAGGGT TTGGTAAGGC GCTCTGGCGG CGAACTGATA AGCTCGGCAC CGAATATGTT
ATCGCCCTGA TCCCGCTGGG CGGTTATGTC AAAATGCTGG ATGAGCGCGC AGAACCGGTC
GTTCCGGAAC TCCGCCACCA TGCCTTCAAT AATAAATCTG TTGGCCAACG AGCGGCGATT
ATTGCCGCAG GTCCGGTTGC AAACTTCATT TTTGCTATCT TTGCCTACTG GCTGGTTTTT
ATTATTGGTG TGCCTGGCGT ACGTCCGGTG GTTGGTGAAA TAGCAGCCAA TTCGATAGCT
GCGGAAGCAC AAATTGCACC AGGTACGGAA CTAAAAGCCG TAGATGGTAT CGAAACGCCT
GATTGGGATG CCGTGCGTTT GCAGTTGGTC GATAAAATTG GCGATGAAAG CACCACCATT
ACAGTAGCGC CATTTGGCAG CGACCAACGG CGGGATGTAA AGCTCGATTT ACGTCACTGG
GCGTTTGAGC CTGATAAAGA AGATCCGGTA TCTTCGCTGG GGATTCGTCC TCGTGGGCCG
CAAATTGAAC CTGTACTGGA AAATGTGCAG CCAAACTCGG CGGCAAGCAA GGCAGGTTTG
CAAGCAGGCG ACAGGATCGT TAAAGTCGAT GGTCAGCCCT TAACGCAGTG GGTGACCTTT
GTGATGCTTG TCCGGGATAA CCCGGGTAAA TCCTTAGCGT TAGAAATCGA AAGGCAGGGG
AGTCCCTTGT CTTTGACATT AATCCCGGAG AGTAAACCGG GTAATGGTAA AGCGATTGGT
TTTGTCGATA TTGAGCCGAA AGTCATTCCT TTGCCAGATG AGTATAAAGT TGTACGCCAG
TATGGGCCGT TCAACGCCAT CGTCGAAGCC ACGGACAAAA CGTGGCAGCT GATGAAGCTG
ACGGTCAGTA TGCTGGGAAA ATTGATCACC GGTGATGTGA AACTGAACAA CCTCAGTGGG
CCGATCTCTA TCGCCAAGGG GGCTGGGATG ACAGCGGAAC TCGGGGTAGT TTATTACCTG
CCGTTTCTTG CGCTTATTAG CGTGAACTTA GGGATAATTA ACCTGTTTCC GTTGCCCGTA
CTTGACGGGG GGCATCTGCT GTTCCTTGCG ATCGAAAAGA TCAAGGGCGG ACCGGTATCC
GAGCGGGTTC AAGACTTTTG TTATCGCATT GGCTCGATTC TGCTGGTGCT GTTAATGGGG
CTTGCACTTT TCAATGATTT CTCTCGGTTA TGA
 
Protein sequence
MLSFLWDLAS FIVALGVLIT VHEFGHFWVA RRCGVRVERF SIGFGKALWR RTDKLGTEYV 
IALIPLGGYV KMLDERAEPV VPELRHHAFN NKSVGQRAAI IAAGPVANFI FAIFAYWLVF
IIGVPGVRPV VGEIAANSIA AEAQIAPGTE LKAVDGIETP DWDAVRLQLV DKIGDESTTI
TVAPFGSDQR RDVKLDLRHW AFEPDKEDPV SSLGIRPRGP QIEPVLENVQ PNSAASKAGL
QAGDRIVKVD GQPLTQWVTF VMLVRDNPGK SLALEIERQG SPLSLTLIPE SKPGNGKAIG
FVDIEPKVIP LPDEYKVVRQ YGPFNAIVEA TDKTWQLMKL TVSMLGKLIT GDVKLNNLSG
PISIAKGAGM TAELGVVYYL PFLALISVNL GIINLFPLPV LDGGHLLFLA IEKIKGGPVS
ERVQDFCYRI GSILLVLLMG LALFNDFSRL