Gene ECH74115_4477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4477 
Symbol 
ID6969978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4145750 
End bp4146745 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content57% 
IMG OID643388192 
Productpeptidase, U32 family 
Protein accessionYP_002272629 
Protein GI209398259 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTGC TCTGCCCTGC CGGAAATCTC CCGGCGCTTA AGGCGGCCAT CGAAAACGGC 
GCAGATGCTG TTTATATCGG GCTAAAAGAT GATACCAATG CCCGTCACTT CGCCGGCCTT
AACTTTACCG AGAAAAAATT GCAGGAAGCG GTGAGTTTTG TCCATCAACA TCGCCGCAAA
CTTCACATCG CGATTAACAC TTTTGCGCAT CCGGACGGTT ACGCCCGTTG GCAGCGTGCC
GTGGATATGG CGGCGCAGCT GGGTGCCGAC GCGCTGATCC TCGCCGACCT CGCCATGCTA
GAGTACGCCG CCGAGCGTTA CCCGCATATT GAGCGTCATG TGTCAGTGCA GGCTTCGGCG
ACCAATGAAG AGGCGATTAA CTTTTATCAT CGCCATTTTG ACGTTGCTCG CGTGGTGCTG
CCGCGCGTGT TGTCGATTCA TCAGGTGAAA CAACTGGCAC GGGTCACACC TGTCCCGCTG
GAAGTCTTTG CTTTCGGCAG CCTGTGCATT ATGTCGGAAG GTCGTTGCTA TCTGTCGTCG
TATCTGACGG GTGAGTCGCC CAACACCGTG GGCGCGTGTT CTCCGGCCCG TTTCGTGCGC
TGGCAGCAAA CGCCGCAGGG GCTGGAATCC CGCCTGAACG AAGTGCTGAT CGACCGTTAT
CAGGACGGCG AAAACGCAGG TTATCCGACG CTATGTAAAG GGCGTTATCT GGTGGACGGC
GAGCGCTATC ACGCGCTGGA AGAACCAACC AGTCTCAATA CCCTGGAACT GCTGCCGGAG
TTAATGGCGG CGAATATTGC TTCGGTGAAA ATTGAAGGCC GCCAACGTAG CCCGGCGTAT
GTCAGCCAGG TGGCGAAAGT CTGGCGTCAG GCTATCGACC GTTGTAAGGC CGATCCGCAA
AACTTCGTAC CGCAAAGCGC GTGGATGGAG ACGCTCGGGT CGATGTCCGA AGGCACGCAG
ACCACCCTTG GCGCGTATCA CCGTAAATGG CAGTGA
 
Protein sequence
MELLCPAGNL PALKAAIENG ADAVYIGLKD DTNARHFAGL NFTEKKLQEA VSFVHQHRRK 
LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPHI ERHVSVQASA
TNEEAINFYH RHFDVARVVL PRVLSIHQVK QLARVTPVPL EVFAFGSLCI MSEGRCYLSS
YLTGESPNTV GACSPARFVR WQQTPQGLES RLNEVLIDRY QDGENAGYPT LCKGRYLVDG
ERYHALEEPT SLNTLELLPE LMAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDRCKADPQ
NFVPQSAWME TLGSMSEGTQ TTLGAYHRKW Q