Gene ECH74115_0252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0252 
Symbol 
ID6970670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp267063 
End bp271295 
Gene Length4233 bp 
Protein Length1410 aa 
Translation table11 
GC content61% 
IMG OID643384322 
ProductRHS Repeat family protein 
Protein accessionYP_002268838 
Protein GI209400273 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.661469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGGAA AACCGGCGGC GCGGCAGGGT GACATGACCC GCAAGGGACT GGATATTGTG 
CAGGGTTCAG CAGGGGTGCT GATAGGTGCG CCGACGGGCG TGGCCTGCTC GGTGTGTCCG
GGAGGGATTA CCTATGCTAA CCCGGTGAAC CCGGTGCTGG GTGCGAAGGT GCTGCCGGGC
GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTTATTC TTTCCCGCGC CTACAGCAGC
TACCGGACCA GAACACCCGC GCCGGTGGGG GTGTTTGGTC CCGGCTGGAA AGCGCCCTTC
GATATCCGCT TACAGATACG CGATGAAGGC CTGATACTCA ACGACAACGG CGGCAGGAGC
ATTCACTTTG AGCCGCTGTT TCCCGGCGAG ATAAGCTACA GCCGCAGCGA GTCGTTCTGG
CTGGCGCGGG GCGGGGTGGC GGAGCAGCAC AGTTCGCAGC CGCTAAGCGC GCTCTGGCAG
GTGCTGCCGG AAGATGTTCG CCTGAGTCCG CATATGTACC TGGCGACAAA CAGCCTTCAG
GGGCCGTGGT GGATACTCAA CTGGCCGGAG CGGGTGCCGG GGGCGGACGA GGTGCTGCCG
CCGGAGCCGC CCGCATACCG GGTGCTGACG GGCGTGGTGG ATGGCTTCGG GCGGACGCTG
GCCTTTCACC GGGCGGCGGA GGGTGATGTG GCGGGCGCGG TGACGGGGGT GACGGACGGC
GCGGGCCGCC GTTTTCACCT GGTGCTGACC ACACAGGCAC AGCGGGCGGA AGTATTCCGT
AAACAGCGCG CCACGTCTTT ATCTTCCCCT GCCGGTCCCC GTTCTGCTTC CTCTTCTCTG
GTTTTCCCCG ACACGCTGCC CGCCGGTACA GAATACGGTG CCGATAACGG TATCCGGCTG
GAGGCGGTAT GGCTGACACA CGACCCGGCA TACCCGGATG AACTGCCCGC CGCGCCGCTG
GCGCGCTACA CGTACACGGC CAGCGGAGAA CTGCGGGCGG TGTATGACCG CAGCGGGACG
CAGGTGCGCG GGTTTGCTTA TGATGCGGAG CACGCCGGGC GGATGGTGGC GCACCATTAT
GCGGGTAGGC CGGAGAGCCG CTACCGGTAT GATGATACCG GCCGGGTGAC GGAGCTGGTC
AACCCGGAGG GGCTGGACTA CCGCTTTGAG TACGGGCAGG ACCGTGTGAC CATCACGGAC
AGCCTGAACC GGCGGGAGGT GCTGTACACG GAAGGCGAGG GTGGCCTGAA ACGTGTTGTG
AAGAAGGAAC ATGCGGACGG GAGCATCACC CGCAGCGAGT ATGATGAGGC GGGGAGGCTG
AAGGCACAGA CGGATGCGGC GGGACGGCGG ACGGAGTACA GCCTGCATAT GGCGTCGGGT
GCGGTGACAG CGGTGACGGG GCCGGACGGC AGGACGGTGC GGTATGGCTA TAACAGCCAG
CGGCAGGTGA CGTCAGTGAC GTACCCGGAC GGGCTGCGCA GCAGCCGGGA GTATGATGAG
AAGGGAAGGC TGGCGGCGGA GACCTCGCGC AGCGGAGAGA CGACGCGGTA CAGCTATGAT
GACCCGGCGA GTGAGCTGCC GACAGGGATA CAGGACGCGA CGGGCAGTAC AAAACAGATG
GCATGGAGCC GTTACGGTCA GCTGCTGACC TTTACGGACT GCTCGGGGTA CACGACGCGG
TATGAGTATG ACCGGTACGG TCAACAAATC GCCGTTCACC GGGAAGAAGG CATCAGCACT
TACAGCAGTT ATAACCCGCG TGGCCAACTG GTCAGTCAGA AGGATGCGCA GGGGCGTGAA
ACCCGTTATG AGTACAGCGC CGCAGGCGAC CTCACCGCTA TCGTTGCCCC GGACGGCAGC
CGCAGTGAGA TACAGTATGA TGCGTGGGGA AAGGCCGTCA GCACCACGCA GGGCGGTCTG
ACGCGCAGCA TGGGGTATGA CGCTGCCGGG CGCATCACCG TGCTGACCAA CGAGAACGGC
AGCCAGTCCA CGTTCCGGTA TGACCCGGTG GACAGGCTGA CTGAACAGCG CGGTTTTGAC
GGCCGGACGC AACGTTACCA CTATGACCTG ACCGGAAAAC TCACGCAGAG TGAAGACGAG
GGGCTTGTCA CCCTCTGGCA CTACGATGCG TCGGACCGCA TCACGCACCG GACGGTGAAC
GGCGACCCGG CAGAGCAGTG GCAGTATGAT GAGCACGGGT GGCTAACCAC CCTCAGCCAT
ACCAGTGAAG GCCACCGGGT GTCGGTCCAC TACGGCTATG ACGATAAAGG CCGCCTGACG
GGTGAACGGC AGACGGTGGA GAACCCGGAG ACGGGGGAGA TGCTGTGGGA GCATGAGACG
GGGCACGCGT ACAGCGAACA GGGGCTGGCG ACCCGTCAGG AGCCGGACGG TCTGCCGCCG
GTAGAGTGGC TGACGTATGG CAGCGGTTAT CTTGCGGGGA TGAAGCTGGG CGGAACGCCA
CTGGTCGAGT ACATGCGGGA CCGGCTGCAC CGTGAGACGG CCCGCAGCTT CGGCGGGGAG
GCATATGAAC TTGCCACCGC CTGGAATACC AGCGGCCAGC TCCGGAGCAG GCACCTGAAC
CTTCCGCAGC TTGACCGTGA CTACGACTGG AACGACAACG GACAGCTAAT CCGCATCAGC
GGCCCGCAGG AGAGCCGGGA GTACCGTTAC AGTGACACGG GAAGGCTGAC GGGCGTCCAC
ACCACGGCAG CGAACCTGGA TATCGATATC CCGTATGCAA CGGACCCGGC AGGAAACCGG
CTGCCGGACC CGGAACTGCA TCCGGACAGC ACGCTCACGG CGTGGCCGGA TAACCGCATC
GCGGAAGATG CGCACTATAT CTATCGCTAC GATGAATACG GCAGGCTGGC GGAGAAGACG
GACCGTATCC CGGAAGGGGT TATCCGGATG CACGACGAGC GCACCCACCA CTATCACTAC
GACAGCCAGC ACCGCCTGGT GTTCCACACG CGGATACAGC ACGGCGAACC ACAGGTGGAG
AGCCGGTACC TCTATGACCC GCTGGGCCGC CGGACGGGAA AACGGGTGTG GCGGCGGGAG
CGTGACCTGA CGGGGTGGAT GTCGCTGTCG CGTAAACCGG AGGAGACCTG GTACGGGTGG
GACGGTGACA GGCTGACCAC TGTACAGACC CAACAGACAA GAATCCAGAC GGTATACCAG
CCGGGAAGCT TCACGCCGCT CCTGAGAATC GAAACAGAGA ATGGTGAACA GGCGAAGGCG
CGGCACCGTA GCCTGGCGGA GGTGTTGCAG GAGGACACGG GTGTGACGCT ACCGGCGGAG
CTGGCGGTGA TGCTGGGGAG GCTGGAGCGG GAACTGCGGC AGGGCAGCGT GAGTGAAGAA
AGCCAGCAGT GGCTTGCGCA GTGCGGGCTG ACGGCGGAAC AGATGGCCGC GCAGCTGGAG
GCGGAATATA TCCCGGAGCG GAAACTTCAT CTTTACCACT GCGACCACCG GGGACTGCCG
CAGGCGCTCA TCAGCCCGGA AGGGGAAACG GCGTGGCAGG GGGAGTATGA CGAGTGGGGA
AACCTGCTGG GCGAAACCAG CGCGCAGCAC CTGCAACAGC CGTACCGTCT GCCGGGACAG
CAGTATGATG AGGAGTCGGG GCTGTATTAC AACCGTCACC GGTACTATGA CCCGCTACAG
GGGAGGTATA TCACCCAGGA TCCAATAGAT ATAAAAGGAG GATGGAATTT ATATTCTTAT
GCGCTTAATC CGGTAAGTTG GATTGACCCA TTAGGATTGA CGCAGTGCGA TTCCGAAGGG
TGTAATAATG ATATATTATT TACAGGAGGA AGTGGTCCCG ATAATAAAAT ACTTAATGAA
TTAGGTCCGA GAGATGGCAT TGACGGTTTG GGTTCACAAA ACATGAAAAT GTATAGTGGG
CTATTAGGGG GCGATATTTT AAAACCGGGG ATTCTGGGCG GTTTGACGCT TGGTAGTGTG
ACACAACGAC CATCGCGAAC GGCTGAGGAG GAGGTGCAAG CTCAAATAGA ATATTTAGCA
TATAAGAAGC GTTGTGAGCA AAAACCTGAT AGTAATTTAA GTCGTTGTGC TGCCGCGATC
TTTCAAAAGA GCAGAAAAGA AGATTGTTTA AGAATGAGAC AAGAATGGGA TGATAAATGG
TGGCCTGGCA AACATGCGGA TGAAATAAAA AATGTGAAAG CATCGTTGAA AAACACACTT
ATTGATGTTA AAAAAAAGAT GCATGCCAAC TAG
 
Protein sequence
MGGKPAARQG DMTRKGLDIV QGSAGVLIGA PTGVACSVCP GGITYANPVN PVLGAKVLPG 
ETDLALPGPL PFILSRAYSS YRTRTPAPVG VFGPGWKAPF DIRLQIRDEG LILNDNGGRS
IHFEPLFPGE ISYSRSESFW LARGGVAEQH SSQPLSALWQ VLPEDVRLSP HMYLATNSLQ
GPWWILNWPE RVPGADEVLP PEPPAYRVLT GVVDGFGRTL AFHRAAEGDV AGAVTGVTDG
AGRRFHLVLT TQAQRAEVFR KQRATSLSSP AGPRSASSSL VFPDTLPAGT EYGADNGIRL
EAVWLTHDPA YPDELPAAPL ARYTYTASGE LRAVYDRSGT QVRGFAYDAE HAGRMVAHHY
AGRPESRYRY DDTGRVTELV NPEGLDYRFE YGQDRVTITD SLNRREVLYT EGEGGLKRVV
KKEHADGSIT RSEYDEAGRL KAQTDAAGRR TEYSLHMASG AVTAVTGPDG RTVRYGYNSQ
RQVTSVTYPD GLRSSREYDE KGRLAAETSR SGETTRYSYD DPASELPTGI QDATGSTKQM
AWSRYGQLLT FTDCSGYTTR YEYDRYGQQI AVHREEGIST YSSYNPRGQL VSQKDAQGRE
TRYEYSAAGD LTAIVAPDGS RSEIQYDAWG KAVSTTQGGL TRSMGYDAAG RITVLTNENG
SQSTFRYDPV DRLTEQRGFD GRTQRYHYDL TGKLTQSEDE GLVTLWHYDA SDRITHRTVN
GDPAEQWQYD EHGWLTTLSH TSEGHRVSVH YGYDDKGRLT GERQTVENPE TGEMLWEHET
GHAYSEQGLA TRQEPDGLPP VEWLTYGSGY LAGMKLGGTP LVEYMRDRLH RETARSFGGE
AYELATAWNT SGQLRSRHLN LPQLDRDYDW NDNGQLIRIS GPQESREYRY SDTGRLTGVH
TTAANLDIDI PYATDPAGNR LPDPELHPDS TLTAWPDNRI AEDAHYIYRY DEYGRLAEKT
DRIPEGVIRM HDERTHHYHY DSQHRLVFHT RIQHGEPQVE SRYLYDPLGR RTGKRVWRRE
RDLTGWMSLS RKPEETWYGW DGDRLTTVQT QQTRIQTVYQ PGSFTPLLRI ETENGEQAKA
RHRSLAEVLQ EDTGVTLPAE LAVMLGRLER ELRQGSVSEE SQQWLAQCGL TAEQMAAQLE
AEYIPERKLH LYHCDHRGLP QALISPEGET AWQGEYDEWG NLLGETSAQH LQQPYRLPGQ
QYDEESGLYY NRHRYYDPLQ GRYITQDPID IKGGWNLYSY ALNPVSWIDP LGLTQCDSEG
CNNDILFTGG SGPDNKILNE LGPRDGIDGL GSQNMKMYSG LLGGDILKPG ILGGLTLGSV
TQRPSRTAEE EVQAQIEYLA YKKRCEQKPD SNLSRCAAAI FQKSRKEDCL RMRQEWDDKW
WPGKHADEIK NVKASLKNTL IDVKKKMHAN