Gene ECH74115_0254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0254 
Symbol 
ID6972234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp272333 
End bp273841 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content55% 
IMG OID643384324 
ProductRhsG core protein with extension 
Protein accessionYP_002268840 
Protein GI209399453 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.814194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGCCC TGCAAGGGAA TTTCCCGTAT GCAACAGACC CGGCAGGAAA CCGGCTGCCG 
GACCCGGAAC TGCACCCGGA CAGCACCCTC ACGGCATGGC CGGATAACCG CATCGCGGAA
GATGCGCACT ATGTCTACCG CTACGATGAA TACGGCAGGC TGGCGGAGAA GACGGACCGC
ATCCCGGAAG GGGTTATCCG GATGCACGAC GAGCGCACCC ACCACTATCA CTACGACAGC
CAGCACCGCC TGGTGTTCCA CACGCGGATA CAGCACGGCG AACCACAGGT GGAGAGCCGG
TACCTCTATG ACCCGCTGGG CCGCCGGACG GGAAAACGGG TGTGGCGGCG GGAGCGTGAC
CTGACGGGGT GGATGTCGCT GTCGCGTAAA CCGGAGGAGA CCTGGTACGG GTGGGACGGT
GACAGGCTGA CCACTGTACA GACCCAACAG ACAAGAATCC AGACGGTATA CCAGCCGGGA
AGCTTCACGC CGCTCCTGAG AATCGAAACA GAGAATGGTG AACAGGCGAA GGCGCGGCAC
CGTAGCCTGG CGGAGGTGTT GCAGGAGGAC ACGGGTGTGA CGCTACCGGC GGAGCTGGCG
GTGATGCTGG GAAGGCTGGA GCGGGAGCTG CGGCAGGGCA GCGTGAGTGA AGAAAGCCAG
CAGTGGCTTG CGCAGTGCGG GCTGACGGCG GAGCAGATGG CCGCGCAGCT GGAGGCGGAA
TACATCCCGG AGAGGAAACT TCATCTTTAC CACTGCGACC ACCGGGGACT GCCGCTGGCG
CTCATCAGCC CGGAAGGGGA AACGGCGTGG CAGGGGGAGT ATGACGAGTG GGGAAACCTG
CTGGGCGAAA CCAGCGCGCA GCACCTTCAA CAGTCACTCC GTCTGCCGGG GCAGCAGTAT
GATGAGGAGT TGGGGCTGTA CTACAACCGC AACCGGTACT ATGATCCGTT GCAGGGGAGA
TATATCACCC AGGACCCGAT AGGGCTGGAG GGGGGATGGA ACCTGTATCA GTACCCACTC
AATCCTATTG AACATATAGA TCCGTTGGGG TTAGCACTTG ATTTGAATTA TTATTCTCCG
TCAGATCCTA TATATAAAGG TTCTCTTAAT GTAAGGGAAT TTCCAACGGG TTTTACAGTT
GGAGGTCATG GTTCGCCTAC ATCTATGAGT GATGACAGAA TCAAAAAAGG AAGTGATCTG
ACAATAAAAC AATTAGCTAG CGACATAAGA GCGAATCCTA AATATCATGA AGGTATGCCT
GTAGTCTTGT TTTCCTGCGA AACAGGAAAA GGCAAAAATT CATTTGCACA GAAACTAGCT
AACGAACTTG ATGCTACAGT GATTGCTCCT GATGAAATAA TATGGATTTG GCCGGATGGA
AACTATGCTA TTATGGGGCA AACAGCTAGG ATAACAATAG GCGGTAAGGA CAACGGGGTG
TTTGAATTGG TGCCAGACGA GAAACAACCA GGGGACTTTC ATAAGTTTAC ACCTACAGGA
AGCAAATAG
 
Protein sequence
MDALQGNFPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRYDE YGRLAEKTDR 
IPEGVIRMHD ERTHHYHYDS QHRLVFHTRI QHGEPQVESR YLYDPLGRRT GKRVWRRERD
LTGWMSLSRK PEETWYGWDG DRLTTVQTQQ TRIQTVYQPG SFTPLLRIET ENGEQAKARH
RSLAEVLQED TGVTLPAELA VMLGRLEREL RQGSVSEESQ QWLAQCGLTA EQMAAQLEAE
YIPERKLHLY HCDHRGLPLA LISPEGETAW QGEYDEWGNL LGETSAQHLQ QSLRLPGQQY
DEELGLYYNR NRYYDPLQGR YITQDPIGLE GGWNLYQYPL NPIEHIDPLG LALDLNYYSP
SDPIYKGSLN VREFPTGFTV GGHGSPTSMS DDRIKKGSDL TIKQLASDIR ANPKYHEGMP
VVLFSCETGK GKNSFAQKLA NELDATVIAP DEIIWIWPDG NYAIMGQTAR ITIGGKDNGV
FELVPDEKQP GDFHKFTPTG SK