Gene ECH74115_0601 details


Gene Information

Locus tag: ECH74115_0601
Symbol:
ID: 6969303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 620324
End bp: 624520
Gene Length: 4197 bp
Protein Length: 1398 aa
Translation table: 11
GC content: 61%
IMG OID: 643384643
Product: RHS Repeat family protein
Protein accession: YP_002269157
Protein GI: 209400573
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
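
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(620324, 624520)
print(length)                  # 4197
print(protein_length(length))  # 1398
```

Both results match the Gene Length and Protein Length fields.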


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAA AACCAGCGGC GCGTCAGGGA GATATGACTC AGTATGGCGG TCCCATTGTC 
CAGGGTTCGG CAGGTGTAAG AATTGGCGCG CCAACTGGCG TGGCGTGCTC GGTGTGCCCG
GGCGGGATGA CTTCGGGCAA CCCGGTAAAT CCGCTGCTGG GGGCGAAGGT GCTGCCCGGC
GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGC
TACCGGACCC GGACGCCTGC GCCGGTGGGG ATTTTTGGCC CCGGCTGGAA AGCGCCTTCT
GATATCCGCT TACAGCTACG CGATGATGCA CTGGTACTCA ATGACAACGG CGGGCGGAGC
ATTCACTTTG AGCCGCTGCT GCCGGGGGAG GCGGTGTACA GCCGCAGCGA GTCAATGTGG
CTGGTGCGCG GTGGTAAGGC AGCGCAGCCG GACGGCCACA CGCTGGCGCG GCTGTGGGGG
GCGCTGCCGC CGGATATCCG GTTAAGCCCG CATCTTTACC TGGCGACCAA CAGCGCACAG
GGGCCGTGGT GGATACTGGG GTGGTCAGAG CGGGTGCCGG GTGCTGAGGA CGTACTGCCA
GCGCCGCTGC CGCCGTACCG GGTGCTTACC GGGATGGCGG ACCGCTTCGG GCGGACGCTG
ACGTACAGGC GTGAGGCCGC CGGTGACCTG GCCGGGGAAA TCACCGGCGT GACGGACGGT
GCCGGGCGGG AGTTCCGTCT GGTGCTGACC ACGCAGGCGC AGCGGGCGGA AGAGGCCCGT
AAACAGCACA CCGCTTCTTT ATCTTCCCCT GACACCCCCC GCCCTCTTTC AGACTCAGCG
TTCCCCGACA CACTGCCCGG TACCGAATAC GGTCCCGACA GAGGTATCCG CCTTTCGGCG
GTGTGGCTGA CGCACGACCC GGCATACCCG GAGAGCCTGC CCGCTGCGCC ACTGGTGCGG
TACACGTATA CGGAAGCCGG TGAACTGCTG GCGGTATATG ACCGCAGCAA TACGCAGGTG
CGCGCTTTCA CGTATGACGC GCAGCACCCG GGCCGGATGG TGGGGCACCG TTATGCGGGA
AGGCCGGAGA TGCGCTACCG CTACGACGAT GCGGGGCGGG TGGTGGAGCA ACTGAACCCG
GCAGGCCTGA GTTACCACTA CCAGTATGAG CAGGACCGCA TCACCGTCAC GGACAGCCTG
AACCGGCGTG AGGTGCTGCA TACAGAAGGC GGGGCCGGGC TGAAGCGGGT GGTGAAAAAA
GAACTGGCGG ACGGCAGCGT CACGCACAGC GGGTATGACG CGGCAGGAAG GCTAACGGCG
CAGACGGACG CGGCGGGACG GCGGACAGAG TATGGTCTGA ATGTGGTATC CGGCGATATC
ACGGACATCA CCACACCGGA CGGGCGGGAG ACGAAATTTT ACTATAACGA CGGGAACCAG
CTGACGGCGG TGGTGTCCCC GGACGGGCTG GAGAGCCGCC GGGCATATGA TGAACCGGGC
AGGCTGGTAT CGGAGACATC GCGCAGCGGG GAGACAGTAC GCTACCGCTA CGATGATGCG
TACAGTGAGT TACCGGCGAC GACAACAGAT GCGACGGGCA GCACCCGGCA GATGACCTGG
AGCCGCTACG GTCAGTTGCT GGCGTTCACC GACTGCTCGG GCTACCAGAC CCGCTATGAA
TACGACCGCT TCGGCCAGAT GACGGCGGTC CACCGCGAGG AAGGCATCAG CCTTTACCGC
CGCTATGACA ACCGTGGCCG GTTAATCTCG GTGAAAGACG CACAGGGCCA TGAAACGCGG
TATGAGTACA ACGCCGCAGG CGACCTGACT GCCGTTATCA CCCCGGACGG CAACCGGAGC
GAGACACAGT ACGATGCGTG GGGAAAGGCG GTCAGCACCA CGCAGGGCGG GCTGACGCGC
AGTATGGAAT ACGACCTTGC CGGACGCATC ACCACGCTGA CCAACGAGAA CGGCAGCCGG
AGTGAGTTTA CCTACGATGC GCTTGACCGG CTGGTACAGC AGCGCGGCTT TGACGGGCGG
ACGCAACGTT ACCACTATGA CCTGACCGGA AAACTCACGC AGAGTGAAGA TGAGGGGCTT
GTCACCCTCT GGCACTACGA CGAATCGGAC CGCCTCACTC ACCGCACGGT GAACGGCGAA
CCGGCAGAGC AGTGGCAGTA CGACGAGCAC GGCTGGCTGA CAGAAATCAG CCACCTGAGC
GAAGGCCATC AGGTGGCGGT GCATTACGGT TATGATGATA AGGGCCGCCT GGCCGGGGAG
CGCCAGACGG TGCATAACCC GGAGACGGGG GAACTGCTGT GGCAGCATGA GACAGAGCAC
GCATACAACG AACAGGGTCT GGCAAACCGC GTCACGCCGG ACAGCCTGCC GCGGGTGGAG
TGGCTGACCT ACGGCAGCGG TTATCTTGCG GGGATGAAGC TGGGCGGGAC GCCGCTGGTG
GAGTTCACGC GCGACAGGCT GCACCGCGAG ACGGTGCGCA GCTTCGGCAA TAACGCATAC
GAACTGACCA GCACATACAC TCCCGCAGGC CATTTACAGA GCCAGCGCCT GAACAGCCAG
GTGTATGACC GTGACTACGA CTGGAATGAC AATGGCGACC TGGTGCGCAT CAGCGGCCCG
CGACAGACGC GGGAATATGG CTACAGCGCC ACGGGCAGGC TGGAGAGCGT GCGCACCCTT
GCATCAGACC TGGATATCCG CATCCCGTAT GCGACCGACC CGGCGGGAAA CCGGCTGCCG
GACCCGGAGC TACACCCGGA CAGCACGCTC ACGGCGTGGC CGGATAACCG CATCGCGGAG
GATGCGCACT ATGTCTACCG ACACGATGAA TACGGCAGGC TGACGGAGAA GACGGACCGC
ATCCCGGCGG GTGTGATACG GACGGACGAC GAGCGGACCC ACCACTACCA CTACGACAGC
CAGCACCGCC TGGTGTTCTA CACGCGGATA CAGCATGGCG AGCCACTGGT CGAGAGCCGC
TACCTCTACG ACCCGCTGGG CCGCCGGACG GGGAAACGGG TGTGGCGTCG CGGGCGTGAC
CTGACGGGAT GGATGTCGCT GTCGCGGAAA CCGGAGGTGA CGTGGTACGG GTGGGACGGC
GACCGGCTGA CGACGGTACA GACCGACACC ACGCGTATCC AGACGGTATA CGAGCCGGGA
AGCTTCGCGC CGCTCATCCG CATCGAAACA GACAACGGTG AGCGGGAGAA AGCGCAGCGC
CGCAGCCTGG CGGAGAAGCT GCAGCAGGAA GGGAGCGAGG ACGGGCACGG TGTGGTATTT
CCGGCTGAAC TGGTGCGGCT GCTGGACAGA CTGGAGGAAG AAATCCGGGC AGACCGCGTG
AGCAGTGAAA GCCGGGCGTG GCTTGCGCAG TGCGGGCTGA CGGTGGAGCA ACTGGCCAGA
CAGGTGGAGC CGGAATACAC ACCGGCGAGA AAAGTTCATC TTTACCACTG CGACCACCGG
GGCCTGCCGC TGGCGCTCAT CAGCGAAGAC GGCAATACGG CGTGGAGCGG GGAGTATGAT
GAATGGGGCA ACCAGCTGAA TGAGGAGAAC CCGCATCACC TGCACCAGCC GTACCGGCTG
CCGGGGCAGC AGTATGATAA GGAGTCGGGG CTGTACTACA ACCGGAACCG GTACTACGAT
CCGTTGCAGG GGCGGTATAT CACTCAGGAC CCGATAGGGC TGGAGGGGGG ATGGAGTCTG
TATGCGTATC CGCTGAATCC GGTGAATGGT ATTGATCCAT TAGGGTTAAG TCCCGCAGAT
GTAGCGCTAA TAAGAAGAAA AGATCAACTA AACCATCAAA GAGCATGGGA TATATTATCT
GATACTTATG AAGATATGAA GAGATTAAAT TTAGGTGGGA CTGATCAATT TTTCCATTGT
ATGGCATTTT GTCGAGTGTC TAAATTAAAT GACGCTGGTG TTAGCCGATC GGCGAAAGGG
CTGGGTTATG AAAAAGAGAT TAGAGATTAC GGGTTAAATC TGTTCGGTAT GTACGGCAGA
AAAGTAAAGC TATCCCATTC TGAAATGATT GAAGATAATA AAAAAGACTT GGCTGTAAAT
GACCATGGGT TGACATGTCC ATCAACAACA GATTGCTCAG ATAGATGTAG TGATTATATT
AATCCAGAGC ATAAAGAAAC GATAAAGGCT TTACAAGATG CTGGCTATCT CAAGTAA
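
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be flattened before its length or GC content can be computed. A minimal sketch, using hypothetical helper names, is given here; the GC fraction it returns for the full sequence can be compared against the GC content field in the Gene Information section.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCGGAA AACCAGCGGC GCGTCAGGGA GATATGACTC AGTATGGCGG TCCCATTGTC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the whole gene sequence to `clean_sequence` should give a string whose length equals the listed gene length.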
 
Protein sequence
MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVLPG 
ETDLALPGPL PFILSRTYSS YRTRTPAPVG IFGPGWKAPS DIRLQLRDDA LVLNDNGGRS
IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ
GPWWILGWSE RVPGAEDVLP APLPPYRVLT GMADRFGRTL TYRREAAGDL AGEITGVTDG
AGREFRLVLT TQAQRAEEAR KQHTASLSSP DTPRPLSDSA FPDTLPGTEY GPDRGIRLSA
VWLTHDPAYP ESLPAAPLVR YTYTEAGELL AVYDRSNTQV RAFTYDAQHP GRMVGHRYAG
RPEMRYRYDD AGRVVEQLNP AGLSYHYQYE QDRITVTDSL NRREVLHTEG GAGLKRVVKK
ELADGSVTHS GYDAAGRLTA QTDAAGRRTE YGLNVVSGDI TDITTPDGRE TKFYYNDGNQ
LTAVVSPDGL ESRRAYDEPG RLVSETSRSG ETVRYRYDDA YSELPATTTD ATGSTRQMTW
SRYGQLLAFT DCSGYQTRYE YDRFGQMTAV HREEGISLYR RYDNRGRLIS VKDAQGHETR
YEYNAAGDLT AVITPDGNRS ETQYDAWGKA VSTTQGGLTR SMEYDLAGRI TTLTNENGSR
SEFTYDALDR LVQQRGFDGR TQRYHYDLTG KLTQSEDEGL VTLWHYDESD RLTHRTVNGE
PAEQWQYDEH GWLTEISHLS EGHQVAVHYG YDDKGRLAGE RQTVHNPETG ELLWQHETEH
AYNEQGLANR VTPDSLPRVE WLTYGSGYLA GMKLGGTPLV EFTRDRLHRE TVRSFGNNAY
ELTSTYTPAG HLQSQRLNSQ VYDRDYDWND NGDLVRISGP RQTREYGYSA TGRLESVRTL
ASDLDIRIPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRHDE YGRLTEKTDR
IPAGVIRTDD ERTHHYHYDS QHRLVFYTRI QHGEPLVESR YLYDPLGRRT GKRVWRRGRD
LTGWMSLSRK PEVTWYGWDG DRLTTVQTDT TRIQTVYEPG SFAPLIRIET DNGEREKAQR
RSLAEKLQQE GSEDGHGVVF PAELVRLLDR LEEEIRADRV SSESRAWLAQ CGLTVEQLAR
QVEPEYTPAR KVHLYHCDHR GLPLALISED GNTAWSGEYD EWGNQLNEEN PHHLHQPYRL
PGQQYDKESG LYYNRNRYYD PLQGRYITQD PIGLEGGWSL YAYPLNPVNG IDPLGLSPAD
VALIRRKDQL NHQRAWDILS DTYEDMKRLN LGGTDQFFHC MAFCRVSKLN DAGVSRSAKG
LGYEKEIRDY GLNLFGMYGR KVKLSHSEMI EDNKKDLAVN DHGLTCPSTT DCSDRCSDYI
NPEHKETIKA LQDAGYLK
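
The protein sequence is the translation of the gene sequence under translation table 11. Table 11 assigns the same amino acid to each codon as the standard genetic code and differs from it only in which codons may act as start codons, so a standard codon table is enough to reproduce the translation. The sketch below is an illustration with hypothetical names, not the database's own pipeline; it does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard codon table in TCAG order; codon-to-residue assignments
# are the same in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCGGAA AACCAGCGGC GCGTCAGGGA"))  # MSGKPAARQG
```

The output matches the first block of the protein sequence above.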