Gene ECH74115_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2065 
Symbol 
ID6971881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1960504 
End bp1964706 
Gene Length4203 bp 
Protein Length1400 aa 
Translation table11 
GC content61% 
IMG OID643385977 
Productprotein rhsD 
Protein accessionYP_002270466 
Protein GI209397644 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.595023 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAA AACCAGCGGC GCGTCAGGGA GATATGACTC AGTATGGCGG TCCCATTGTC 
CAGGGTTCGG CAGGTGTAAG AATTGGCGCG CCAACTGGCG TGGCGTGCTC GGTGTGCCCG
GGCGGGATGA CTTCGGGCAA CCCGGTAAAT CCGCTGCTGG GGGCGAAGGT GCTGCCCGGC
GAGACGGACC TTGCGCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGC
TACCGGACCC GGACGCCTGC GCCGGTGGGG ATTTTTGGCC CCGGCTGGAA AGCGCCTTCT
GATATCCGCT TACAGCTACG CGATGATGCA CTGGTACTCA ATGACAACGG CGGGCGGAGC
ATTCACTTTG AGCCGCTGCT GCCGGGGGAG GCGGTGTACA GCCGCAGCGA GTCAATGTGG
CTGGTGCGCG GTGGTAAGGC AGCGCAGCCG GACGGCCACA CGCTGGCGCG GCTGTGGGGG
GCGCTGCCGC CGGATATCCG GTTAAGCCCG CATCTTTACC TGGCGACCAA CAGCGCACAG
GGGCCGTGGT GGATACTGGG GTGGTCAGAG CGGGTGCCGG GTGCTGAGGA CGTACTGCCA
GCGCCGCTGC CGCCGTACCG GGTGCTTACC GGGATGGCGG ACCGCTTCGG GCGGACGCTG
ACGTACAGGC GTGAGGCCGC CGGTGACCTG GCCGGGGAAA TCACCGGCGT GACGGACGGT
GCCGGGCGGG AGTTCCGTCT GGTGCTGACC ACGCAGGCGC AGCGGGCGGA AGAGGCCCGT
AAACAGCACA CCGCTTCTTT ATCTTCCCCT GACCCCCCCC GCCCTCTTTC AGACTCAGCG
TTCCCCGACA CACTGCCCGG TACCGAATAC GGTCCCGACA GAGGTATCCG CCTTTCGGCG
GTGTGGCTGA CGCACGACCC GGCATACCCG GAAAGCCTGC CCGGTGCGCC ACTGGCGCGG
TACACGTATA CGGAAGCCGG TGAACTGCTG GCGGTATATG ACCGCAGCAA TACGCAGGTG
CGCGCTTTCA CGTATGACGC GCAGCATCCG GGCCGGATGG TGGCGCACCG TTACGCGGGA
AGGCCGGAGA TGCGCTACCG CTACGACGAT ACCGGGCGGG TGGTGGAGCA GCTGAACCCG
GCAGGCCTGA GTTACCGCTA CCAGTATGAG CAGGACCGCA TCACCGTCAC GGACAGCCTG
AACCGGCGTG AGGTGCTGCA TACAGAAGGC GGGGCCGGGC TGAAGCGGGT GGTGAAAAAA
GAACTGGCGG ACGGCAGCGT CACGCACAGC GGGTATGACG CGGCAGGAAG GCTCACGGCG
CAGACGGACG CGGCGGGACG GCGGACAGAG TACGGTCTGA ATGTGGTGTC CGGCGATATC
ACGGACATCA CCACACCGGA CGGGCGGGAG ACGAAATTTT ACTATAACGA CGGGAACCAG
CTGACGGCGG TGGTGTCCCC GGACGGGCTG GAGAGCCGCC GGGCATATGA TGAACCGGGC
AGGCTGGTAT CGGAGACATC GCGCTGTGGG GACGTCATCC GGTATGCTTA TGATAATCCG
CACAGTGAAT TACCGGCCAC GACAACAGAT GCGACGGGCA GCACCCGGCA GATGACCTGG
AGCCGCTACG GGCAGTTGCT GGCGTTCACC GACTGCTCGG GCTACCAGAC CCGTTATGAA
TACGACCGCT TTGGTCAGAT GACGGCGGTC CACCGTGAGG AAGGTATCAG CCGTTACCGC
CGCTATGACA ACCGTGGCCG GTTAACCTCG GTGAAAGACG CACAGGGCCA TGAAACGCGG
TATGAGTACA ACGCCGCAGG CGACCTGACT GCCGTTATCA CTCCGGACGG CAACCGGAGC
GAGACACAGT ACGATGCGTG GGGAAAAGCG GTCAGCACCA CGCAGGGCGG GCTGACGCGC
AGTATGGAGT ATGACCTCGC CGGACGCATC ACCACGCTGA CCAACGAGAA CGGCAGCCGG
AGTGAGTTTA CCTACGATGC GCTTGACCGG CTGGTACAGC AGCGCGGCTT TGACGGGCGG
ACGCAACGTT ACCACTATGA CCTGACCGGA AAACTCACGC AGAGTGAAGA TGAGGGGCTT
GTCACCCTCT GGCACTACGA CGAATCGGAC CGCCTCACTC ACCGCACGGT GAACGGCGAA
CCGGCAGAGC AGTGGCAGTA CGACGAGCAC GGCTGGCTGA CAGAAATCAG CCACCTGAGC
GAAGGCCATC AGGTGGCGGT GCATTACGGT TATGATGATA AGGGCCGCCT GGCCGGGGAG
CGCCAGACGG TGCATAACCC GGAGACGGGG GAACTGCTGT GGCAGCATGA GACAGAGCAC
GCATACAACG AACAGGGTCT GGCAAACCGC GTCACGCCGG ACAGCCTGCC GCGGGTGGAG
TGGCTGACCT ACGGCAGCGG TTATCTTGCG GGGATGAAGC TGGGCGGGAC GCCGCTGGTG
GAGTTCACGC GCGACAGGCT GCACCGCGAG ACGGTGCGCA GCTTCGGCAA TAACGCATAC
GAACTGACCA GCACATACAC TCCCGCAGGC CATTTACAGA GCCAGCGCCT GAACAGCCAG
GTGTATGACC GTGACTACGA CTGGAATGAC AATGGCGACC TGGTGCGCAT CAGCGGCCCG
CGACAGACGT GGGAATATGG CTACAGTGCC ACGGGCAGGC TGGAGAGCGT GCGCACCCTT
GCATCAGACC TGGATATCCG CATCCCGTAT GCGACCGACC CGGCGGGAAA CCGGCTGCCG
GACCCGGAGC TACACCCGGA CAGCACGCTC ACGGCGTGGC CGGATAACCG CATCGCGGAG
GATGCGCACT ATGTCTACCG ACACGATGAA TACGGCAGGC TGACGGAGAA GACGGACCGC
ATCCCGGCGG GTGTGATACG GACGGACGAC GAGCGGACCC ACCACTACCA CTACGACAGC
CAGCACCGCC TGGTGTTCTA CACGCGGATA CAGCATGGCG AGCCACTGGT CGAGAGCCGC
TACCTCTACG ACCCGCTGGG ACGGCGAATG GCAAAACGGG TCTGGCGGCG GGAGCGTGAC
CTGACGGGGT GGATGTCGCT GTCGCGTAAA CCGGAGGTGA CGTGGTATGG CTGGGACGGA
GACAGGCTGA CGACGGTGCA GACTGACACC ACACGTATCC AGACGGTATA CGAGCCGGGA
AGCTTCACGC CGCTCATCCG GGTCGAGACA GAGAACGGCG AGCGGGAAAA AGCGCAGCGG
CGCAGCCTGG CAGAGACGCT CCAGCAGGAA GGGAGTGAGA ACGGCCACGG CGTGGTGTTC
CCGGCTGAAC TGGTGCGGCT GCTGGACAGG CTGGAGGAAG AAATCCGGGC AGACCGCGTG
AGCAGTGAAA GCCGGGCGTG GCTTGCGCAG TGCGGGCTGA CGGTGGAGCA ACTGGCCAGA
CAGGTGGAGC CGGAATACAC ACCGGCGCGA AAAGTTCATT TTTACCACTG CGACCACCGG
GGCCTGCCGC TGGCGCTCAT CAGCGAAGAC GGCAATACGG CGTGGCGCGG GGAGTATGAT
GAATGGGGCA ACCAGCTTAA TGAAGAGAAC CCGTATTACC TGCACCAGCC ATACCGTCTG
CCGGGGCAGC AGCATGATGA GGAATCAGGG CTGTACTATA ACCGGAACCG GTACTATGAC
CCGCTACAGG GGAGGTATAT TACACAAGAC CCCATTGGGC TGGCGGGGGG ATGGAATCTG
TATAATTACC CACTGAATCC GATAATAAGG ATGGATCCTT TGGGTTTGTA TAATTTATAT
CAATTATTAT ATGATGTTTG GCATGATGAT TCATATGGAA CATCATCAAT TGATATTACT
GGCAGTGGAG ATCTAATATC ATTAGGTGGT CATGCAGGAC TTGGCGTTGC GTTTGCTAAA
AAGAAAGGTG AAATGTTATC TGATATTTGT ATTTATGCTA CAGCATGCGG ACATGCAGGA
ATTGGTGGTG GGATAAATGC GGCTATCACA TATTCTGAGA CCAAGTCTTT ACCTACATCG
GGAGTCAGCA ATTCAGTAGG TGTAACGGTT GGCGGCGGAG TTGGGGGGCA TTTTGCGTAT
ACTTATGTAG TGGATGTTGA TAATCCAGAA TCATCGACAG AATCTGTTGG TATCGGTGCA
GGTGTTGACG CTTCAGTTAT GACTCTGGCT TGTAGAACGT GGCAAGAATG CTGGGTCAAT
TAA
 
Protein sequence
MSGKPAARQG DMTQYGGPIV QGSAGVRIGA PTGVACSVCP GGMTSGNPVN PLLGAKVLPG 
ETDLALPGPL PFILSRTYSS YRTRTPAPVG IFGPGWKAPS DIRLQLRDDA LVLNDNGGRS
IHFEPLLPGE AVYSRSESMW LVRGGKAAQP DGHTLARLWG ALPPDIRLSP HLYLATNSAQ
GPWWILGWSE RVPGAEDVLP APLPPYRVLT GMADRFGRTL TYRREAAGDL AGEITGVTDG
AGREFRLVLT TQAQRAEEAR KQHTASLSSP DPPRPLSDSA FPDTLPGTEY GPDRGIRLSA
VWLTHDPAYP ESLPGAPLAR YTYTEAGELL AVYDRSNTQV RAFTYDAQHP GRMVAHRYAG
RPEMRYRYDD TGRVVEQLNP AGLSYRYQYE QDRITVTDSL NRREVLHTEG GAGLKRVVKK
ELADGSVTHS GYDAAGRLTA QTDAAGRRTE YGLNVVSGDI TDITTPDGRE TKFYYNDGNQ
LTAVVSPDGL ESRRAYDEPG RLVSETSRCG DVIRYAYDNP HSELPATTTD ATGSTRQMTW
SRYGQLLAFT DCSGYQTRYE YDRFGQMTAV HREEGISRYR RYDNRGRLTS VKDAQGHETR
YEYNAAGDLT AVITPDGNRS ETQYDAWGKA VSTTQGGLTR SMEYDLAGRI TTLTNENGSR
SEFTYDALDR LVQQRGFDGR TQRYHYDLTG KLTQSEDEGL VTLWHYDESD RLTHRTVNGE
PAEQWQYDEH GWLTEISHLS EGHQVAVHYG YDDKGRLAGE RQTVHNPETG ELLWQHETEH
AYNEQGLANR VTPDSLPRVE WLTYGSGYLA GMKLGGTPLV EFTRDRLHRE TVRSFGNNAY
ELTSTYTPAG HLQSQRLNSQ VYDRDYDWND NGDLVRISGP RQTWEYGYSA TGRLESVRTL
ASDLDIRIPY ATDPAGNRLP DPELHPDSTL TAWPDNRIAE DAHYVYRHDE YGRLTEKTDR
IPAGVIRTDD ERTHHYHYDS QHRLVFYTRI QHGEPLVESR YLYDPLGRRM AKRVWRRERD
LTGWMSLSRK PEVTWYGWDG DRLTTVQTDT TRIQTVYEPG SFTPLIRVET ENGEREKAQR
RSLAETLQQE GSENGHGVVF PAELVRLLDR LEEEIRADRV SSESRAWLAQ CGLTVEQLAR
QVEPEYTPAR KVHFYHCDHR GLPLALISED GNTAWRGEYD EWGNQLNEEN PYYLHQPYRL
PGQQHDEESG LYYNRNRYYD PLQGRYITQD PIGLAGGWNL YNYPLNPIIR MDPLGLYNLY
QLLYDVWHDD SYGTSSIDIT GSGDLISLGG HAGLGVAFAK KKGEMLSDIC IYATACGHAG
IGGGINAAIT YSETKSLPTS GVSNSVGVTV GGGVGGHFAY TYVVDVDNPE SSTESVGIGA
GVDASVMTLA CRTWQECWVN