Gene ECH74115_4965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4965 
Symbol 
ID6966670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4607989 
End bp4612218 
Gene Length4230 bp 
Protein Length1409 aa 
Translation table11 
GC content59% 
IMG OID643388647 
ProductRhs family protein 
Protein accessionYP_002273074 
Protein GI209396291 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.832182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAA AACCGGCGGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT 
CAGGGTTCAG CCGGGGTGCG CATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC
GGCGGAGTGA CGTCCGGCCA TCCGGTCAAT CCCCTGCTCG GTGCAAAGGT CCTTCCCGGT
GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATCC TCTCCCGCAC CTACAGCAGT
TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG
GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC
CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA GTCACTCTGG
CTGGTGCGCG GCGGCGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG
GCGCTGCCGG AAGAACTCCG CTTAAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG
GGGCCGTGGT GGCTGCTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT
GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG
ACGTTCCACC GCGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT
GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG
CAGCAGGCCA TTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC
ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG
GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCGCGCGGC
GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT
AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT
TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT
CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGA ACCGCCGTGA AGTCCTGCAC
ACGCAGGGTG AAGGCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC
ACGCAGAGTC AGTTTGACGC GGTGGGCAGG CTCAGGGCAC AGACGGATGC CGCAGGCAGG
ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT
GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT
GACGGGCTGG AAATACGCCG GGAATATGAT GAATGGGGCC GTCTGATTCA GGAAACTGCC
CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA
ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG
AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG
ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG
TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CGCCGCCGGT
GACCTGACCA CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG
GGAAAAGCCA TCTGTACCAC GCAGGGCGGT CTGACGCGCA GTATGGAATA CGATGCTGCC
GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA
CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC
CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GCACTATGAC
GAAGCAGACC GCCTCACGCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCG CTGGCAGTAT
GACGAACGCG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGACGGTG
CATTACGGGT ATGATGAGAA AGGCCGGCTG ACCGGTGAGC GTCAGACGGT GCATCACCCG
CAGACGGAAG CACTGCTCTG GCAGCATGAG ACCAGACACG CTTACAACGC GCAGGGGCTG
GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA TGGCAGCGGC
TGGCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG
CACCGGAAAA CGCTGCGCAG ATTCGGCCGT TATGAACTCA CCACCGCTTA TACCCCTGCC
GGGCAGTTAC AGAGCCAGCA CCTGAACAGC CTGCAGTATG ACCGCGATTA CACCTGGAAC
GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC
GACTCCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG
TATGCCACGG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC
CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CTGGTATGAC
CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT
GATGAGCGGA CTCACCGGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG
ACACAATATG AAGAGCCGCT GGTCGAAAGC CGCTATCTTT ACGACCCGCT GGGCCGCAGG
GTGGCAAAAC GGGTGTGGCG ACGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG
AAACCGCAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACAAT ACAGAACGAC
AGAACCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGGGTTGAA
ACCGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC GCTTCAGCAG
TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC
CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA
TCGTGCGGCC TGACTGTGGC GCAGATGCAA AGCCAGATGG ACCCGGTATA CACGCCGGCG
CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCCCT TATCAGTAAG
GAAGGGGCAA CAGAATGGTG CGCAGAATAC GATGAGTGGG GCAACCTGCT GAATGAAGAG
AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC
GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC ACGGGCGATA TATCACTCAG
GATCCGATTG GACTGAAGGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCGGTCATA
AATGTAGATC CGCAAGGTTT GGTTGATATA AATTTATACC CCGAAAGTGA TCTTATCCAT
TCTGTAGCTG ATGAGATTAA TATCCCAGGC GTTTTCACAA TCGGGGGGCA TGGTACCCCC
ACATCTATTG AATCCGCAAC GCGCAGTATC ATGACAGCTA AAGATCTAGC ATATCTAATT
AAATTTGATG GGAATTATAA AGATGGGATG ACAGTTTGGT TATTTTCTTG TAATACAGGT
AAAGGACAAA ATTCATTTGC TAGCCAATTA GCTAAAGAGT TACATACAAA TGTAATAGGA
CCTGACACGC TATGGACGTG GTGGGGGCGA GGAACTAATG GTAAGTTAAA AATGGATACA
GTGCTAACAG CACCAACGAA CCTTAATTCA AATAAGGATC TAATGGCTAT AACAACAAAA
GACCTTGGTA ATTGGATAAC ATATGGGCCA TCTGGGCACC CCATTTCTAA TATGCAAGGT
ACGCCAGAAA AACCCAGTGA TATAAGATAG
 
Protein sequence
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG 
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWLLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHREAAGEF SGEITGVTDG
AGRHFRLVLT TQAQRAEEAR QQAISGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLNRREVLH TQGEGGLKRV VKKEHADGSV
TQSQFDAVGR LRAQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP
DGLEIRREYD EWGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL
SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNAAG
DLTTVIAPDG SRNGTQYDAW GKAICTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV
LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWHYD EADRLTHRTV KGETAERWQY
DERGWLTDIS HISEGHRVTV HYGYDEKGRL TGERQTVHHP QTEALLWQHE TRHAYNAQGL
ANRCIPDSLP AVEWLTYGSG WLAGMKLGDT PLVDFTRDRL HRKTLRRFGR YELTTAYTPA
GQLQSQHLNS LQYDRDYTWN DNGELIRISS PRQTRSYSYS DSGRLTGVHT TAANLDIRIP
YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYWYD RHGRLTEKTD LIPEGVIRTD
DERTHRYHYD SQHRLVHYTR TQYEEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR
KPQVTWYGWD GDRLTTIQND RTRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ
SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVAQMQ SQMDPVYTPA
RKIHLYHCDH RGLPLALISK EGATEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES
GLYYNRHRYY DPLHGRYITQ DPIGLKGGWN FYQYPLNPVI NVDPQGLVDI NLYPESDLIH
SVADEINIPG VFTIGGHGTP TSIESATRSI MTAKDLAYLI KFDGNYKDGM TVWLFSCNTG
KGQNSFASQL AKELHTNVIG PDTLWTWWGR GTNGKLKMDT VLTAPTNLNS NKDLMAITTK
DLGNWITYGP SGHPISNMQG TPEKPSDIR