Gene ECH74115_5225 details

Gene Information

Locus tag: ECH74115_5225
Symbol:
ID: 6972101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4872780
End bp: 4874030
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 53%
IMG OID: 643388890
Product: polysaccharide biosynthesis protein
Protein accession: YP_002273310
Protein GI: 209395930
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
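
The start, end, gene length and protein length listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4872780
END_BP = 4874030


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1251 416
```

Both results agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.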


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.115958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTGG CAAAAGCGTC CTTGTGGACG GCGGCCAGTA CACTGGTCAA GATTGGTGCC 
GGGTTACTGG TCGGTAAGTT GCTGGCGGTG TCATTTGGTC CGGCGGGGCT TGGGCTGGCG
GCAAATTTCC GCCAGTTGAT TACCGTGCTC GGCGTGCTTG CCGGGGCTGG CATCTTTAAC
GGTGTAACCA AATACGTTGC CCAGTACCAT GATAATCCGC AACAGCTGCG CCGCGTGGTC
GGCACTTCAT CAGCGATGGT ACTTGGCTTC TCTACGCTGA TGGCGCTGGT TTTTGTGCTG
GCAGCTGCGC CAATCAGCTT GGGATTGTTT GGTAATCACG ACTATCAGGG GCTGGTGCGT
TTAGTGGCGC TGGTGCAAAT GGGGATCGCC TGGGGCAACC TGTTACTGGC GTTGATGAAA
GGCTTTCGCG ATGCCGCAGG TAATGCGTTA TCTCTGATTG TTGGCAGCTT GATTGGCGTT
CTTGCGTACT ACGTCAGTTA CCGTTTGGGC GGATATGAAG GGGCGTTGCT GGGTCTGGCG
TTGATTCCCG CGCTGGTGGT GATTCCTGCC GCCGTCATGT TAATTAAGCG TGGTGCTATC
CCGTTAAGCT ATCTGAAACC CAGCTGGGAT AACGGTCTGG CAGGGCAGTT GAGCAAATTT
ACGCTCATGA CGTTGATTAC GTCGGTGACC TTGCCTGTTG CTTACATCAT GATGCGTAAA
CTGCTGGCGG CGCAGTATAG CTGGGATGAG GTGGGGATCT GGCAAGGGGT GAGCAGTATT
TCCGATGCCT ACCTGCAATT TATTACGGCA TCGTTCAGCG TATATTTGCT GCCCACGTTG
TCGCGGCTAA CGGAAAAGCG CGATATCACC CGGGAAGTGG TTAAATCGCT GAAATTCGTC
TTACCGGCAG TGGCGGCGGC GAGTTTTACC GTCTGGTTGC TGCGTGATTT TGCTATCTGG
CTGCTGTTGT CGAATAAATT TACCGCTATG CGCGATCTCT TTGCCTGGCA GTTGGTGGGT
GATGTGTTAA AAGTGGGCGC TTATGTCTTT GGTTATCTGG TGATCGCCAA AGCTTCACTG
CGGTTTTATA TTCTGGCGGA AGTCAGCCAG TTCACTTTAT TGATGGTATT TGCCCACTGG
CTAATCCCTA CGCACGGCGC GCTGGGCGCG GCACAGGCAT ATATGGCAAC CTATATCGTC
TATTTTTCTC TTTGTTGTGG CGTGTTTTTA CTCTGGCGTA GGCGGGCATG A
 
Protein sequence
MSLAKASLWT AASTLVKIGA GLLVGKLLAV SFGPAGLGLA ANFRQLITVL GVLAGAGIFN 
GVTKYVAQYH DNPQQLRRVV GTSSAMVLGF STLMALVFVL AAAPISLGLF GNHDYQGLVR
LVALVQMGIA WGNLLLALMK GFRDAAGNAL SLIVGSLIGV LAYYVSYRLG GYEGALLGLA
LIPALVVIPA AVMLIKRGAI PLSYLKPSWD NGLAGQLSKF TLMTLITSVT LPVAYIMMRK
LLAAQYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLTEKRDIT REVVKSLKFV
LPAVAAASFT VWLLRDFAIW LLLSNKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL
RFYILAEVSQ FTLLMVFAHW LIPTHGALGA AQAYMATYIV YFSLCCGVFL LWRRRA
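
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last lines of the gene sequence only. It is an illustration under stated assumptions, not the database's own method: the codon table is built by hand, the `translate` helper is hypothetical, and it relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in which start codons are permitted).

```python
from itertools import product

# Codon table in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
# Both start in frame because each full line holds 60 bases (20 codons).
first_line = "ATGTCGTTGG CAAAAGCGTC CTTGTGGACG GCGGCCAGTA CACTGGTCAA GATTGGTGCC"
last_line = "TATTTTTCTC TTTGTTGTGG CGTGTTTTTA CTCTGGCGTA GGCGGGCATG A"

print(translate(first_line))  # MSLAKASLWTAASTLVKIGA
print(translate(last_line))   # YFSLCCGVFLLWRRRA
```

The two outputs match the first 20 and the last 16 residues of the protein sequence above.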