Gene Information |
Locus tag | ECH74115_5225 |
Symbol | |
ID | 6972101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4872780 |
End bp | 4874030 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388890 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_002273310 |
Protein GI | 209395930 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
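The coordinate and length fields above are linked by simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (consistent with the listed Start bp, End bp, and Gene Length) and that the Protein Length excludes the single terminal stop codon of an unspliced CDS. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4872780, 4874030)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Both values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields of this record.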
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.115958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGTTGG CAAAAGCGTC CTTGTGGACG GCGGCCAGTA CACTGGTCAA GATTGGTGCC GGGTTACTGG TCGGTAAGTT GCTGGCGGTG TCATTTGGTC CGGCGGGGCT TGGGCTGGCG GCAAATTTCC GCCAGTTGAT TACCGTGCTC GGCGTGCTTG CCGGGGCTGG CATCTTTAAC GGTGTAACCA AATACGTTGC CCAGTACCAT GATAATCCGC AACAGCTGCG CCGCGTGGTC GGCACTTCAT CAGCGATGGT ACTTGGCTTC TCTACGCTGA TGGCGCTGGT TTTTGTGCTG GCAGCTGCGC CAATCAGCTT GGGATTGTTT GGTAATCACG ACTATCAGGG GCTGGTGCGT TTAGTGGCGC TGGTGCAAAT GGGGATCGCC TGGGGCAACC TGTTACTGGC GTTGATGAAA GGCTTTCGCG ATGCCGCAGG TAATGCGTTA TCTCTGATTG TTGGCAGCTT GATTGGCGTT CTTGCGTACT ACGTCAGTTA CCGTTTGGGC GGATATGAAG GGGCGTTGCT GGGTCTGGCG TTGATTCCCG CGCTGGTGGT GATTCCTGCC GCCGTCATGT TAATTAAGCG TGGTGCTATC CCGTTAAGCT ATCTGAAACC CAGCTGGGAT AACGGTCTGG CAGGGCAGTT GAGCAAATTT ACGCTCATGA CGTTGATTAC GTCGGTGACC TTGCCTGTTG CTTACATCAT GATGCGTAAA CTGCTGGCGG CGCAGTATAG CTGGGATGAG GTGGGGATCT GGCAAGGGGT GAGCAGTATT TCCGATGCCT ACCTGCAATT TATTACGGCA TCGTTCAGCG TATATTTGCT GCCCACGTTG TCGCGGCTAA CGGAAAAGCG CGATATCACC CGGGAAGTGG TTAAATCGCT GAAATTCGTC TTACCGGCAG TGGCGGCGGC GAGTTTTACC GTCTGGTTGC TGCGTGATTT TGCTATCTGG CTGCTGTTGT CGAATAAATT TACCGCTATG CGCGATCTCT TTGCCTGGCA GTTGGTGGGT GATGTGTTAA AAGTGGGCGC TTATGTCTTT GGTTATCTGG TGATCGCCAA AGCTTCACTG CGGTTTTATA TTCTGGCGGA AGTCAGCCAG TTCACTTTAT TGATGGTATT TGCCCACTGG CTAATCCCTA CGCACGGCGC GCTGGGCGCG GCACAGGCAT ATATGGCAAC CTATATCGTC TATTTTTCTC TTTGTTGTGG CGTGTTTTTA CTCTGGCGTA GGCGGGCATG A
Protein sequence | MSLAKASLWT AASTLVKIGA GLLVGKLLAV SFGPAGLGLA ANFRQLITVL GVLAGAGIFN GVTKYVAQYH DNPQQLRRVV GTSSAMVLGF STLMALVFVL AAAPISLGLF GNHDYQGLVR LVALVQMGIA WGNLLLALMK GFRDAAGNAL SLIVGSLIGV LAYYVSYRLG GYEGALLGLA LIPALVVIPA AVMLIKRGAI PLSYLKPSWD NGLAGQLSKF TLMTLITSVT LPVAYIMMRK LLAAQYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLTEKRDIT REVVKSLKFV LPAVAAASFT VWLLRDFAIW LLLSNKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL RFYILAEVSQ FTLLMVFAHW LIPTHGALGA AQAYMATYIV YFSLCCGVFL LWRRRA
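The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be stripped before any downstream use. A minimal sketch, using hypothetical helper names, that normalizes a grouped sequence and computes its GC fraction. It is demonstrated on the first two blocks of the gene sequence only; how the record's own GC content value was computed or rounded is not stated here.

```python
def normalize(seq: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


excerpt = "ATGTCGTTGG CAAAAGCGTC"  # first two blocks of the gene sequence
print(len(normalize(excerpt)))  # 20
print(gc_fraction(excerpt))     # 0.5
```

Applying `gc_fraction` to the full Gene sequence field gives a value that can be compared with the GC content field.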