Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4156 |
Symbol | |
ID | 6145786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4255925 |
End bp | 4257175 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618979 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001746111 |
Protein GI | 170680789 |
COG category | [R] General function prediction only |
COG ID | [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTGG CAAAAGCGTC CTTGTGGACG GCGGCCAGTA CACTGGTCAA GATTGGTGCC GGGTTACTGG TCGGTAAGTT GCTGGCAGTG TCATTTGGTC CGGCGGGGCT TGGGCTGGCG GCAAATTTCC GCCAGTTGAT TACCGTGCTC GGCGTGCTTG CCGGGGCAGG CATCTTTAAC GGCGTAACCA AATACGTTGC CCAGTACCAT GATAATCCCC AACAATTGCG CCGCGTGGTC GGCACTTCAT CAGCGATGGT ACTTGGCTTC TCTATGCTAA TGGCGCTGGT TTTTGTGCTG GCAGCTGCGC CAATCAGCCA GGGATTGTTC GGTAATACCG ACTATCAGGG GCTGGTGCGT TTAGTGGCGT TGGTGCAAAT GGGGATCGCC TGGGGCAACC TGTTACTGGC GTTGATGAAA GGCTTTCGCG ATGCCGCAGG TAATGCGTTA TCCCTGATTG TCGGCAGCCT GATTGGCGTT CTTGCGTACT ACGTCAGTTA CCGTTTGGGC GGTTATGAAG GGGCGTTGCT GGGGCTGGCG TTGATTCCCG CGCTGGTGGT GATTCCTGCC GTCATCATGC TAATTAAGCG TGGTGCAATC CCGTTAAGCT ATCTGAAACC CAGCTGGGAT AACGGCCTGG CAGGGCAGTT GAGCAAATTT ACGCTCATGG CGTTGATTAC GTCGGTAACC TTGCCTGTTG CTTACATCAT GATGCGTAAA CTGCTGGCGG CGCAGTATAG CTGGGATGAA GTGGGGATCT GGCAAGGGGT GAGCAGTATT TCCGATGCCT ACCTGCAATT TATTACGGCA TCGTTCAGCG TATATTTGCT GCCCACGTTG TCGCGGCTAA CGGAAAAGCG CGATATCACC CGGGAAGTGG TTAAATCGCT GAAATTCGTC TTACCGGCAG TGGCGGCGGC GAGTTTTTCC GTCTGGTTGC TGCGTGATTT TGCTATCTGG CTGCTGTTGT CGAATAAATT TACCGCTATG CGCGATCTCT TTGCCTGGCA GTTGGTAGGT GATGTGTTAA AAGTGGGCGC TTATGTCTTT GGTTATCTGG TGATCGCCAA AGCGTCACTG CGGTTTTATA TTCTGGCGGA AGTCAGCCAG TTCACTTTAT TGATGGTATT TTCCCACTGG CTAATCCCTG CGCATGGTGC ACTGGGCGCG GCGCAGGCCT ATATGGCAAC CTATATCGTC TATTTTTCTC TTTGTTGTGG CGTGTTTTTA CTCTGGCGTA GGCGGGCATG A
|
Protein sequence | MSLAKASLWT AASTLVKIGA GLLVGKLLAV SFGPAGLGLA ANFRQLITVL GVLAGAGIFN GVTKYVAQYH DNPQQLRRVV GTSSAMVLGF SMLMALVFVL AAAPISQGLF GNTDYQGLVR LVALVQMGIA WGNLLLALMK GFRDAAGNAL SLIVGSLIGV LAYYVSYRLG GYEGALLGLA LIPALVVIPA VIMLIKRGAI PLSYLKPSWD NGLAGQLSKF TLMALITSVT LPVAYIMMRK LLAAQYSWDE VGIWQGVSSI SDAYLQFITA SFSVYLLPTL SRLTEKRDIT REVVKSLKFV LPAVAAASFS VWLLRDFAIW LLLSNKFTAM RDLFAWQLVG DVLKVGAYVF GYLVIAKASL RFYILAEVSQ FTLLMVFSHW LIPAHGALGA AQAYMATYIV YFSLCCGVFL LWRRRA
|
| |