Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3961 |
Symbol | waaW |
ID | 6145798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4039379 |
End bp | 4040407 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641618787 |
Product | lipopolysaccharide 1,2-galactosyltransferase |
Protein accession | YP_001745926 |
Protein GI | 170682179 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0769884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAT TAGCTGAGAG TATTACTGAA GTCGCTGTCT CTGGGGAAAT TGCTAACACC GATCGTGTGT TAAATATCGC TTACGGTATT GACCGCAACT TTTTATTTGG TGCAGCAGTA TCTATGCAAT CAGTTGTTAT GCATAACCCG GATCTTGCGG TTAAGTTTCA TCTCTTTACT GACTACATTG ATGAAGACTA TCTACAACGT GTTAATGCTT TTACCAGCAA AAATGCTAAC GTTGAAGTAA GAATTTATAA AGTATCCAGT GCCTTTATTG ATATCTTCCC CAGCCTGAAA CAGTGGTCTT ATGCAACATT CTTCCGTTTA GTTGCGTTCC AGTATCTGAG TGAAACTATT GAAAATCTGT TATATATCGA TGCTGATGTC ATCTGTAAAG GCTCATTAGC TGGATTGCTT GATATTAATT TTGATGGCGA TAAATTCGCA GCTGTTATTA AAGATGTGCC TTTTATGCAG GAAAAACCAG CGAAGCGTCT GGCTATAGAG GGGCTTCCAG GGAATTATTT CAACGCCGGT GTAGTATATC TGCAGCTTGA AGCGTGGGCG AAAAATGATT TTATGAATAA AGCCATTGCT ATGCTGGCAA GTGACCCGCA GCACACGAAG TATAAATGCC TTGATCAGGA TATTTTAAAT ATTCTGTTCT TTGGTCATTG TATTTTTATT AGCGGCGATT ATGATTGCTT TTATGGCATT GACTATGAGT TAAAAAATAA AAGCGATGAA GATTATAAAA AGACCATTAC CGATGATACT AAGCTGATTC ATTATGTTGG CGTAACGAAG CCCTGGAACG ACTGGACGAA TTATCCCTGC CAGAAGTATT TTAATGAGGC TTATCAGGCT TCTTGCTGGA ATGATGTGGC GTTTATTCCA GCCACGAATG AAAAGCAGTA TCAAGTGAAA TACCAACATG CAAAGAAAAA TGGTGATACG TTTAACGCTT TTATTTACTT CATTAAATTC AAATTAAATA AGTATAAAAG AAAACTATTT AGGCAATAA
|
Protein sequence | MDLLAESITE VAVSGEIANT DRVLNIAYGI DRNFLFGAAV SMQSVVMHNP DLAVKFHLFT DYIDEDYLQR VNAFTSKNAN VEVRIYKVSS AFIDIFPSLK QWSYATFFRL VAFQYLSETI ENLLYIDADV ICKGSLAGLL DINFDGDKFA AVIKDVPFMQ EKPAKRLAIE GLPGNYFNAG VVYLQLEAWA KNDFMNKAIA MLASDPQHTK YKCLDQDILN ILFFGHCIFI SGDYDCFYGI DYELKNKSDE DYKKTITDDT KLIHYVGVTK PWNDWTNYPC QKYFNEAYQA SCWNDVAFIP ATNEKQYQVK YQHAKKNGDT FNAFIYFIKF KLNKYKRKLF RQ
|
| |