Gene Information |
Locus tag | EcSMS35_1012 |
Symbol | wcaI |
ID | 6146917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1029821 |
End bp | 1031044 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615899 |
Product | putative glycosyl transferase |
Protein accession | YP_001743091 |
Protein GI | 170683742 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
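The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1029821
END_BP = 1031044
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1224
    print(protein_length(GENE_LENGTH_BP))  # 407
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.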
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC ACCGGCGAGA TGGTGGAATG GCTGGCGGCA GAAGGTCATG AGGTGCGGGT TATTACCGCA CCGCCTTACT ACCCGCAATG GCAGGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGG GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGCCAGCA GTTTCTTTCC GCTGATGGCG CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTGGTAC CAACGCTGTT TTGCACGCCG GGAATGCGCC TGCTGGTGAA ACTGTCTGGT GCGCGCACGG TGCTACATAT TCAGGATTAC GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAA GTAAAGGCGG CAAAGTGGCA CAGCTGGCGA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT TCGCGCTCGA TGATGAATAA AGCTATCGAA AAAGGCGTGG CGGCGGAAAA CGTCATCTTC TTCCCCAACT GGTCGGAAAT CGCCCGTTTT CAACATGTTG CAGACGCCGA TGTTGACGCC CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCGCCTGCG CGATGAACCG CTTATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTTGAAAA AATGGCGCAG CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA CTGCTGAAGA TGGGGGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA TTGCCGTCGA AACTGACCAA TATTCTGGCG GTAGGCGGTA ACGCGGTGAT TACTGCTGAA GCCCACACAG AACTGGGACA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATTCGTCAGG CGCTCCTGCT GCCCAAACAC AACACGGTGG CACGTGAATA TGCCGAACGC ACGCTCGATA AAGAGAACGT GTTACGTCAA TTTATAAATG ATATTCGGGG ATAA
|
Protein sequence | MKILVYGINY SPELTGIGKY TGEMVEWLAA EGHEVRVITA PPYYPQWQVG ENYSAWRYKR EEGAATVWRC PLYVPKQPST LKRLLHLGSF AASSFFPLMA QRRWKPDRII GVVPTLFCTP GMRLLVKLSG ARTVLHIQDY EVDAMLGLGL AGKSKGGKVA QLATAFERSG LHNVDNVSTI SRSMMNKAIE KGVAAENVIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG
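The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch only: the helper names are assumptions, the codon table is the standard amino acid assignment (which translation table 11 shares for elongation codons), and alternative start codons are not handled because this gene begins with ATG. Only the first and last fragments of the gene sequence are used here. Pasting the full sequence into `gc_content` should give a value close to the 56% reported in the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalize case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 24 bases of the gene sequence above.
    print(translate("ATGAAAATAC TGGTCTACGG CATTAACTAC"))  # MKILVYGINY
    print(translate("TTTATAAATG ATATTCGGGG ATAA"))        # FINDIRG
```

The two fragments translate to the first ten and last seven residues of the protein sequence listed above, which is consistent with the gene being read in frame from its ATG start to its TAA stop.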