Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_2343 |
Symbol | wcaI |
ID | 5590874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2307534 |
End bp | 2308757 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640926008 |
Product | putative glycosyl transferase |
Protein accession | YP_001463403 |
Protein GI | 157157792 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGTAT CGGCAAATAC ACCGGCGAGA TGGTGGAATG GCTGGCGGCA CAAGGTCATG AGGTGCGGGT TATTACCGCA CCGCCTTACT ACCCGCAGTG GCAGGTGGGT GAGAACTATT CCGCCTGGCG CTACAAACGA GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTATG TGCCAAAACA GCCGAGCACC CTGAAACGCC TGTTGCATCT CGGCAGTTTT GCCGTCAGCA GTTTCTTTCC ACTGATGGCG CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTAGTGC CAACGCTGTT TTGCACGCCG GGAATGCGCC TGCTGGCGAA GCTCTCTGGT GCGCGTACCG TGCTGCATAT TCAGGATTAC GAAGTGGACG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA CAGCTGGCAA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT TTCCACGATT TCGCGTTCGA TGATGAATAA AGCCATCGAA AAAGGCGTGG CGGCGGATAA CGTCATCTTC TTCCCCAACT GGTCGGAAAT TGCCCGTTTT CAACATGTTG CAGACGCCGA TGTTGATGCC CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCGTCTGCG CGATGAACCG CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAG CAGCGTGGAC TGCGCAACAT GCAATTTTTC CCGCTGCAAT CGTATGACGC TTTACCCGCA CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA TTGCCGTCGA AACTGACCAA TATTCTGGCG GTAGGCGGTA ACGCGGTGAT TACCGCAGAG GCCCACACAG AACTGGGGCA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTAGAA CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATCCGTCAGG CGCTCCTGCT GCCCAAACAC AACACGGTGG CACGTGAATA TGCCGAACGC ACACTCGATA AAGAGAACGT GTTACGTCAA TTTATAAATG ATATTCGGGG ATAA
|
Protein sequence | MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI SRSMMNKAIE KGVAADNVIF FPNWSEIARF QHVADADVDA LRNQLGLPDN KKIILYSGNI GEKQGLENVI EAADRLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG
|
| |