Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5003 |
Symbol | waaA |
ID | 6967262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4655503 |
End bp | 4656780 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388684 |
Product | 3-deoxy-D-manno-octulosonic-acid transferase |
Protein accession | YP_002273111 |
Protein GI | 209400697 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00137507 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGAAT TGCTTTACAC CGCCCTTCTC TACCTTATTC AGCCGCTGAT CTGGATACGG CTCTGGGTGC GCGGACGTAA GGCTCCGGCC TATCGAAAAC GCTGGGGTGA ACGTTACGGT TTTTACCGCC ATCCGCTAAA ACCAGGCGGC ATTATGCTGC ACTCCGTCTC CGTCGGTGAA ACTCTGGCGG CGATCCCGTT GGTGCGCGCG CTGCGTCATC GTTATCCTGA TTTACCTATT ACCGTAACAA CCATGACGCC AACCGGTTCG GAGCGCGTAC AATCGGCTTT CGGGAAGGAT GTTCAGCACG TTTATCTGCC GTATGATCTG CCCGATGCAC TCAACCGTTT CCTGAATAAA GTCGACCCTA AACTGGTGTT GATTATGGAA ACCGAACTAT GGCCTAACCT GATTGCGGCG CTACATAAAC GTAAAATTCC GCTGGTGATC GCTAACGCGC GACTCTCTGC CCGCTCGGCC GCAGGTTATG CCAAACTGGG TAAATTCGTC CGTCGCTTGC TGCGTCGTAT TACGCTGATT GCCGCCCAAA ATGAAGAAGA TGGTGCACGT TTTGTGGCGC TGGGCGCGAA AAATAACCAG GTAACCGTTA CCGGTAGCCT GAAATTCGAT ATTTCTGTAA CGCCGCAGTT GGCTGCTAAA GCCGTGACGC TGCGCCGCCA GTGGGCACCA CACCGCCCGG TATGGATTGC CACCAGCACT CACGAAGGTG AAGAGAGCGT GGTCATTGCC GCACATCAGG CTTTATTACA GCAATTCCCG AATTTATTGC TCATCCTGGT ACCCCGTCAT CCAGAACGCT TCCCGGATGC GATTAACCTT GTCCGCCAGG CTGGACTAAG CTATATCACA CGCTCTTCAG GGGAAGTCCC CTCCACCAGC ACGCAGGTTG TGGTTGGCGA TACGATGGGC GAGTTGATGT TACTGTACGG CATTGCCGAT CTCGCCTTTG TTGGCGGTTC ACTGGTTGAA CGTGGTGGGC ATAATCCGCT GGAAGCTGCC GCACACGCAA TTCCGGTATT GATGGGGCCG CATACTTTTA ACTTTAAAGA CATTTGCGCG CGGCTGGAGC AGGCAAGCGG GCTGATTACC GTTACCGATG CCACTACGCT TGCAAAAGAG GTTTCCTCTT TACTCACCGA CGCCGATTAC CGTAGTTTCT ATGGCCGTCA TGCCGTTGAA GTACTGTATC AAAACCAGGG CGCGCTACAG CGTCTGCTTC AACTGCTGGA ACCTTACCTG CCACCGAAAA CGCATTGA
|
Protein sequence | MLELLYTALL YLIQPLIWIR LWVRGRKAPA YRKRWGERYG FYRHPLKPGG IMLHSVSVGE TLAAIPLVRA LRHRYPDLPI TVTTMTPTGS ERVQSAFGKD VQHVYLPYDL PDALNRFLNK VDPKLVLIME TELWPNLIAA LHKRKIPLVI ANARLSARSA AGYAKLGKFV RRLLRRITLI AAQNEEDGAR FVALGAKNNQ VTVTGSLKFD ISVTPQLAAK AVTLRRQWAP HRPVWIATST HEGEESVVIA AHQALLQQFP NLLLILVPRH PERFPDAINL VRQAGLSYIT RSSGEVPSTS TQVVVGDTMG ELMLLYGIAD LAFVGGSLVE RGGHNPLEAA AHAIPVLMGP HTFNFKDICA RLEQASGLIT VTDATTLAKE VSSLLTDADY RSFYGRHAVE VLYQNQGALQ RLLQLLEPYL PPKTH
|
| |