Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5226 |
Symbol | wecF |
ID | 6966797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4874027 |
End bp | 4875106 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388891 |
Product | 4-alpha-L-fucosyltransferase |
Protein accession | YP_002273311 |
Protein GI | 209396700 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0239954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTAC TGATTCACGT ACTGGGATCG GATATCCCTC ACCATAACCG AACCGTTTTG CGGTTTTTCA ATGACGCGCT GGCCGCGACG AGCGAGCACG CGCGCGAGTT TATGGTTGTT GGCAAGGACG ACGGCTTAAG TGATAGCTGT CCGGCGCTTT CTGTGCAATT TTTCCCTGGG AAAAAATCGC TGGCGGAAGC GGTCATCGCG AAAGCAAAAG CTAACCGTCA GCAGCGTTTT TTCTTCCACG GTCAGTTCAA TCCCACACTG TGGCTGGCTC TGCTGAGTGG TGGCATTAAG CCCAGCCAGT TTTACTGGCA TATCTGGGGG GCAGACCTGT ACGAGCTTTC CAGTGGCTTG AGATATAAGC TTTTTTACCC ACTACGTCGC CTGGCGCAAA AGCGAGTCGG CTGTGTATTT GCCACCCGTG GTGATTTGAG CTTTTTTGCC AAAACGCACC CAAAGGTGCG GGGCGAACTG CTGTACTTCC CGACGCGGAT GGATGCTTCT CTCAATACGA TGGCGAACGA TCGGCAACGT GAAGGGAAAA TGACCATTCT GGTGGGGAAC TCCGGCGACC GCAGCAATGA GCATATTGCT GCCTTGCGCG CCGTTCATCA GCAATTTGGC GATACGGTAA AAGTGGTGGT GCCGATGGGA TATCCGCCTC ATAACGAAGC GTACATCGAG GAAGTTCGTC AGGCGGGGCT GGAGTTATTC AGCGAAGAAA ATCTGCAAGT TCTGAGTGAA AAACTGGAAT TTGACGCCTA TCTGACGCTA CTTCGTCAGT GCGATCTTGG TTACTTTATT TTTGCCCGCC AGCAGGGCAT TGGTACGCTG TGCTTACTGA TTCAGGCGGG CATTCCTTGT GTGCTTAACC GGGAAAATCC GTTCTGGCAG GATATGACGG AACAGCATTT ACCGGTGCTG TTTACTACCG ACGATCTCAA CGAGGATATT GTGCGTGAAG CGCAGCGCCA GTTGGCGTCG GTGGATAAAA ACACCATTGC CTTCTTTAGC CCTAACTATC TACAAGGCTG GCAGCGGGCG TTGGCGATTG CCGCCGGGGA GGTCGCATGA
|
Protein sequence | MTVLIHVLGS DIPHHNRTVL RFFNDALAAT SEHAREFMVV GKDDGLSDSC PALSVQFFPG KKSLAEAVIA KAKANRQQRF FFHGQFNPTL WLALLSGGIK PSQFYWHIWG ADLYELSSGL RYKLFYPLRR LAQKRVGCVF ATRGDLSFFA KTHPKVRGEL LYFPTRMDAS LNTMANDRQR EGKMTILVGN SGDRSNEHIA ALRAVHQQFG DTVKVVVPMG YPPHNEAYIE EVRQAGLELF SEENLQVLSE KLEFDAYLTL LRQCDLGYFI FARQQGIGTL CLLIQAGIPC VLNRENPFWQ DMTEQHLPVL FTTDDLNEDI VREAQRQLAS VDKNTIAFFS PNYLQGWQRA LAIAAGEVA
|
| |