Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4157 |
Symbol | wecF |
ID | 6144964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4257172 |
End bp | 4258251 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618980 |
Product | 4-alpha-L-fucosyltransferase |
Protein accession | YP_001746112 |
Protein GI | 170682089 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.792181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTAC TGATTCACGT ACTGGGATCG GATATCCCTC ACCATAACCG AACCGTTTTG CGGTTTTTCA ATGACGCGCT GGCCGCGACG AGCGAGCACG CGCGCGAGTT TATGGTTGTT GGCAAGGACG ACGGTTTAAG TGATAGCTGT CCGACGCTTT CTGTGCAATT TTTCCCTGGG AAAAAATCGC TGGCGGAAGC GGTCATCGCG AAAGCAAAAG CTAACCGTCA GCAGCGTTTT TTTTTCCACG GTCAGTTCAA TCCCACACTG TGGCTGGCTC TGCTGAGTGG TGGTATTAAG CCCAGCCAGT TTTACTGGCA TATCTGGGGG GCAGACCTGT ACGAGCTTTC CAGTGGCTTG AGATATAAGC TTTTTTACCC ACTACGTCGC CTGGCGCAAA AGCGAGTCGG CTGTGTATTT GCCACCCGCG GTGATTTGAG CTTTTTTGCC AAAACGCACC CAAAGGTGCG GGGCGAACTG CTGTACTTCC CGACGCGGAT GGATCCTTCT CTCAATACGA TGGCGAACGA TCGGCAACGT GAAGGAAAAA TGACCATTCT GGTGGGCAAC TCCGGCGACC GCAGCAATGA GCATATTGCT GCCTTGCGCG CCGTTCATCA GCAATTTGGC GATACGGTAA AAGTGGTGGT GCCGATGGGA TATCCGCCTA ATAACGAAGC GTACATCGAG GAAGTTCGTC AGGCGGGGCT GGAGTTATTC AGCGAAGAAA ATCTGCAAGT TCTGAGCGAA AAACTGGAAT TTGACGCCTA TCTGACGCTA CTTCGTCAGT GCGATCTTGG TTACTTTATT TTTGCCCGCC AGCAGGGCAT TGGTACGCTG TGCTTACTGA TTCAGGCGGG CATTCCTTGT GTGCTTAACC GGGAAAATCC GTTCTGGCAG GATATGACGG AACAACATTT GCCGGTGCTG TTTACTACCG ACGATCTCAA CGAGGATATT GTGCGTGAAG CGCAGCGCCA GCTGGCGTCG GTGGATAAAA ACACCATTGC CTTCTTTAGC CCTAACTATC TACAAGGCTG GCAGCGGGCG TTGGCGATTG CCACCGGGGA GGTCGCATGA
|
Protein sequence | MTVLIHVLGS DIPHHNRTVL RFFNDALAAT SEHAREFMVV GKDDGLSDSC PTLSVQFFPG KKSLAEAVIA KAKANRQQRF FFHGQFNPTL WLALLSGGIK PSQFYWHIWG ADLYELSSGL RYKLFYPLRR LAQKRVGCVF ATRGDLSFFA KTHPKVRGEL LYFPTRMDPS LNTMANDRQR EGKMTILVGN SGDRSNEHIA ALRAVHQQFG DTVKVVVPMG YPPNNEAYIE EVRQAGLELF SEENLQVLSE KLEFDAYLTL LRQCDLGYFI FARQQGIGTL CLLIQAGIPC VLNRENPFWQ DMTEQHLPVL FTTDDLNEDI VREAQRQLAS VDKNTIAFFS PNYLQGWQRA LAIATGEVA
|
| |