Gene SNSL254_A4207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4207 
SymbolwecF 
ID6486863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4102352 
End bp4103431 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content54% 
IMG OID642739461 
Product4-alpha-L-fucosyltransferase 
Protein accessionYP_002043164 
Protein GI194443274 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTAC TGATTCACGT CCTGGGATCG GATATCCCTC ACCATAACCA CACCGTGCTG 
CGGTTTTTCA ATGATACGCT GGCCGCCACA AGCGAGCACG CGCGCGAATT TATGGTTGCC
GGTGAAGATA ACGGCTTCAC GGAAAGCTGC CCGGCGCTCT CGCTTCGTTT TTATGGCAGT
AAGAAAGCGC TGGCGCAGGC GGTCATCGCC AAAGCGAAAG CAAATCGTCG ACAGAGATTC
TTCTTTCACG GTCAGTTCAA CACCAGCCTG TGGCTGGCGC TGTTAAGCGG CGGTATTAAG
CCAGCTCAGT TTTACTGGCA TATCTGGGGC GCGGATCTCT ACGAAGTGTC CAACGGGCTG
AAATTCCGCC TTTTCTACCC GCTTCGTCGT ATCGCGCAGG GGCGAGTAGG GTGCGTATTC
GCGACGCGCG GCGATCTCAG CTATTTTGCG CGCCAGCATC CGGACGTACG CGGCGAGTTG
CTCTATTTCC CGACGCGCAT GGATCCTTCC CTGAATGCTA TGGCAAAAGA GTGCCAACGT
GCGGGAAAAT TGACCATTTT AGTAGGGAAC TCCGGCGATC GCAGTAACCA ACATATTGCG
GCGTTACGGG CGGTGTATCA GCAGTTTGGC GACACGGTAA ACGTGGTGGT GCCGATGGGC
TATCCGGCCA ATAACCAGGA CTATATTGAT GAGGTTCGTC AGGCCGGTCT GGCGCTATTT
AGCGCCGAAA ATTTACAAAT TCTTAGCGAA AAAATGGAAT TTGATGCCTA TCTTGCGCTG
TTGCGCCAGT GCGACCTCGG TTATTTTATT TTTGCCCGCC AACAGGGGAT CGGGACGTTA
TGTCTGCTAA TTCAGGCCGA TATCCCGTGC GTACTGAATC GCGACAATCC TTTCTGGCAG
GATATGGCGG AACAGCATCT GCCCGTCCTG TTTACCACGG ACGATCTTAA TGAGCAGGTC
GTGCGCGAGG CGCAGCGTCA GCTCGCATCG GTAGATAAAA GCGGCATCAC CTTCTTTAGC
CCCAACTACC TGCAACCGTG GCATAATGCG TTGAGAATCG CCGCAGGAGA AGCCGAATGA
 
Protein sequence
MTVLIHVLGS DIPHHNHTVL RFFNDTLAAT SEHAREFMVA GEDNGFTESC PALSLRFYGS 
KKALAQAVIA KAKANRRQRF FFHGQFNTSL WLALLSGGIK PAQFYWHIWG ADLYEVSNGL
KFRLFYPLRR IAQGRVGCVF ATRGDLSYFA RQHPDVRGEL LYFPTRMDPS LNAMAKECQR
AGKLTILVGN SGDRSNQHIA ALRAVYQQFG DTVNVVVPMG YPANNQDYID EVRQAGLALF
SAENLQILSE KMEFDAYLAL LRQCDLGYFI FARQQGIGTL CLLIQADIPC VLNRDNPFWQ
DMAEQHLPVL FTTDDLNEQV VREAQRQLAS VDKSGITFFS PNYLQPWHNA LRIAAGEAE