Gene SNSL254_A1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1001 
Symbol 
ID6485079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1010079 
End bp1011227 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content52% 
IMG OID642736407 
Productputative MFS family transporter protein 
Protein accessionYP_002040166 
Protein GI194444300 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT ATACCCGTCC CGTCATGCTT TTGCTGTGCG GGCTACTTTT GTTGACTCTG 
GCCATTGCGG TACTGAATAC GCTTGTGCCG CTGTGGCTTG CTCAGGCAAA CCTTCCGACC
TGGCAGGTGG GGATGGTCAG CTCGTCTTAT TTTACCGGCA ATCTGGTCGG GACGTTATTT
ACCGGGTATT TAATTAAACG CATTGGGTTT AACCGTAGCT ATTATCTTGC CTCGCTGATC
TTCGCCGCGG GTTGTGTCGG ATTGGGGGTG ATGGTGGGGT TCTGGAGCTG GATGAGCTGG
CGTTTTATTG CCGGTATCGG CTGCGCCATG ATTTGGGTGG TTGTCGAGAG CGCGTTGATG
TGCAGCGGAA CCTCGCATAA TCGCGGGCGC CTGCTGGCTG CCTATATGAT GGTCTATTAT
ATGGGGACCT TCCTTGGACA ATTATTGGTC AGTAAAGTAT CTGGTGAATT GCTGCACGTT
CTTCCCTGGG TGACCGGAAT GATTCTGGCG GGAATTCTGC CGCTACTCTT TACCCGAATT
GTAAATCAGC AAACGCAGGC ACGTCATTCC TCTTCTATTA GCGCCATGCT GAAGCTACGC
CAGGCGCGTC TTGGCGTGAA TGGTTGTATT ATTTCCGGCA TTGTTCTTGG TTCATTATAT
GGCCTGATGC CGTTATATCT GAAGCATCAG GGGATGGCTA ACGCCAGCAT CGGTTTCTGG
ATGGCGGTGC TGGTGAGCGC CGGCATTTTG GGGCAATGGC CAATGGGACG TCTGGCGGAC
AAATTTGGTC GCTTGCTGGT GTTACGCGTA CAGGTATTCG TTGTCATACT CGGTAGTATT
GCCATGTTAA CCCAGGCGGC GATGGCGCCA GCTCTGTTTA TTCTGGGGGC GGCGGGTTTT
ACGCTTTATC CCGTTGCAAT GGCCTGGGCC TGTGAAAAAG TCGAACACCA CCAGCTTGTG
GCAATGAACC AGGCGCTGTT GTTAAGTTAT ACGGTAGGGA GCCTGTTGGG GCCGTCTTTT
GCTGCGATGT TAATGCAGAA TTATTCAGAT AATCTGCTGT TTATTATGAT CGCCAGCGTA
TCGTTTATTT ATCTGCTGAT GCTGTTACGT AACGCCGGCC AGACGCCTAA TCCTGTCGCC
CACATCTAA
 
Protein sequence
MSTYTRPVML LLCGLLLLTL AIAVLNTLVP LWLAQANLPT WQVGMVSSSY FTGNLVGTLF 
TGYLIKRIGF NRSYYLASLI FAAGCVGLGV MVGFWSWMSW RFIAGIGCAM IWVVVESALM
CSGTSHNRGR LLAAYMMVYY MGTFLGQLLV SKVSGELLHV LPWVTGMILA GILPLLFTRI
VNQQTQARHS SSISAMLKLR QARLGVNGCI ISGIVLGSLY GLMPLYLKHQ GMANASIGFW
MAVLVSAGIL GQWPMGRLAD KFGRLLVLRV QVFVVILGSI AMLTQAAMAP ALFILGAAGF
TLYPVAMAWA CEKVEHHQLV AMNQALLLSY TVGSLLGPSF AAMLMQNYSD NLLFIMIASV
SFIYLLMLLR NAGQTPNPVA HI