Gene SNSL254_A1657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1657 
Symbol 
ID6483557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1620855 
End bp1622096 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content49% 
IMG OID642737042 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002040794 
Protein GI194444944 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.992815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTA ATGTCTATGA GAACACCGAC AGCGAAACTA TCACCCCGCT CAACAAGCGG 
CGTATTTTGC CTGTTTTCCT GCTTGTCGGC CTTTACGCCG CCAGTACAGC GGCTGTAATG
TCGGTACTGC CTTTTTATAT CCGCGAGATG GGCGGTTCGC CGCTTATCAT TGGAATCATC
ATCGCCACTG AAGCTTTTAG CCAATTTTGT GCGGCGCCCC TGATTGGCCA CCTTTCCGAT
CGCGTTGGCC GCAAGCGAAT ATTGATTGTC ACGCTGGCTA TTGCGGCGAT AAGTTTACTA
TTACTCGCCA ACGCGCAATG TATCCTGTTT ATCCTGCTCG CCCGCACGCT TTTTGGCATT
AGCGCCGGGA ATTTGTCAGC CGCCGCAGCC TATATTGCCG ATTGTACGCA CGTCAGAAAT
CGGCGTCAGG CAATCGGTAT CCTCACAGGC TGCATTGGTT TAGGCGGTAT TGTCGGGGCA
GGCGTTTCCG GGTGGTTATC GCGTATCAGT CTGAGCGCGC CGATCTACGC CGCCTTTATA
CTTGTCCTTG GGTCTGCCCT GGTCGCGATT TGGGGGTTAA AAGACCCTTC CACAACATCA
CGTACCACAG ATAAAATAGC GGCGTTCTCT GCCCGCGCTA TTTTAAAGAT GCCTGTCCTT
CGCGTCTTAA TCATCGTAAT GCTTTGTCAT TTCTTCGCCT ATGGCATGTA CTCTTCACAA
TTACCTGTTT TTCTTTCTGA CACCTTCATC TGGAATGGGC TTCCCTTTGG GCCAAAAGCG
TTAAGCTATC TGTTAATGGC GGACGGGGTT ATTAATATTT TCGTTCAGCT ATTTCTGTTA
GGTTGGGTGA GCCAATATTT TTCGGAGCGA AAGCTAATTA TCCTCATCTT CGCCCTTCTT
TGTACTGGAT TTCTCACTGC GGGTATCGCC ACGACCATAC CTGTGCTTGT TTTTGCTATC
GTTTGTATTA GCATCGCTGA TGCGCTAGCC AAACCCACTT ATCTTGCCGC CTTGTCCGTC
CATGTATCGC CTGCCCGACA AGGTATCGTC ATCGGAACGG CGCAGGCATT AATCGCAATC
GCTGATTTTA TATCCCCCGT ATTGGGCGGA TTTGTCCTGG GTTATGCTCT GTATGGCGTC
TGGATCGGTA TAGCTATCTC TGTCGCCATT ATTGGTCTGG TGACGGCAAT GATTTACCTT
TCAAAAAGTT CACCGCTAAT AGTGAAACCA GAAACAGAAT AA
 
Protein sequence
MNTNVYENTD SETITPLNKR RILPVFLLVG LYAASTAAVM SVLPFYIREM GGSPLIIGII 
IATEAFSQFC AAPLIGHLSD RVGRKRILIV TLAIAAISLL LLANAQCILF ILLARTLFGI
SAGNLSAAAA YIADCTHVRN RRQAIGILTG CIGLGGIVGA GVSGWLSRIS LSAPIYAAFI
LVLGSALVAI WGLKDPSTTS RTTDKIAAFS ARAILKMPVL RVLIIVMLCH FFAYGMYSSQ
LPVFLSDTFI WNGLPFGPKA LSYLLMADGV INIFVQLFLL GWVSQYFSER KLIILIFALL
CTGFLTAGIA TTIPVLVFAI VCISIADALA KPTYLAALSV HVSPARQGIV IGTAQALIAI
ADFISPVLGG FVLGYALYGV WIGIAISVAI IGLVTAMIYL SKSSPLIVKP ETE