Gene SNSL254_A3247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3247 
Symbol 
ID6483488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3160140 
End bp3161558 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content53% 
IMG OID642738545 
ProductL-arabinose/proton symport protein 
Protein accessionYP_002042267 
Protein GI194442626 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000138124 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCTCTA TTAATCATGA CTCTGCTTTA ACGCCGCGTT CGCTTCGCGA CACACGACGT 
ATGAATATGT TTGTTTCGGT TTCTGCAGCG GTTGCGGGAC TGTTATTTGG TCTGGATATC
GGCGTTATCG CCGGGGCGCT GCCTTTTATT ACCGACCATT TCGTACTGAC CAGCCGGCTG
CAGGAGTGGG TCGTCAGCAG TATGATGCTT GGCGCGGCAA TTGGCGCATT ATTTAACGGC
TGGCTTTCAT TCCGGCTGGG GCGTAAGTAT AGCCTGATGG CTGGCGCGAT TTTGTTCGTG
CTCGGCTCGC TGGGGTCGGC GTTTGCTTCC AGCGTGGAAG TATTGATTGG CGCCCGCGTG
ATACTGGGCG TAGCAGTAGG GATTGCCTCC TATACCGCGC CGCTTTATCT CTCTGAAATG
GCTAGCGAAA ATGTTCGCGG CAAAATGATC AGTATGTATC AACTGATGGT GACGTTAGGC
ATTGTGCTGG CTTTTTTATC CGATACGGCA TTCAGCTACA GCGGCAACTG GCGCGCGATG
TTGGGCGTGC TGGCGCTTCC TGCGGTGTTG CTCATTATTC TGGTGGTATT CCTGCCGAAT
AGTCCGCGTT GGCTGGCGCA AAAAGGTCGC CATATTGAAG CGGAAGAGGT GCTGCGTATG
CTGCGCGATA CCTCGGAAAA AGCCCGTGAT GAACTGAATG AGATTCGGGA AAGCCTCAAA
CTCAAGCAGG GCGGGTGGGC ATTATTTAAA GCTAACCGCA ATGTTCGCCG CGCCGTGTTC
CTCGGTATGC TGCTTCAGGC AATGCAGCAG TTCACCGGCA TGAACATCAT TATGTACTAT
GCGCCGCGCA TTTTTAAAAT GGCCGGCTTT ACCACCACGG AACAGCAAAT GATCGCTACG
CTGGTGGTCG GACTGACCTT TATGTTCGCG ACGTTTATCG CCGTCTTTAC GGTCGATAAG
GCCGGGCGTA AACCGGCGTT AAAAATCGGT TTCAGCGTAA TGGCGTTAGG GACATTGGTG
TTGGGCTACT GCCTGATGCA GTTTGATAAC GGTACGGCAT CAAGCGGTCT CTCCTGGCTT
TCCGTTGGGA TGACGATGAT GTGTATCGCC GGTTACGCGA TGAGCGCCGC TCCGGTGGTG
TGGATACTGT GTTCGGAAAT CCAGCCGCTG AAATGCCGTG ATTTTGGCAT TACCTGTTCA
ACCACGACAA ACTGGGTATC GAACATGATC ATCGGCGCGA CATTCCTGAC ACTGTTGGAC
AGCATTGGCG CGGCAGGTAC ATTCTGGCTT TACACCGCGC TGAATATCGC TTTTATCGGC
ATCACTTTCT GGCTGATTCC GGAAACCAAA AATGTCACCC TGGAGCACAT CGAACGCAAG
CTGATGGCGG GCGAGAAGCT AAGAAATATT GGCGTGTAA
 
Protein sequence
MVSINHDSAL TPRSLRDTRR MNMFVSVSAA VAGLLFGLDI GVIAGALPFI TDHFVLTSRL 
QEWVVSSMML GAAIGALFNG WLSFRLGRKY SLMAGAILFV LGSLGSAFAS SVEVLIGARV
ILGVAVGIAS YTAPLYLSEM ASENVRGKMI SMYQLMVTLG IVLAFLSDTA FSYSGNWRAM
LGVLALPAVL LIILVVFLPN SPRWLAQKGR HIEAEEVLRM LRDTSEKARD ELNEIRESLK
LKQGGWALFK ANRNVRRAVF LGMLLQAMQQ FTGMNIIMYY APRIFKMAGF TTTEQQMIAT
LVVGLTFMFA TFIAVFTVDK AGRKPALKIG FSVMALGTLV LGYCLMQFDN GTASSGLSWL
SVGMTMMCIA GYAMSAAPVV WILCSEIQPL KCRDFGITCS TTTNWVSNMI IGATFLTLLD
SIGAAGTFWL YTALNIAFIG ITFWLIPETK NVTLEHIERK LMAGEKLRNI GV