Gene SNSL254_A1472 details

Gene Information

Locus tag: SNSL254_A1472
Symbol:
ID: 6485098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1440768
End bp: 1442003
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 46%
IMG OID: 642736863
Product: major facilitator family protein
Protein accession: YP_002040617
Protein GI: 194445599
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
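
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1440768
END_BP = 1442003


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1236 411
```

Under these assumptions the computed values agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.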


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.00286209
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCTCAGA ACAAGGCTCG CAACATGCCA TATTTGCTGG CTGTTATCTG CATTTATTTT 
AGTTACTTTC TCCACGGCAT GAGTGTTATT ACACTAGCCC AGAACATGAC CTCCCTTGCA
CAGAAATTCT CCACGGATAG TGCCGGTATC GCCTATTTAA TCTCTGGCAT TGGTCTTGGC
CGTCTGGTCA GTATTTTATT CTTTGGCGTA CTGTCCGATA AATTTGGCCG TCGGGCAATA
ATACTGCTTG GCGCCGTACT ATATATGCTG TTTTTCTTCG GTATTCCCGC CAGTCCTAAT
CTGATGATCG CTTTCATATT AGCGGTGTGT GTCGGCGTGG CGAACTCCGC GCTGGATACC
GGCGGATACC CTGCATTAAT GGAGTGTTTT CCCAAAGCAT CGGGCTCGGC AGTTATTCTG
GTTAAAGCGA TGGTCTCTTT TGGGCAAATG ATTTATCCCC TTATTGTCAG CGCCTTGTTA
GTCAACCATA TCTGGTACGG CTACGCGGTG GTAATCCCCG GTATCCTTTT CGTCCTCATC
ACGTTGATGC TGTTGAAAAG CCGTTTTCCC AGCCAACTTG TCGATGCCAG TATTGCGAAA
GAATTACCCC AGATGAACAG TTCTCCCCTC GTCTGGCTGG AAGGCGTAGC TTCCGTTTTA
TTTGGCGTCG CCGCGTTCTC AACCTTCTAT GTGATTGTGG TCTGGATGCC TAAATATGCG
ATGGCCTTCG CCGGAATGGC GGAATCCGAC GCGCTGAAAA CCATCTCTTA TTACAGTATG
GGATCGTTGG TTTGCGTGTT TATTTTTGCC GCATTGCTGA AAAAAATGGT TCGCCCTATC
TGGGCCAATG TTTTCAATGC CGGGCTGGCG ACACTCACCG CTGCGGCAAT TTACCTGTAT
CCCTCTCCAC TGATCTGTAA TGCTGGCGCC TTCGTGATTG GTTTTTCCGC TGCTGGAGGT
ATTTTACAAT TAGGTGTATC GGTAATGTCG GAATTTTTCC CTAAGAGTAA AGCTAAAGTC
ACCAGTATAT ATATGATGAT GGGGGGCGTA GCTAACTTTA TTATTCCACT GATCACCGGT
TATCTCTCTA CTATTGGCCT GCAATATATC ATTTTGTTAG ATTTTGCCTT TGCACTACTG
ACATTTATCA CCGCCATTAT TGTATTTATT CGCTATTATC GCGTATTTAA GATCCCGCAA
AACGATGTCC GATTTGGCGA GCGTTATTTC CAGTAA
 
Protein sequence
MSQNKARNMP YLLAVICIYF SYFLHGMSVI TLAQNMTSLA QKFSTDSAGI AYLISGIGLG 
RLVSILFFGV LSDKFGRRAI ILLGAVLYML FFFGIPASPN LMIAFILAVC VGVANSALDT
GGYPALMECF PKASGSAVIL VKAMVSFGQM IYPLIVSALL VNHIWYGYAV VIPGILFVLI
TLMLLKSRFP SQLVDASIAK ELPQMNSSPL VWLEGVASVL FGVAAFSTFY VIVVWMPKYA
MAFAGMAESD ALKTISYYSM GSLVCVFIFA ALLKKMVRPI WANVFNAGLA TLTAAAIYLY
PSPLICNAGA FVIGFSAAGG ILQLGVSVMS EFFPKSKAKV TSIYMMMGGV ANFIIPLITG
YLSTIGLQYI ILLDFAFALL TFITAIIVFI RYYRVFKIPQ NDVRFGERYF Q