Gene SNSL254_A0424 details


Gene Information

Locus tag: SNSL254_A0424
Symbol:
ID: 6483007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 440004
End bp: 441215
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 52%
IMG OID: 642735847
Product: major facilitator superfamily MFS_1
Protein accession: YP_002039621
Protein GI: 194442257
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
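
The Start bp, End bp, Gene Length and Protein Length values listed above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both points are assumptions here, not something the record states, and the function names are illustrative only. A minimal sketch of that arithmetic in Python:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(440004, 441215)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```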


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.477286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCCT GGAAAGTTAA TCTAATTTCC GTTTGGTTTG GATGTTTTTT TACCGGGCTG 
GCAATCAGCC AAATCCTGCC ATTCTTACCC CTTTATATTT CCCAGCTTGG CGTCTCTTCC
CATGAAGCGT TATCAATGTG GTCCGGGTTA ACGTTTAGCA TCACGTTTCT TATTTCCGCC
ATTGTGTCGC CGATGTGGGG CAGTCTTGCC GATCGTAAAG GGCGTAAACT GATGCTATTG
CGCGCGTCGC TCGGGATGGC GATAGCTATT CTACTGCAGG CATTTGCGAC CCATGTCTGG
CAACTTTTCC TGCTGCGCGG AATCATGGGG TTAACGTCAG GCTATATCCC CAATGCCATG
GCGCTGGTAG CCTCTCAGGT ACCGCGCGAA CGTAGCGGCT GGGCGCTCAG TACGCTTTCT
ACCGCGCAGA TCAGCGGCGT TATCGGCGGG CCGTTAATGG GCGGCTTTGT TGCGGATCAT
ATCGGGCTGC GGGCGGTATT TCTGATTACC GCCATGCTGT TGGTGGTGAG CTTTCTGGTC
ACGCTATTTT TAATTAAAGA AGGCGTGCGT CCGGTCATCA GGAAAAGCGA ACGCTTGAGC
GGTAAAGCCG TTTTTGCGTC GTTACCTTAT CCTGCGCTGG TGATCAGTTT GTTTTTTACC
ACGATGGTCA TTCAACTCTG TAATGGTTCC ATCAGTCCAA TCCTGGCGCT GTTTATCAAA
TCAATGATGC CGGACAGTAA TAACATCGCC TTTCTTAGCG GGTTAATCGC CTCGGTGCCC
GGTATCTCTG CGCTTATCTC CGCGCCTCGC CTGGGAAAAC TTGGCGACAG AATCGGCACG
GAAAGAATTC TGATGGCCAC GCTTATCTGC GCAGTGCTGC TTTTCTTCGC GATGTCCTGG
GTCACTACGC CGTTCCAGTT GGGGCTGTTG CGTTTCTTGT TAGGCTTTGC CGATGGCGCG
ATGTTACCCG CCGTACAAAC GTTATTGGTG AAATACTCCA GCGACCAAAT TACCGGACGT
ATTTTTGGCT ACAACCAGTC ATTTATGTAC CTGGGCAACG TGGTTGGGCC GTTGATGGGC
GCGACGGTAT CGGCGATGGC CGGTTTCCGC TGGGTTTTTA TCGCTACGGC GGCGATCGTG
TTGATCAATA TTGGACAACT GACCCTGGCG TTACGTCGTC GGCGTAACGC GCAAAAAGCG
AAAGGCCAAT AG
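
A GC content figure like the one reported for this gene can be recomputed from a sequence laid out in space-separated 10-base groups, as above. The sketch below is one plausible way to do that in Python; the function name is hypothetical, and the rounding the source database applies is not known.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base groups of the gene sequence, used only as a small input.
print(gc_fraction("ATGGAATCCT GGAAAGTTAA"))
```

Passing the full gene sequence, including its line breaks, works the same way because all whitespace is stripped before counting.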
 
Protein sequence
MESWKVNLIS VWFGCFFTGL AISQILPFLP LYISQLGVSS HEALSMWSGL TFSITFLISA 
IVSPMWGSLA DRKGRKLMLL RASLGMAIAI LLQAFATHVW QLFLLRGIMG LTSGYIPNAM
ALVASQVPRE RSGWALSTLS TAQISGVIGG PLMGGFVADH IGLRAVFLIT AMLLVVSFLV
TLFLIKEGVR PVIRKSERLS GKAVFASLPY PALVISLFFT TMVIQLCNGS ISPILALFIK
SMMPDSNNIA FLSGLIASVP GISALISAPR LGKLGDRIGT ERILMATLIC AVLLFFAMSW
VTTPFQLGLL RFLLGFADGA MLPAVQTLLV KYSSDQITGR IFGYNQSFMY LGNVVGPLMG
ATVSAMAGFR WVFIATAAIV LINIGQLTLA LRRRRNAQKA KGQ