Gene SeSA_A0436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0436 
Symbol 
ID6518734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp448444 
End bp449655 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content52% 
IMG OID642745587 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002113411 
Protein GI194736766 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.263395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.769414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCCT GGAAAGTTAA TCTAATTTCC GTTTGGTTTG GATGTTTTTT TACCGGGCTG 
GCAATCAGCC AAATCCTGCC ATTTTTACCC CTTTATATTT CCCAGCTTGG CGTCTCTTCC
CATGAAGCGT TATCAATGTG GTCCGGGTTA ACGTTTAGCA TCACGTTTCT TATTTCCGCC
ATTGTGTCGC CGATGTGGGG CAGTCTCGCC GATCGTAAAG GGCGTAAACT GATGCTATTG
CGCGCGTCGC TCGGGATGGC GATAGCTATT CTACTGCAGG CATTCGCGAC CCATGTCTGG
CAACTTTTCC TGCTGCGCGG AATCATGGGG TTAACGTCAG GCTATATCCC CAATGCCATG
GCGCTGGTAG CCTCTCAGGT ACCGCGCGAA CGTAGCGGCT GGGCGCTCAG TACGCTTTCT
ACCGCGCAAA TCAGCGGCGT TATCGGCGGG CCGTTAATGG GCGGCTTTGT CGCGGATCAT
ATCGGGCTGC GGGCGGTATT TCTGATTACC GCCATGCTGT TGGTGGTGAG CTTTCTGGTC
ACGCTATTTT TAATTAAAGA AGGCGTACGT CCGGTCATCA GGAAAAGCGA ACGCCTGAGC
GGTAAAGCCG TTTTTGCGTC GTTACCTTAT CCTGCGCTGG TGATCAGTTT GTTTTTTACC
ACGATGGTCA TTCAACTCTG TAATGGTTCC ATCAGTCCAA TCCTGGCGCT GTTTATCAAA
TCAATGATGC CGGACAGTAA TAACATCGCC TTTCTTAGCG GGTTAATTGC CTCGGTGCCC
GGTATCTCCG CGCTTATCTC CGCGCCTCGC CTGGGAAAAC TTGGCGACAG AATCGGCACG
GAAAGAATTC TGATGGCCAC GCTTATCTGC GCAGTGCTGC TTTTCTTCGC GATGTCCTGG
GTCACCACGC CGTTCCAGTT GGGGCTGTTG CGTTTCTTGT TAGGCTTTGC CGATGGCGCG
ATGTTACCCG CCGTACAAAC GTTATTGGTG AAATACTCCA GCGATCAAAT TACCGGACGT
ATTTTTGGTT ACAACCAGTC ATTTATGTAC CTGGGCAACG TGGTTGGGCC GTTGATGGGC
GCGACGGTAT CGGCGATGGC CGGTTTCCGC TGGGTTTTTA TCGCTACGGC GGCGATCGTG
TTGATCAATA TTGGGCAACT GACCCTGGCG TTACGTCGTC GGCGTAACGC GCAAAAAGCG
AAAGGCCAAT AG
 
Protein sequence
MESWKVNLIS VWFGCFFTGL AISQILPFLP LYISQLGVSS HEALSMWSGL TFSITFLISA 
IVSPMWGSLA DRKGRKLMLL RASLGMAIAI LLQAFATHVW QLFLLRGIMG LTSGYIPNAM
ALVASQVPRE RSGWALSTLS TAQISGVIGG PLMGGFVADH IGLRAVFLIT AMLLVVSFLV
TLFLIKEGVR PVIRKSERLS GKAVFASLPY PALVISLFFT TMVIQLCNGS ISPILALFIK
SMMPDSNNIA FLSGLIASVP GISALISAPR LGKLGDRIGT ERILMATLIC AVLLFFAMSW
VTTPFQLGLL RFLLGFADGA MLPAVQTLLV KYSSDQITGR IFGYNQSFMY LGNVVGPLMG
ATVSAMAGFR WVFIATAAIV LINIGQLTLA LRRRRNAQKA KGQ