Gene SeSA_A1649 details

Gene Information

Locus tag: SeSA_A1649
Symbol:
ID: 6516737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 1588857
End bp: 1590098
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 49%
IMG OID: 642746759
Product: major facilitator superfamily MFS_1
Protein accession: YP_002114562
Protein GI: 194736011
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
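
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1588857, 1590098)
print(length)                  # 1242
print(protein_length(length))  # 413
```

Both results agree with the Gene Length and Protein Length values listed in the record.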


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATACTA ATGTCTATGA GAACACCGAC AGCGAAACTA TCACCCCGCT CAACAAGCGG 
CGTATTTTGC CTGTTTTCCT GCTTGTCGGC CTTTACGCCG CCAGTACAGC GGCAGTGATG
TCGGTACTGC CTTTTTATAT CCGCGAGATG GGCGGTTCGC CGCTTATCAT TGGAATCATC
ATCGCCACTG AGGCTTTTAG CCAATTTTGT GCGGCGCCCC TGATTGGCCA CCTTTCCGAT
CGCGTTGGCC GCAAGCGAAT ACTGATTATC ACGCTGACTA TTGCGGCGAT AAGTTTACTA
TTACTCGCCA ACGCGCAATG TATCCTGTTT ATCCTGCTCG CCCGCACGCT TTTTGGCATT
AGCGCCGGGA ATTTGTCAGC CGCCGCAGCC TATATTGCCG ATTGTACGCA CGTCAGAAAT
CGGCGCCAGG CAATCGGTAT CCTCACAGGC TGCATTGGTT TAGGCGGTAT TGTCGGGGCA
GGCGTTTCAG GGTGGCTATC GCGTATCAGT CTGAGCGCGC CGATCTACGC CGCCTTTATA
CTTGTCCTTG GGTCTGCCCT GGTCGCGATT TGGGGGTTAA AAGACCCGTC CACAACATCA
CGTACCGCAG ATAAAATAGC GGCGTTATCT GCCCGCGCTA TTTTAAAGAT GCCAGTCCTT
CGCGTCTTAA TCATCGTAAT GCTTTGTCAT TTCTTCGCCT ATGGCATGTA CTCTTCACAA
TTACCTGTTT TTCTTTCTGA CACCTTCATC TGGAATGGGC TTCCCTTTGG GCCAAAAGCG
TTAAGCTACC TGTTAATGGC GGACGGGGTT ATTAATATTT TCGTTCAGCT ATTTCTGTTA
GGTTGGGTGA GCCAATATTT TTCGGAGCGA AAGCTAATTA TTCTAATCTT CGCCCTTCTT
TGTACTGGAT TTCTCACTGC GGGTATCGCC ACGACCATAC CAGTGCTTAT TTTTGCTATC
GTTTGTATTA GTATCGCTGA TGCGCTAGCC AAACCCACTT ATCTTGCCGC CTTGTCCGTC
CATGTATCGC CTGCCCGACA AGGTATCGTC ATCGGAACGG CGCAGGCATT AATCGCAATC
GCTGATTTTA TATCCCCCGT ATTGGGCGGA TTTGTCCTGG GTTATGCCCT GTATGGTGTC
TGGATCGGTA TAGCTATCTC TGTCGCCATT ATTGGTCTGG TGACGGCAAT GATTTACCTT
TCAAAAAGTT CAGTGCTAAT AGCGAAACCA GAAACAGAAT AA
 
Protein sequence
MNTNVYENTD SETITPLNKR RILPVFLLVG LYAASTAAVM SVLPFYIREM GGSPLIIGII 
IATEAFSQFC AAPLIGHLSD RVGRKRILII TLTIAAISLL LLANAQCILF ILLARTLFGI
SAGNLSAAAA YIADCTHVRN RRQAIGILTG CIGLGGIVGA GVSGWLSRIS LSAPIYAAFI
LVLGSALVAI WGLKDPSTTS RTADKIAALS ARAILKMPVL RVLIIVMLCH FFAYGMYSSQ
LPVFLSDTFI WNGLPFGPKA LSYLLMADGV INIFVQLFLL GWVSQYFSER KLIILIFALL
CTGFLTAGIA TTIPVLIFAI VCISIADALA KPTYLAALSV HVSPARQGIV IGTAQALIAI
ADFISPVLGG FVLGYALYGV WIGIAISVAI IGLVTAMIYL SKSSVLIAKP ETE
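
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be rejoined before any length or composition check. The following is a minimal sketch of that step, assuming only whitespace separates the blocks; join_blocks and gc_percent are hypothetical helper names, and the GC figure is computed as a plain percentage of G and C bases.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence: six blocks of 10 bases.
first_line = "ATGAATACTA ATGTCTATGA GAACACCGAC AGCGAAACTA TCACCCCGCT CAACAAGCGG "
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying join_blocks to the complete gene sequence and taking its length gives a value that can be compared with the Gene Length field, and gc_percent can be compared with the listed GC content.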