Gene SeSA_A0047 details

Gene Information

Locus tag: SeSA_A0047
Symbol:
ID: 6519460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 50450
End bp: 51823
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 49%
IMG OID: 642745222
Product: xylose-proton symporter
Protein accession: YP_002113054
Protein GI: 194738222
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
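
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(50450, 51823)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch gives 1374 bp and 457 aa, matching the Gene Length and Protein Length fields.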


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000774004
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCTG TCATTGAGGA TACCCAGCCT TCCGGGTCAG CATCATTGTC TTTACTACAG 
CGTATTAGCT ACGGCTCTTT GGATGTGGCG GGTAATCTGC TGTACTGCTT CGGTTCAACG
TACATTTTAT ATTTCTACAC AGACGTTGCG GGCATTAGCC TTGCCGTAGC AGGCGTTATC
CTGCTGCTGG CGCGTATTAT CGATGGCATA GACGCCCCCA TATGGGGGAT CATCATCGAT
AAAACGCGTT CACGCTACGG TAAATGTCGT CCCTGGTTTT TATGGTTACC GCTGCCTTTT
GCGGTATTCA GCGCGCTATC ATTTTGGTCT CCTGATATCA GTATGACAGG AAAAGCCATC
TATGCAGCAA TATCTTATAT GATTGCCAGC ATTCTATTTA CCGGACTTAA TACACCACTC
AGTGCAATAT TACCCCTGAT GACCTTATCT CCCAAGGAAA GACTGGTTTT AAATTCCTGG
CGAATGACCG GTGGGCAAAT TGGGGTTTTA TTAATGAACG CGACCGCCTT GCCGTTAGTC
GCTTTTTTAG GTAACGGTGA TGATCACGCT GGTTTTATTT ATACGGCAAT TACATTTGCC
ATTATATCCT GCGCGCTAAC GCTCTTTGCG TTTAAAAACA TTCGTGAAAT GGATACGGAT
AAAATACAGC ATGAACCTAA GTTGCCGATG AAAAAAAGTT TCGCGGCGAT GAAAGGTAAC
TGGCCGTGGA TCCTGATGGT GCTGGCTAAT CTGATCTTCT GGATTGCCCT ACAGCAGCGC
AACACGACCA TTGTCTATTA TCTGACCTAC AACCTCGACC GTAAAGATCT GGTACCGCTG
ATTAACAGCC TGGCGACGAT TCAGATCCTG TTTATCATCG CTATCCCCTT CTTTAGCAAA
TACCTGGCTA AAACCTGGAT ATGGGTAGGC GGTCTGCTGG TCGCCACGTT TGGCGGCGTC
ATGATGTGGC TGGCAGCGGA CAACATTACT TTCCTCATCG CCGCCTGGAT ACTCGGCAAT
ATCGGCAGCG GTATCGCCTG CTCAATGCCG TTTGCCATGC TGGGGTTCGC CGTCGATTTC
GGCGCCTGGA AAACCGGTAT TAAGGCTACC GGCATTCTTA TCGCCTTCGG CAGCACCTTC
TGCATCAAGA TGGGTAGTGG CCTCGGCACC GCTTTCGCCG CCTGGATCAT GAACAGTTTT
GGCTATGTCC CCAACCATGC CCAGAGTGCT GCGGGTCTGG AGGGAATTAT CTGGGCCTTT
ATCTGGGCAC CCGCCCTACT CTTCGCGCTC GCAGCGATCC CACTACTTTT CTTTCGCAAA
TACGAAGCGA TGGAAGAGAA GATTCGCCAC GATCTGGAAA CCATCAACTC ATAA
 
Protein sequence
MSSVIEDTQP SGSASLSLLQ RISYGSLDVA GNLLYCFGST YILYFYTDVA GISLAVAGVI 
LLLARIIDGI DAPIWGIIID KTRSRYGKCR PWFLWLPLPF AVFSALSFWS PDISMTGKAI
YAAISYMIAS ILFTGLNTPL SAILPLMTLS PKERLVLNSW RMTGGQIGVL LMNATALPLV
AFLGNGDDHA GFIYTAITFA IISCALTLFA FKNIREMDTD KIQHEPKLPM KKSFAAMKGN
WPWILMVLAN LIFWIALQQR NTTIVYYLTY NLDRKDLVPL INSLATIQIL FIIAIPFFSK
YLAKTWIWVG GLLVATFGGV MMWLAADNIT FLIAAWILGN IGSGIACSMP FAMLGFAVDF
GAWKTGIKAT GILIAFGSTF CIKMGSGLGT AFAAWIMNSF GYVPNHAQSA AGLEGIIWAF
IWAPALLFAL AAIPLLFFRK YEAMEEKIRH DLETINS
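
The protein sequence is the gene sequence read codon by codon under translation table 11. The following is a minimal Python sketch of that relationship, assuming the usual NCBI layout of the codon table (bases ordered T, C, A, G). It ignores alternative start codons, which table 11 reads as M only at the initiation position. `translate` and the constant names are illustrative and not part of the record.

```python
# Amino acids for the 64 codons, first base varying slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is removed
    first. Trailing bases that do not fill a codon are ignored, and codons
    containing anything other than A, C, G or T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGAGTTCTG TCATTGAGGA"))
```

Applied to the first two blocks of the gene sequence, the sketch returns MSSVIE, the first six residues of the protein sequence.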