Gene SeAg_B4249 details


Gene Information

Locus tag: SeAg_B4249
Symbol:
ID: 6795862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 4146794
End bp: 4148176
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 48%
IMG OID: 642778357
Product: inner membrane symporter YihP
Protein accession: YP_002148936
Protein GI: 197249031
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
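
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4146794, 4148176)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Under these assumptions the values agree with the record: 4148176 - 4146794 + 1 = 1383 bp, and 1383 / 3 - 1 = 460 aa.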


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAGA CCTCTGCTGA TCCGGCAACG CTACGCCTGC CGCTTAAAGA AAAAATAGCT 
TATGGCATAG GCGATCTGGG TTCCAATATT CTGCTGGATA TTGGCACACT CTATTTGTTG
AAATTCTACA CAGATGTCCT CGGTCTTCCG GGAACCTATG GTGGGATTAT TTTCTTAATC
GCTAAATTCT TTACCGCATT TGCCGATATG GGCACTGGTA TTGTGCTCGA CTCGCGGCGA
AAAATAGGAC CGAAAGGTAA ATTTCGCCCG TTCGTTCTGT ATGGCTCATT CCCAGTAGCA
CTGTTGGCTA CAGCCAACTT TATAGGAACA CCGTTAGAAA TTACGGGCAA AACGGTCGTC
GCAACGCTGC TGTTTATGCT GTATGGATTA TGCTACAGCC TAATGAACTG CTCTTATGGC
GCGATGGTGC CAGCTATAAC CAAAAATCCA AATGAACGTG CCTCGCTGGC AGCCTGGCGC
CAGGGCGGAT CCACCTTAGG CCTTCTCATT GGCACCGTTG CTTTTGTACC AGTAATGAAT
TTGATTGAAG GCAATCAACA ATTGCAATAT GGCGTAACCG CTGCTCTTTT CTCGTTATGC
GGGCTGCTAT TTATGTGGCT TTGCTATGCG GGTGTCAAAG AGCGTTATGT CGAGGTCAAA
CAGGCTGATT CCGCACAAAA AGCAGGAATT TTGCAATCCT TTCGCGCCAT CGCCGGTAAC
CGCCCGTTGT TTATTCTGTG CGTCGCCAAC CTTTGCACCC TTGCGGCATT TAATGTCAAA
CTGGCGATTC AGGTCTACTA CACCCAGTAC GTGCTGAACG ATCCGATTCT GTTGTCCTAT
ATGGGATTCT TCAGTATGGG TTGTATCTTC ATCGGCGTAT TTTTAATGCC TACCGCTGTA
CGCCGTTTTG GTAAGAAAAA AGTCTATATC GGCGGACTGC TAATTTGGGC CGTGGGTGAT
TTGCTTAACT ACAGCTTCGG CGACAGTTCG GTGAGCTTCG TGGCCTTCTC CTGTCTGGCA
TTCTTTGGTT CAGCATTCGT CAACAGCCTG AACTGGGCGC TAGTCTCGGA CACAGTGGAA
TATGGTGAAT GGCGTACAGG TGTTCGTTCC GAAGGGACGG TTTATACCGG GCTCACCTTC
TTTCGCAAAA TGTCCCAAGC GTTGGCTGGA TTTTTTCCCG GATGGATGCT TACTCAAATT
GGCTACATAC CCAACGTGGT GCAATCAACC AGCACTGTTG AAGGATTACG TCAGTTGATC
TTCATATATC CTTGTGCCCT CGCAGTGTTG GCCATGATTA CAATGGGTTG TTTTTACAAC
CTCAACGAGA AAATGTACAT ACGTATCGTT GAGGAAATAG AAGCACGTAA ACGTACTGCT
TAA
 
Protein sequence
MSQTSADPAT LRLPLKEKIA YGIGDLGSNI LLDIGTLYLL KFYTDVLGLP GTYGGIIFLI 
AKFFTAFADM GTGIVLDSRR KIGPKGKFRP FVLYGSFPVA LLATANFIGT PLEITGKTVV
ATLLFMLYGL CYSLMNCSYG AMVPAITKNP NERASLAAWR QGGSTLGLLI GTVAFVPVMN
LIEGNQQLQY GVTAALFSLC GLLFMWLCYA GVKERYVEVK QADSAQKAGI LQSFRAIAGN
RPLFILCVAN LCTLAAFNVK LAIQVYYTQY VLNDPILLSY MGFFSMGCIF IGVFLMPTAV
RRFGKKKVYI GGLLIWAVGD LLNYSFGDSS VSFVAFSCLA FFGSAFVNSL NWALVSDTVE
YGEWRTGVRS EGTVYTGLTF FRKMSQALAG FFPGWMLTQI GYIPNVVQST STVEGLRQLI
FIYPCALAVL AMITMGCFYN LNEKMYIRIV EEIEARKRTA
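
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted as plain text with spaces between 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(raw: str) -> str:
    """Remove whitespace and block separators, returning upper-case bases."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = (
        "ATGAGTCAGA CCTCTGCTGA TCCGGCAACG CTACGCCTGC CGCTTAAAGA AAAAATAGCT"
    )
    dna = clean(first_line)
    print(translate(dna))  # should match the first 20 residues of the protein
    print(round(gc_fraction(dna), 2))
```

Running `translate` on the full gene sequence should yield the full 460-residue protein, and `gc_fraction` on the full sequence can be compared against the GC content reported in the record.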