Gene SeHA_C4340 details


Gene Information

Locus tag: SeHA_C4340
Symbol:
ID: 6489148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4230149
End bp: 4231531
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 48%
IMG OID: 642744426
Product: inner membrane symporter YihP
Protein accession: YP_002048015
Protein GI: 194447635
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
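
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4230149
END_BP = 4231531
GENE_LENGTH_BP = 1383
PROTEIN_LENGTH_AA = 460


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of residue-encoding codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1383, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 460, matches Protein Length
```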


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGA CCTCTGCTGA TCCGGCAACG CTACGCCTGC CGCTTAAAGA AAAAATAGCT 
TATGGCATAG GCGATCTGGG TTCCAATATT CTGCTGGATA TTGGCACACT CTATTTGTTG
AAATTCTACA CAGATGTCCT CGGTCTTCCG GGAACCTATG GTGGGATTAT TTTCTTAATC
GCTAAATTCT TTACCGCATT TGCCGATATG GGCACTGGTA TTGTGCTCGA CTCGCGGCGA
AAAATAGGAC CGAAAGGTAA ATTTCGCCCG TTCGTTCTGT ATGGCTCATT CCCAGTAGCA
CTGCTGGCTA CAGCCAACTT TATAGGAACA CCGTTAGAAA TTACGGGCAA AACGGTCGTC
GCAACGCTGC TGTTTATGCT GTATGGATTA TGCTACAGCC TAATGAACTG CTCTTATGGC
GCGATGGTGC CAGCTATAAC CAAAAATCCA AATGAACGTG CCTCGCTGGC AGCCTGGCGC
CAGGGCGGAT CCACCTTAGG CCTTCTCATT GGCACCGTTG CTTTTGTACC AGTAATGAAT
TTGATTGAAG GCAATCAACA ATTGCAATAT GGCGTAACCG CTGCTCTTTT CTCGTTATGC
GGGCTGCTAT TTATGTGGCT TTGCTATGCG GGTGTCAAAG AGCGTTATGT CGAGGTCAAA
CAGGCTGATT CCGCAAAAAA AGCAGGAATT TTGCAATCCT TTCGCGCCAT CGCCGGTAAC
CGCCCGTTGT TTATTCTGTG CGTCGCCAAC CTTTGCACCC TTGCGGCATT TAATGTCAAA
CTGGCGATTC AGGTCTACTA CACCCAGTAC GTGCTGAACG ATCCGATTCT GTTGTCCTAT
ATGGGATTCT TCAGTATGGG TTGTATCTTC ATCGGCGTAT TTTTAATGCC TACCGCTGTA
CGCCGTTTTG GTAAGAAAAA AGTCTATATC GGCGGACTGC TAATTTGGGC CGTGGGTGAT
TTGCTTAACT ACAGCTTCGG CGACAGTTCG GTGAGCTTCG TGGCCTTCTC CTGTCTGGCA
TTCTTTGGTT CAGCATTCGT CAACAGCCTG AACTGGGCGC TAGTCTCGGA CACAGTGGAA
TATGGTGAAT GGCGTACAGG TGTTCGTTCC GAAGGGACCG TTTATACCGG GCTCACCTTC
TTTCGCAAAA TGTCCCAAGC GTTGGCTGGA TTTTTTCCCG GATGGATGCT TACTCAAATT
GGCTACATAC CCAACGTGGT GCAATCAACC AGCACTGTTG AAGGATTACG TCAGTTGATC
TTCATATATC CTTGTGCCCT CGCAGTGTTG GCCATGATTA CAATGGGCTG TTTTTACAAC
CTCAACGAGA AAATGTACAT ACGTATCGTT GAGGAAATAG AAGCACGTAA ACGTACTGCT
TAA
 
Protein sequence
MSQTSADPAT LRLPLKEKIA YGIGDLGSNI LLDIGTLYLL KFYTDVLGLP GTYGGIIFLI 
AKFFTAFADM GTGIVLDSRR KIGPKGKFRP FVLYGSFPVA LLATANFIGT PLEITGKTVV
ATLLFMLYGL CYSLMNCSYG AMVPAITKNP NERASLAAWR QGGSTLGLLI GTVAFVPVMN
LIEGNQQLQY GVTAALFSLC GLLFMWLCYA GVKERYVEVK QADSAKKAGI LQSFRAIAGN
RPLFILCVAN LCTLAAFNVK LAIQVYYTQY VLNDPILLSY MGFFSMGCIF IGVFLMPTAV
RRFGKKKVYI GGLLIWAVGD LLNYSFGDSS VSFVAFSCLA FFGSAFVNSL NWALVSDTVE
YGEWRTGVRS EGTVYTGLTF FRKMSQALAG FFPGWMLTQI GYIPNVVQST STVEGLRQLI
FIYPCALAVL AMITMGCFYN LNEKMYIRIV EEIEARKRTA
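
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a minimal illustration and not the tool that produced the record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and it assumes translation simply stops at the first stop codon. The function name `translate` and the 30-base test fragment (the start of the gene sequence, with the spaces removed) are choices made for this example.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Codons containing characters other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, followed by the stop codon TAA.
fragment = "ATGAGTCAGA CCTCTGCTGA TCCGGCAACG TAA"
print(translate(fragment))  # MSQTSADPAT
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed in this record.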