Gene SeD_A4406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4406 
Symbol 
ID6874828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4254187 
End bp4255569 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content53% 
IMG OID642787326 
Productinner membrane symporter YihP 
Protein accessionYP_002217937 
Protein GI198246158 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0907861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA CATCATCGAA TCCGGCAACC CTACGCTTGC CGTTTAAAGA AAAACTTGCC 
TATGGACTGG GGGATTTAGG TTCTAATATC CTGTTAGATA TCGGAACCCT CTATTTACTC
AAATTTTATA CCGATGTGCT GGGTTTACCA GGGACTTACG GCGGGATCAT TTTCCTGATC
GCCAAATTTT TTACCGCATT TACCGATATG GGTACCGGCA TTATGCTCGA CTCGCGGCGT
AAAATTGGTC CGAAGGGCAA ATTCCGCCCG TTCGTGCTTT ACGCGGCATT TCCGGTAACG
CTACTGGCGA TTGCTAACTT TGTCGGCACA CCGTTTGAGG TGACGGGAAA AACCGTCGTC
GCAACGATGC TGTTTATGCT GTACGGGCTG GTTTTCAGCA TGATGAACTG CTCGTATGGC
GCGATGGTAC CCGCGATTAC CAAGAACCCG GATGAACGCG CCTCGCTTGC CGCCTGGCGT
CAGGGCGGCG CCACTCTCGG CCTGCTGCTG TGTACCGTTG GCTTTGTGCC GGTCATGAAC
CTGATCGAAG GCAATGCCCA ACTCAGCTAT ATTTTCGCCG CCACGCTATT TTCATTGTTT
GGCCTGCTAT TTATGTGGCT GTGCTACGCC GGCGTTAAAG AGCGCTACGT TGAAGTGAAA
CCTGTCGATA GCGCGCAAAA GCCTGGATTA TTGCAGTCGT TCCGCGCCAT CGCCGGTAAC
CGTCCGCTGT TTATTCTGTG TATCGCCAAC CTTTGTACTC TCGGCGCCTT CAACGTCAAA
CTGGCGATTC AGGTTTATTA CACCCAGTAC GTTCTTAACG ACCCGATCCT CCTCTCCTGG
ATGGGCTTCT TTAGCATGGG CTGTATTTTT ATCGGCGTCT TTTTGATGCC CGGCGCAGTC
AGGCGTTTTG GCAAGAAGAA AGTCTATATC GGCGGGCTGT TAATATGGGT GGCAGGCGAT
CTGCTCAACT ACTTCTTTGG CGGCGGCTCG GTCAGTTTTG TCGCCTTCTC CTGCCTGGCG
TTCTTCGGTT CCGCCTTCGT CAACAGCCTG AACTGGGCGC TGGTTTCCGA CACGGTGGAG
TACGGTGAAT GGCGCACCGG CGTCCGCTCG GAAGGGACGG TTTACACCGG CTTCACGTTC
TTCCGTAAGG TCTCCCAGGC GCTGGCAGGG TTCTTCCCCG GCTGGATGCT GACGCAAATC
GGTTATATCC CGAATGTGGT GCAATCAGCA GGCACCGTCG AAGGCCTACG CCAGTTGATC
TTTATTTATC CTTGCGTGCT GGCGGTCATC ACCATTATTG CGATGGGCTG TTTCTACAAC
CTCAACGAGA AGATGTACGT GCGAATTGTG GAAGAGATTG AGGCCCGGAA ACATACGGTT
TAA
 
Protein sequence
MSQTSSNPAT LRLPFKEKLA YGLGDLGSNI LLDIGTLYLL KFYTDVLGLP GTYGGIIFLI 
AKFFTAFTDM GTGIMLDSRR KIGPKGKFRP FVLYAAFPVT LLAIANFVGT PFEVTGKTVV
ATMLFMLYGL VFSMMNCSYG AMVPAITKNP DERASLAAWR QGGATLGLLL CTVGFVPVMN
LIEGNAQLSY IFAATLFSLF GLLFMWLCYA GVKERYVEVK PVDSAQKPGL LQSFRAIAGN
RPLFILCIAN LCTLGAFNVK LAIQVYYTQY VLNDPILLSW MGFFSMGCIF IGVFLMPGAV
RRFGKKKVYI GGLLIWVAGD LLNYFFGGGS VSFVAFSCLA FFGSAFVNSL NWALVSDTVE
YGEWRTGVRS EGTVYTGFTF FRKVSQALAG FFPGWMLTQI GYIPNVVQSA GTVEGLRQLI
FIYPCVLAVI TIIAMGCFYN LNEKMYVRIV EEIEARKHTV