Gene SeHA_C1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1238 
Symbol 
ID6490704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1221171 
End bp1222667 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content45% 
IMG OID642741475 
Productsodium-glucose/galactose cotransporter 
Protein accessionYP_002045126 
Protein GI194449814 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.425742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACAC ATTCTTTCGG CATCGTTAAT TATCTTGTAT TATTTGGCTA CCTCCTGGCC 
ATGATGTTAG TCGGTGTCTA TTTTTCCAGA CGGCAAAAAA CAGCAGACGA TTATTTTCGC
GGTGGTGGCC GGGTTCCTGG TTGGGCGGCT GGGGTCAGTG TATTTGCTAC TACGTTAAGC
TCAATTACAT TTATGTCAAT TCCTGCCAAA GCGTTTACTT CCGACTGGAC GTTTATCATT
GGTCAGTATC TGGCTATCGC AATTTTACCG CTGGTTTTTT ATTTCTATAT TCCGTTTTTT
CGGAAATTGA AAGTCACATC AGCCTATGAA TATCTCGAAG CACGGTTCGA TGTGCGCTGC
CGTCTATTCG CCAGCATGTC ATTTATGTTG TTTCATATCG GACGTATCGC CATTATCACT
TTCCTCACCG TGCTGGCCTT GCGCCCCTTC ATCGCTATAG ACCCGGTGAT TTTGGTACTG
TTGATTAGCG TGATGTGTAT CATTTATACC TGGATGGGTG GAATTGAAGG AGTAATATGG
ACTGATGTTA TTCAAGGCCT CTTACTTTCT GGCAGCGCGA TACTGATTTT TATAGTGATA
TGTCTCAAAG TCCAGGGCGG CATTGGTGAA ATTTTTACGG TGACGCAGCA GGCGGATAAA
TTCTTTCCGG CTACGCAGTT CCACTGGAGC TGGACGGAAA GCACAGTACC TGTATTGATG
ATTGGTTTTC TGTTTGCCAA TATTCAGCAA TTTACTGCCA GTCAGGATGT GGTCCAACGC
TATATCGTGA CTGACTCCAT AGAGGAAACG AAGAAAACAT TACTTACAAA TGCCAAACTG
GTTGCTGTGA TCCCTGTTTT CTTTTTTGCT ATAGGCTCGG CATTGTTTGT TTACTATCAG
CAACATCCAC AATTATTACC GGCGGGATTC AACACTGGCG GCATTTTACC CTTATTCGTG
GTCACCGAAA TGCCAGTCGG CATTGCAGGG TTGATAATCG CCGCTATTTT CGCTGCTGCG
CAGTCCAGCA TCTCCAGCAG CTTAAACAGC ATTTCCAGTT GTTTTAATTC CGATATCTAT
CAGCGTTTGA GTCATAAAAA AAGAACGCCA GAAAACCGTA TGAAAATAGC TAAGTTAGTT
ATTCTGGTCG CGGGCCTGAT AAGTAGCGCG GCCTCGGTAT GGCTGGTCAT GGCCGATGAA
TCAGAAATCT GGGATGCATT TAATAGTCTG ATAGGTCTGA TGGGAGGGCC AATGACCGGT
CTGTTCATGC TGGGCATTTT CTTTAAACGA GCAAATGCCG GGAGTGCGGT TTTAGGAATT
ATTATCAGCG TCATTACCGT GCTGGGCGCA CGCTATGCCA CTGACCTTAA CTTCTTCTTT
TATGGGGTCA TTGGCTCGCT AAGCGTGGTG ATCAGCGGCG TTATTTTCGC CCCGTTATTT
GCCCCGGCAC CGCCATTGAC GCTGGATGAA AAACCTGAAC CAAAGGTGAC ATTATGA
 
Protein sequence
MITHSFGIVN YLVLFGYLLA MMLVGVYFSR RQKTADDYFR GGGRVPGWAA GVSVFATTLS 
SITFMSIPAK AFTSDWTFII GQYLAIAILP LVFYFYIPFF RKLKVTSAYE YLEARFDVRC
RLFASMSFML FHIGRIAIIT FLTVLALRPF IAIDPVILVL LISVMCIIYT WMGGIEGVIW
TDVIQGLLLS GSAILIFIVI CLKVQGGIGE IFTVTQQADK FFPATQFHWS WTESTVPVLM
IGFLFANIQQ FTASQDVVQR YIVTDSIEET KKTLLTNAKL VAVIPVFFFA IGSALFVYYQ
QHPQLLPAGF NTGGILPLFV VTEMPVGIAG LIIAAIFAAA QSSISSSLNS ISSCFNSDIY
QRLSHKKRTP ENRMKIAKLV ILVAGLISSA ASVWLVMADE SEIWDAFNSL IGLMGGPMTG
LFMLGIFFKR ANAGSAVLGI IISVITVLGA RYATDLNFFF YGVIGSLSVV ISGVIFAPLF
APAPPLTLDE KPEPKVTL