Gene SNSL254_A1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1223 
Symbol 
ID6485305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1218781 
End bp1220277 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content45% 
IMG OID642736623 
Productsodium-glucose/galactose cotransporter 
Protein accessionYP_002040381 
Protein GI194446720 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.532882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACAC ATTCTTTCGG CATCGTTAAC TATCTTGTAT TATTTGGCTA CCTCCTGGCC 
ATGATGTTAG TCGGTGTCTA TTTTTCCAGA CGGCAAAAAA CAGCAGACGA TTATTTTCGC
GGTGGTGGCC GGGTTCCTGG TTGGGCGGCT GGGGTCAGTG TATTTGCTAC TACGTTAAGC
TCAATTACAT TTATGTCAAT TCCTGCCAAA GCGTTTACTT CCGACTGGAC GTTTATCATT
GGTCAGTATC TGGCTATCGC AATTTTACCG CTGGTTTTTT ATTTCTATAT TCCGTTTTTT
CGGAAATTGA AAGTCACATC AGCCTATGAA TATCTCGAAG CACGGTTCGA TGTGCGCTGC
CGTCTATTCG CCAGCATGTC ATTTATGTTG TTTCATATCG GACGTATCGC CATTATCACT
TTCCTCACCG TGCTGGCCTT GCGCCCCTTC ATCGCTATAG ACCCGGTGAT TTTGGTACTG
TTGATTAGCG TGATGTGTAT CATTTATACC TGGATGGGTG GAATTGAAGG AGTAATATGG
ACTGATGTTA TTCAAGGCCT CTTACTTTCT GGCAGCGCAA TACTGATTTT TATAGTGATA
TGTCTCAAAG TCCAGGGCGG CATTGGTGAA ATTTTTACGG TGACGCAGCA GGCGGATAAA
TTCTTTCCGG CTACGCAGTT CCACTGGAGC TGGACGGAAA GCACAGTACC TGTATTGATG
ATTGGTTTTC TGTTTGCCAA TATTCAGCAA TTTACTGCCA GTCAGGATGT GGTCCAACGC
TATATCGTGA CTGACTCCAT AGAGGAAACG AAGAAAACAT TACTTACAAA TGCCAAACTG
GTTGCTGTGA TCCCTGTTTT CTTTTTTGCT ATCGGCTCGG CATTGTTTGT CTACTATCAG
CAACATCCAC AATTATTACC GGCGGGATTC AACACTGGCG GCATTTTACC CTTATTCGTG
GTCACCGAAA TGCCAGTCGG CATTGCAGGG TTGATAATCG CCGCTATTTT CGCTGCTGCG
CAGTCCAGTA TCTCCAGCAG CTTAAACAGC ATTTCCAGTT GTTTTAATTC CGATATCTAT
CAGCGTTTGA GTCATAAAAA AAGAACGCCT GAAAACCGTA TGAAAATAGC TAAGTTAGTT
ATTCTGGTCG CGGGCCTGAT AAGTAGCGCG GCCTCAGTAT GGCTGGTCAT GGCCGATGAA
TCAGAAATCT GGGATGCATT TAATAGTCTG ATAGGTCTGA TGGGAGGGCC AATGACCGGT
CTGTTCATGC TGGGCATTTT CTTTAAACGA GCAAATGCCG GGAGTGCGGT TTTAGGAATT
ATTATCAGCG TCATTACCGT GCTGGGAGCA CGCTATGCCA CTGACCTTAA CTTCTTCTTT
TATGGGGTCA TTGGCTCGCT AAGCGTGGTG ATCAGCGGCG TTATTTTCGC CCCGTTATTT
GCCCCGGCAC CGCCATTGAC GCTGGATGAA AAACCTGAGC CAAAGGTGAC ATTATGA
 
Protein sequence
MITHSFGIVN YLVLFGYLLA MMLVGVYFSR RQKTADDYFR GGGRVPGWAA GVSVFATTLS 
SITFMSIPAK AFTSDWTFII GQYLAIAILP LVFYFYIPFF RKLKVTSAYE YLEARFDVRC
RLFASMSFML FHIGRIAIIT FLTVLALRPF IAIDPVILVL LISVMCIIYT WMGGIEGVIW
TDVIQGLLLS GSAILIFIVI CLKVQGGIGE IFTVTQQADK FFPATQFHWS WTESTVPVLM
IGFLFANIQQ FTASQDVVQR YIVTDSIEET KKTLLTNAKL VAVIPVFFFA IGSALFVYYQ
QHPQLLPAGF NTGGILPLFV VTEMPVGIAG LIIAAIFAAA QSSISSSLNS ISSCFNSDIY
QRLSHKKRTP ENRMKIAKLV ILVAGLISSA ASVWLVMADE SEIWDAFNSL IGLMGGPMTG
LFMLGIFFKR ANAGSAVLGI IISVITVLGA RYATDLNFFF YGVIGSLSVV ISGVIFAPLF
APAPPLTLDE KPEPKVTL