Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1223 |
Symbol | |
ID | 6485305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1218781 |
End bp | 1220277 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642736623 |
Product | sodium-glucose/galactose cotransporter |
Protein accession | YP_002040381 |
Protein GI | 194446720 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.532882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACAC ATTCTTTCGG CATCGTTAAC TATCTTGTAT TATTTGGCTA CCTCCTGGCC ATGATGTTAG TCGGTGTCTA TTTTTCCAGA CGGCAAAAAA CAGCAGACGA TTATTTTCGC GGTGGTGGCC GGGTTCCTGG TTGGGCGGCT GGGGTCAGTG TATTTGCTAC TACGTTAAGC TCAATTACAT TTATGTCAAT TCCTGCCAAA GCGTTTACTT CCGACTGGAC GTTTATCATT GGTCAGTATC TGGCTATCGC AATTTTACCG CTGGTTTTTT ATTTCTATAT TCCGTTTTTT CGGAAATTGA AAGTCACATC AGCCTATGAA TATCTCGAAG CACGGTTCGA TGTGCGCTGC CGTCTATTCG CCAGCATGTC ATTTATGTTG TTTCATATCG GACGTATCGC CATTATCACT TTCCTCACCG TGCTGGCCTT GCGCCCCTTC ATCGCTATAG ACCCGGTGAT TTTGGTACTG TTGATTAGCG TGATGTGTAT CATTTATACC TGGATGGGTG GAATTGAAGG AGTAATATGG ACTGATGTTA TTCAAGGCCT CTTACTTTCT GGCAGCGCAA TACTGATTTT TATAGTGATA TGTCTCAAAG TCCAGGGCGG CATTGGTGAA ATTTTTACGG TGACGCAGCA GGCGGATAAA TTCTTTCCGG CTACGCAGTT CCACTGGAGC TGGACGGAAA GCACAGTACC TGTATTGATG ATTGGTTTTC TGTTTGCCAA TATTCAGCAA TTTACTGCCA GTCAGGATGT GGTCCAACGC TATATCGTGA CTGACTCCAT AGAGGAAACG AAGAAAACAT TACTTACAAA TGCCAAACTG GTTGCTGTGA TCCCTGTTTT CTTTTTTGCT ATCGGCTCGG CATTGTTTGT CTACTATCAG CAACATCCAC AATTATTACC GGCGGGATTC AACACTGGCG GCATTTTACC CTTATTCGTG GTCACCGAAA TGCCAGTCGG CATTGCAGGG TTGATAATCG CCGCTATTTT CGCTGCTGCG CAGTCCAGTA TCTCCAGCAG CTTAAACAGC ATTTCCAGTT GTTTTAATTC CGATATCTAT CAGCGTTTGA GTCATAAAAA AAGAACGCCT GAAAACCGTA TGAAAATAGC TAAGTTAGTT ATTCTGGTCG CGGGCCTGAT AAGTAGCGCG GCCTCAGTAT GGCTGGTCAT GGCCGATGAA TCAGAAATCT GGGATGCATT TAATAGTCTG ATAGGTCTGA TGGGAGGGCC AATGACCGGT CTGTTCATGC TGGGCATTTT CTTTAAACGA GCAAATGCCG GGAGTGCGGT TTTAGGAATT ATTATCAGCG TCATTACCGT GCTGGGAGCA CGCTATGCCA CTGACCTTAA CTTCTTCTTT TATGGGGTCA TTGGCTCGCT AAGCGTGGTG ATCAGCGGCG TTATTTTCGC CCCGTTATTT GCCCCGGCAC CGCCATTGAC GCTGGATGAA AAACCTGAGC CAAAGGTGAC ATTATGA
|
Protein sequence | MITHSFGIVN YLVLFGYLLA MMLVGVYFSR RQKTADDYFR GGGRVPGWAA GVSVFATTLS SITFMSIPAK AFTSDWTFII GQYLAIAILP LVFYFYIPFF RKLKVTSAYE YLEARFDVRC RLFASMSFML FHIGRIAIIT FLTVLALRPF IAIDPVILVL LISVMCIIYT WMGGIEGVIW TDVIQGLLLS GSAILIFIVI CLKVQGGIGE IFTVTQQADK FFPATQFHWS WTESTVPVLM IGFLFANIQQ FTASQDVVQR YIVTDSIEET KKTLLTNAKL VAVIPVFFFA IGSALFVYYQ QHPQLLPAGF NTGGILPLFV VTEMPVGIAG LIIAAIFAAA QSSISSSLNS ISSCFNSDIY QRLSHKKRTP ENRMKIAKLV ILVAGLISSA ASVWLVMADE SEIWDAFNSL IGLMGGPMTG LFMLGIFFKR ANAGSAVLGI IISVITVLGA RYATDLNFFF YGVIGSLSVV ISGVIFAPLF APAPPLTLDE KPEPKVTL
|
| |