Gene SNSL254_A3518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3518 
Symbol 
ID6485686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3412314 
End bp3413687 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID642738801 
ProductPTS family galactitol-specific enzyme IIC 
Protein accessionYP_002042520 
Protein GI194446386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID[TIGR00827] PTS system, galactitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAGCG AAATAATGCG TTATATCCTC GACTTAGGCC CAACGGTGAT GTTGCCTCTG 
GTGATCATCG TGTTCTCTAA ACTGCTGGGA ATGAAGCTTG GGGATTGTTT TAAATCGGGT
TTGCATATTG GTATCGGCTT CGTGGGTATT GGTCTGGTCA TCGGCCTGAT GCTGGATTCT
ATCGGCCCCG CGGCGAAAGC CATGGCGGAA CATTTCCAAA TCAATCTCCA CGTTATTGAC
GTCGGCTGGC CAGGTTCATC GCCCATGACC TGGGCGTCAC AAATCGCGCT GGTCGCGATC
CCTGTCGCCA TCGGGGTTAA CGTCCTGATG CTGGTGACCC GCATGACCCG CGTGGTGAAT
GTTGATATCT GGAATATCTG GCACATGACC TTCACGGGCG CCATGCTGCA TCTGGCGACC
GGTTCATACT GGCTGGGGAT TCTGGGCGTT GTGGTTCATG CCGCCTTTGT CTACAAACTG
GGGGACTGGT TTGCCAAAGA TACCCGCGAC TATTTTGGCC TCGAGGGGAT TGCTATCCCA
CACGGTTCAT CCGCGTACCT GGGCCCCGTG GCGGTACTCG TTGATACCAT TATCGAGAAA
ATTCCGGGTC TCAATCGTAT TCACTTTAGC GCAGACGATG TCCAGAAACG CTTCGGACCG
TTTGGCGAGC CGGTGACTGT CGGCTTCGTG ATGGGGCTGG TGATTGGTGT ACTGGCAGGC
TACGACGCCA AAGCCGTTCT GCAACTGGCG GTCAAAACCG CAGCGGTGAT GCTGCTTATG
CCACGCGTCA TTAAACCTAT TATGGATGGC CTAACGCCTA TCGCGAAACA TGCGCGTAAA
CGTCTACAGG CTAAATTTGG CGGGCAGGAG TTCCTGATAG GCCTTGATCC AGCGCTACTG
CTCGGTCATA CCTCTGTTGT CTCCGCGAGC CTGATATTCA TTCCGCTGAC CATCCTGATT
GCCGTCTTAG TACCAGGGAA CCAGGTGCTG CCGTTCGGCG ACCTCGCCAC CATCGGTTTC
TTTATTGCGA TGGCGGTTGC GGTACACCAG GGCAACCTGT TCCGCACGCT GATTTCAGGT
GTCATTATCA TGGGCATCAC CCTGTGGATA GCCACCCAGA CGATTGGCCT GCATACCCAA
CTGGCTGCCA ATGCCGGAGC GCTAAAAGCT GGCGGACAAG TCGCCTCGCT GGATCAGGGC
GGTTCCCCCA TCACCTGGCT GCTGATTCAA CTTTTTACCT GGCAGAATAT CGTCGGCTTC
GCCGTCATTG CCATTATCTA TCTGGCTGGC GTACTGCTGA CCTGGCGTCG CGCCCGACAG
TTTGTCGCGG CTGAGAAAGC CACGGCGCTA CAGCAAAGTC AAATCGCCTC TTAA
 
Protein sequence
MFSEIMRYIL DLGPTVMLPL VIIVFSKLLG MKLGDCFKSG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE HFQINLHVID VGWPGSSPMT WASQIALVAI PVAIGVNVLM LVTRMTRVVN
VDIWNIWHMT FTGAMLHLAT GSYWLGILGV VVHAAFVYKL GDWFAKDTRD YFGLEGIAIP
HGSSAYLGPV AVLVDTIIEK IPGLNRIHFS ADDVQKRFGP FGEPVTVGFV MGLVIGVLAG
YDAKAVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKHARK RLQAKFGGQE FLIGLDPALL
LGHTSVVSAS LIFIPLTILI AVLVPGNQVL PFGDLATIGF FIAMAVAVHQ GNLFRTLISG
VIIMGITLWI ATQTIGLHTQ LAANAGALKA GGQVASLDQG GSPITWLLIQ LFTWQNIVGF
AVIAIIYLAG VLLTWRRARQ FVAAEKATAL QQSQIAS