Gene SNSL254_A3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3601 
Symbol 
ID6486142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3488571 
End bp3490061 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content55% 
IMG OID642738877 
Productputative sialic acid transporter 
Protein accessionYP_002042594 
Protein GI194445856 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTT CTACCCAAAA CATCCCGTGG TATCGCCATC TCAACCGGGC GCAGTGGCGG 
GCATTTTCCG CTGCCTGGCT GGGATATCTG CTTGATGGTT TTGATTTTGT GTTGATTGCT
CTTGTACTGA CTGAGGTACA AAGCGAATTT GGGCTGACGA CGGTACAGGC GGCAAGCCTG
ATTTCTGCGG CTTTTATCTC TCGCTGGTTC GGCGGGTTAT TACTGGGCGC GATGGGCGAT
CGCTATGGGC GTCGTCTGGC GATGGTCAGC AGCATCATTC TGTTTTCGGT GGGAACCCTG
GCATGCGGGT TTGCGCCCGG TTACACCACC ATGTTCATCG CCCGACTGGT GATTGGTATG
GGCATGGCGG GCGAATATGG TTCCAGCGCG ACCTATGTGA TTGAAAGCTG GCCAAAACAT
TTACGCAATA AAGCCAGCGG TTTTCTGATT TCCGGCTTCT CCGTCGGCGC GGTCGTTGCC
GCGCAGGTGT ACAGCCTGGT GGTGCCTGTC TGGGGCTGGC GCGCGCTGTT TTTCATTGGC
ATTTTGCCAA TTATCTTCGC TCTCTGGCTG CGGAAAAACA TTCCGGAAGC GGAAGACTGG
AAAGAGAAAC ACGCGGGTAA AGCGCCGGTA CGCACGATGG TCGACATTCT TTATCGGGGC
GAGCATCGCA TCATCAACAT TTTAATGACT TTCGCCGCCG CCGCTGCGCT GTGGTTCTGT
TTTGCCGGTA ACCTACAAAA TGCTGCGATT GTGGCGGGGC TGGGACTACT GTGCGCGGTT
ATCTTTATCA GCTTTATGGT GCAGAGCAGC GGTAAACGCT GGCCCACTGG CGTCATGCTG
ATGCTGGTGG TACTGTTTGC TTTCCTCTAT TCCTGGCCGA TTCAGGCGCT GTTACCCACT
TATCTGAAAA CCGAGCTGGC CTACGATCCG CATACGGTGG CGAATGTCCT GTTCTTTAGC
GGATTTGGCG CGGCGGTTGG TTGCTGCGTA GGCGGTTTTC TTGGCGACTG GCTGGGAACG
CGTAAAGCAT ATGTCTGTAG CCTGCTGGCC TCGCAAATCC TCATTATTCC GGTCTTTGCG
ATTGGCGGCA CAAACGTCTG GGTTCTGGGT CTGCTACTGT TTTTCCAACA GATGTTGGGG
CAGGGGATTG CCGGGATTCT ACCGAAACTG ATCGGCGGTT ACTTCGATAC CGATCAGCGC
GCGGCGGGGC TGGGCTTTAC TTATAACGTC GGCGCGCTCG GCGGCGCGCT GGCACCGATC
CTGGGAGCGC TGATCGCTCA ACGTCTGGAT CTGGGCACTG CGCTGGCATC GCTCTCTTTC
AGCCTGACGT TTGTCGTGAT CCTGCTTATT GGGCTTGATA TGCCGTCTCG CGTACAGCGT
TGGCTACGTC CGGAAGCGTT ACGTACCCAC GATGCTATCG ACGACAAACC GTTCAGCGGA
GCCGTACCGC TTGGCAGTGG TAAAGGTGCC TTTGTAAAAA CGAAAAGTTA A
 
Protein sequence
MSTSTQNIPW YRHLNRAQWR AFSAAWLGYL LDGFDFVLIA LVLTEVQSEF GLTTVQAASL 
ISAAFISRWF GGLLLGAMGD RYGRRLAMVS SIILFSVGTL ACGFAPGYTT MFIARLVIGM
GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVVA AQVYSLVVPV WGWRALFFIG
ILPIIFALWL RKNIPEAEDW KEKHAGKAPV RTMVDILYRG EHRIINILMT FAAAAALWFC
FAGNLQNAAI VAGLGLLCAV IFISFMVQSS GKRWPTGVML MLVVLFAFLY SWPIQALLPT
YLKTELAYDP HTVANVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQILIIPVFA
IGGTNVWVLG LLLFFQQMLG QGIAGILPKL IGGYFDTDQR AAGLGFTYNV GALGGALAPI
LGALIAQRLD LGTALASLSF SLTFVVILLI GLDMPSRVQR WLRPEALRTH DAIDDKPFSG
AVPLGSGKGA FVKTKS