Gene SeD_A3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3698 
Symbol 
ID6873840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3547347 
End bp3548837 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content55% 
IMG OID642786674 
Productputative sialic acid transporter 
Protein accessionYP_002217308 
Protein GI198246215 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTT CTACCCAGAA CATCCCGTGG TATCGCCATC TCAACCGGGC GCAGTGGCGG 
GCATTTTCCG CTGCCTGGCT GGGATATCTG CTTGATGGTT TTGATTTTGT GTTGATTGCT
CTTGTACTGA CTGAGGTACA AAGCGAATTT GGGCTGACGA CGGTACAGGC GGCAAGCCTG
ATTTCTGCGG CTTTTATCTC TCGCTGGTTC GGCGGGTTAT TACTGGGCGC GATGGGCGAT
CGCTATGGGC GTCGTCTGGC GATGGTCAGC AGCATCATTC TGTTTTCGGT GGGAACCCTG
GCATGCGGGT TTGCGCCCGG TTACACCACC ATGTTCATCG CCCGACTGGT GATTGGTATG
GGCATGGCGG GCGAATATGG TTCCAGCGCG ACCTATGTGA TTGAAAGCTG GCCAAAACAT
TTACGCAATA AAGCCAGCGG TTTTCTGATT TCCGGCTTCT CCGTCGGCGC GGTCGTTGCC
GCGCAGGTGT ACAGCCTGGT GGTGCCTGTC TGGGGCTGGC GCGCGCTGTT TTTCATTGGC
ATTTTGCCAA TTATCTTCGC TCTCTGGCTG CGGAAAAACA TTCCGGAAGC GGAAGACTGG
AAAGAGAAAC ACGCGGGTAA AGCGCCGGTA CGCACGATGG TCGACATTCT TTATCGGGGC
GAGCATCGCA TCATCAACAT TTTAATGACT TTCGCCGCCG CCGCTGCGCT GTGGTTCTGT
TTTGCCGGTA ACCTACAAAA TGCTGCGATT GTGGCGGGGC TGGGACTACT GTGCGCGGTT
ATCTTTATCA GCTTTATGGT GCAGAGCAGC GGTAAACGCT GGCCCACTGG CGTCATGCTG
ATGCTGGTGG TACTGTTTGC TTTCCTCTAT TCCTGGCCGA TTCAGGCGCT ATTACCCACT
TATCTGAAAA CCGAGCTGGC CTACGATCCG CATACGGTGG CGAATGTCCT GTTCTTTAGC
GGATTTGGCG CGGCGGTTGG TTGCTGCGTA GGCGGTTTTC TTGGCGACTG GCTGGGAACG
CGTAAAGCAT ATGTCTGTAG CCTGCTGGCC TCGCAAATCC TCATTATTCC GGTCTTTGCG
ATTGGCGGCA CAAACGTCTG GGTTCTGGGT CTGCTACTGT TTTTCCAACA GATGTTGGGG
CAGGGGATTG CCGGGATTCT ACCGAAACTG ATCGGCGGTT ACTTCGATAC CGATCAGCGC
GCGGCGGGGC TGGGCTTTAC TTATAACGTC GGCGCGCTCG GCGGCGCGCT GGCGCCGATC
CTGGGAGCGC TGATCGCTCA ACGTCTGGAT TTGGGCACTG CGCTGGCATC GCTCTCTTTC
AGCCTGACGT TTGTCGTGAT CCTGCTTATT GGGCTTGATA TGCCGTCTCG CGTACAGCGT
TGGCTACGTC CGGAAGCGTT ACGCACCCAC GATGCTATTG ACGACAAACC GTTCAGCGGA
GCCGTACCGC TTGGCAGTGG TAAAGGTGCC TTTGTAAAAA CGAAAAGTTA A
 
Protein sequence
MSTSTQNIPW YRHLNRAQWR AFSAAWLGYL LDGFDFVLIA LVLTEVQSEF GLTTVQAASL 
ISAAFISRWF GGLLLGAMGD RYGRRLAMVS SIILFSVGTL ACGFAPGYTT MFIARLVIGM
GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVVA AQVYSLVVPV WGWRALFFIG
ILPIIFALWL RKNIPEAEDW KEKHAGKAPV RTMVDILYRG EHRIINILMT FAAAAALWFC
FAGNLQNAAI VAGLGLLCAV IFISFMVQSS GKRWPTGVML MLVVLFAFLY SWPIQALLPT
YLKTELAYDP HTVANVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQILIIPVFA
IGGTNVWVLG LLLFFQQMLG QGIAGILPKL IGGYFDTDQR AAGLGFTYNV GALGGALAPI
LGALIAQRLD LGTALASLSF SLTFVVILLI GLDMPSRVQR WLRPEALRTH DAIDDKPFSG
AVPLGSGKGA FVKTKS