Gene SeHA_C3636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3636 
Symbol 
ID6488277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3521827 
End bp3523317 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content55% 
IMG OID642743754 
Productputative sialic acid transporter 
Protein accessionYP_002047366 
Protein GI194449265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00891] putative sialic acid transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTT CTACCCAGAA CATCCCGTGG TATCGCCATC TCAACCGGGC GCAGTGGCGG 
GCATTTTCCG CTGCCTGGCT GGGATATCTG CTTGATGGTT TTGATTTTGT GTTAATTGCT
CTTGTACTGA CTGAGGTACA AAGCGAATTT GGGCTGACGA CGGTACAGGC GGCAAGCCTG
ATTTCTGCGG CTTTTATCTC TCGCTGGTTC GGCGGGTTAT TACTGGGCGC GATGGGCGAT
CGCTATGGGC GTCGTCTGGC GATGGTCAGC AGCATCATTC TGTTTTCGGT GGGAACCCTG
GCCTGCGGGT TTGCGCCCGG TTACACCACC ATGTTCATCG CCCGACTGGT GATTGGTATG
GGCATGGCGG GCGAATATGG TTCCAGCGCG ACCTATGTGA TTGAAAGCTG GCCAAAACAT
TTACGCAATA AGGCCAGTGG TTTTCTGATT TCCGGCTTCT CCGTCGGCGC GGTCGTTGCC
GCGCAGGTGT ACAGCCTGGT GGTGCCTGTC TGGGGCTGGC GCGCGCTGTT TTTCATTGGC
ATTTTGCCAA TTATCTTCGC CCTCTGGCTG CGGAAAAACA TTCCGGAAGC GGAAGACTGG
AAAGAAAAAC ACGCGGGTAA AGCGCCGGTA CGTACGATGG TCGACATTCT TTATCGGGGC
GAACATCGCA TCATCAACAT TTTAATGACT TTCGTCGCCG CCGCTGCGCT GTGGTTCTGT
TTTGCCGGTA ACCTACAAAA TGCTGCGATT GTGGCGGGGC TGGGACTACT GTGCGCGGTT
ATCTTTATCA GCTTTATGGT GCAGAGCAGC GGTAAACGCT GGCCCACTGG CGTCATGCTG
ATGCTGGTGG TACTGTTTGC TTTCCTCTAT TCCTGGCCGA TTCAGGCGCT ATTACCCACT
TATCTGAAAA CCGAGCTGGC CTACGATCCG CATACGGTGG CGAATGTCCT GTTCTTTAGC
GGATTTGGCG CGGCGGTTGG TTGCTGCGTA GGCGGTTTTC TTGGCGACTG GCTGGGAACG
CGTAAAGCAT ATGTCTGTAG CCTGCTGGCC TCGCAAATCC TCATTATTCC GGTCTTTGCG
ATTGGCGGCA CAAACGTCTG GGTTCTGGGT CTGCTACTGT TTTTCCAACA GATGTTGGGG
CAGGGGATTG CCGGGATTCT ACCGAAACTG ATCGGCGGTT ACTTCGATAC CGATCAGCGC
GCGGCGGGGC TGGGCTTTAC TTATAACGTC GGCGCGCTCG GCGGCGCGCT GGCGCCGATC
CTGGGAGCGC TGATCGCTCA ACGTCTGGAT TTGGGCACTG CGCTGGCATC GCTCTCTTTC
AGCCTGACGT TTGTCGTGAT CCTGCTTATT GGGCTTGATA TGCCGTCTCG CGTACAGCGT
TGGCTACGTC CGGAAGCGTT ACGCACCCAC GATGCTATTG ACGACAAACC GTTCAGCGGA
GCCGTACCGC TTGGCAGTGG TAAAGGTGCC TTTGTAAAAA CGAAAAGTTA A
 
Protein sequence
MSTSTQNIPW YRHLNRAQWR AFSAAWLGYL LDGFDFVLIA LVLTEVQSEF GLTTVQAASL 
ISAAFISRWF GGLLLGAMGD RYGRRLAMVS SIILFSVGTL ACGFAPGYTT MFIARLVIGM
GMAGEYGSSA TYVIESWPKH LRNKASGFLI SGFSVGAVVA AQVYSLVVPV WGWRALFFIG
ILPIIFALWL RKNIPEAEDW KEKHAGKAPV RTMVDILYRG EHRIINILMT FVAAAALWFC
FAGNLQNAAI VAGLGLLCAV IFISFMVQSS GKRWPTGVML MLVVLFAFLY SWPIQALLPT
YLKTELAYDP HTVANVLFFS GFGAAVGCCV GGFLGDWLGT RKAYVCSLLA SQILIIPVFA
IGGTNVWVLG LLLFFQQMLG QGIAGILPKL IGGYFDTDQR AAGLGFTYNV GALGGALAPI
LGALIAQRLD LGTALASLSF SLTFVVILLI GLDMPSRVQR WLRPEALRTH DAIDDKPFSG
AVPLGSGKGA FVKTKS