Gene Information |
Locus tag | SNSL254_A1227 |
Symbol | |
ID | 6485069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1223981 |
End bp | 1225261 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642736627 |
Product | putative sialic acid transporter |
Protein accession | YP_002040385 |
Protein GI | 194444129 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
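The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1223981, 1225261)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Under these assumptions the computed values agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields of this record.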
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATAGCAA AATTCTTCCC GTGGTATAGC GAGATAACAC GTCCACAAAA AAATGCTTTA TTTTCAGCAT GGCTGGGTTA CGTTTTTGAT GGCTTCGACT TTATGCTGAT TTTCTACATT ATGTATCTGA TCAAGGCTGA CTTAGGATTG ACAGATATGG AGGGCGCATT CCTTGCCACA GCGGCCTTTA TTGGGCGACC ATTTGGCGGG GCGCTATTTG GTCTGCTGGC AGACAAATTT GGCCGTAAGC CGTTAATGAT GTGGTCGATA GTTGCCTATT CTGTAGGTAC AGGGTTAAGT GGCCTGGCTT CCGGTGTAAT TATGCTGACG CTTAGTCGCT TCATTGTCGG TATGGGGATG GCGGGGGAGT ATGCTTGCGC TTCTACTTAT GCCGTGGAAA GTTGGCCAAA GCATTTAAAA TCTAAAGCGA GCGCATTTCT GGTTTCAGGT TTCGGTATTG GTAACATCAT AGCAGCCTAT TTTATGCCGT CATTTGCCGA AGCGTATGGT TGGCGTGCTG CTTTTTTTGT CGGTTTGCTA CCCGTTCTTT TAGTAATCTA CATCCGGGCC AGGGCTCCTG AATCTAAAGA GTGGGAAGAA GCCAAACTCA GTGGTCCCGG AAAGCATTCA CAAAGTGCCT GGTCAGTTTT CTCTTTGTCA ATGAAAGGGC TATTTAATCG AGCTCAATTT CCACTGACAT TATGTGTATT TATTGTTCTG TTCTCTATTT TCGGCGCAAA CTGGCCGATC TTTGGTCTAC TGCCTACATA TTTGGCGGGA GAGGGCTTTG ATACGGGCGT GGTCTCTAAT TTAATGACGG CGGCGGCATT CGGCACTGTA TTGGGAAATA TCGTTTGGGG GCTGTGCGCA GATAGAATTG GTTTGAAGAA AACGTTCAGC ATTGGTCTTC TCATGTCCTT TTTATTCATT TTCCCGTTAT TCAGAATTCC GCAAGATAAT TATTTACTGC TGGGCGCATG TTTATTCGGT TTAATGGCGA CTAACGTAGG TGTTGGCGGG TTGGTTCCCA AATTTCTCTA CGACTACTTT CCTCTTGAGG TTCGTGGTTT GGGTACCGGG CTGATTTATA ATCTTGCTGC GACATCAGGC ACATTCAATT CAATGGCGGC GACCTGGCTT GGAATAACAA TGGGGCTAGG CGCTGCGCTA ACGTTCATTG TTGCTTTCTG GACCGCAACA ATTCTACTCA TTATTGGCCT ATCCATTCCG GATAGACTAA AAGCACGTCG TGAAAGTTTT CAATCAACAA AAGAATTTTA A
|
Protein sequence | MIAKFFPWYS EITRPQKNAL FSAWLGYVFD GFDFMLIFYI MYLIKADLGL TDMEGAFLAT AAFIGRPFGG ALFGLLADKF GRKPLMMWSI VAYSVGTGLS GLASGVIMLT LSRFIVGMGM AGEYACASTY AVESWPKHLK SKASAFLVSG FGIGNIIAAY FMPSFAEAYG WRAAFFVGLL PVLLVIYIRA RAPESKEWEE AKLSGPGKHS QSAWSVFSLS MKGLFNRAQF PLTLCVFIVL FSIFGANWPI FGLLPTYLAG EGFDTGVVSN LMTAAAFGTV LGNIVWGLCA DRIGLKKTFS IGLLMSFLFI FPLFRIPQDN YLLLGACLFG LMATNVGVGG LVPKFLYDYF PLEVRGLGTG LIYNLAATSG TFNSMAATWL GITMGLGAAL TFIVAFWTAT ILLIIGLSIP DRLKARRESF QSTKEF
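The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the sequence is an unspliced CDS read in frame from its first base, and that the first codon is a valid initiation codon under translation table 11 (where alternative starts such as GTG are read as Met). The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical. Only the first 30 bases of the record are used here, for brevity.

```python
# Amino-acid assignments for codons enumerated in TCAG order.
# Translation table 11 uses the same assignments as the standard code;
# it differs in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is a start codon, so it is emitted as 'M'
    even when it is GTG or another alternative start.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


head = "GTGATAGCAA AATTCTTCCC GTGGTATAGC"  # first three blocks of the gene sequence
print(translate_cds(head))        # MIAKFFPWYS
print(gc_fraction("GTGATAGCAA"))  # 0.4
```

The translated fragment matches the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.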