Gene Information |
Locus tag | SNSL254_A3952 |
Symbol | |
ID | 6485970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3833313 |
End bp | 3834809 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642739212 |
Product | L-xylulose/3-keto-L-gulonate kinase |
Protein accession | YP_002042922 |
Protein GI | 194443418 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1070] Sugar (pentulose and hexulose) kinases |
TIGRFAM ID | |
|
|
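The start and end coordinates, gene length, and protein length listed above agree with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3833313, 3834809)
print(length, protein_length(length))  # 1497 498
```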
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACT ACTGGCTGGG GTTAGATTGT GGTGGGAGTT GGCTAAAAGC CGGGTTGTAC GATGGCGCAG GCCGGGAAGT AGCGGTGCAA CGCCTGCCGC TGCACGCTTT AAGCCCGCAG CCAGGCTGGG TTGAACGCGA TATGACCGAA CTGTGGCAAC AGTGCGCTTC GGTCATCAGC AAACTGCTGA CGCACACGGG GGTGAGCGGC TCACAAATCC GCGGTCTGGG TATTTCAGCC CAAGGGAAAG GCCTGTTCCT GTTAGATAAA AGCGATCGGC CATTAGGTAA AGCGATACTC TCTTCCGATC GTCGCGCCAT GGAAATTGTC CAGCGCTGGC AAAAAGAAGC GGTTCCGCAA AAACTCTACC CGCTGACCCG GCAAACCCTG TGGACTGGGC ATCCGGTATC CCTTTTACGC TGGGTAAAAG AGAATGAGCC GCAGCGCTAC GCGCAGATAG GCAGCGTCAT GATGACGCAT GACTATCTGC GCTGGTGCTT AACCGGCGTG AAAGGCTGTG AGGAGAGCAA TATCTCCGAG TCCAACCTCT ACAACATGGC GACGGGCCAG TACGACCCGC GCCTGACCGA ATGGCTGGGC ATCAGTGAAA TCGATAGCGC GCTGCCCCCC GTGGTGGGTT CAGCCGAAAT ATGCGGGGAG ATCACCGCTC AGGCGGCCGC CATAACCGGT CTGGCGGCGG GTACCCCCGT CGTCGGCGGC CTGTTTGATG TGGTTTCCAC CGCCCTTTGC GCCGGTATTG AGAATGAATC GACCCTCAAT GCGGTGATGG GCACCTGGGC CGTCACCAGC GGTATCGCTC ACGGCCTGCG TGACCATGAG GCCCACCCTT ACGTCTATGG CCGCTACGTC AATGACGGGC AGTATATCGT TCACGAAGCC AGCCCGACCT CCTCCGGCAA CCTTGAATGG TTTACCGCCC AGTGGGGCGA CCTCTCTTTT GACGAGATCA ATCAGGCGGT CGCCAGCCTG CCGAAAGCCG GTAGCGACCT TTTTTTTCTG CCGTTTCTCT ACGGCAGCAA TGCCGGACTG GAGATGACCT GCGGCTTTTA CGGCATGCAG GCGCTGCACA CCCGCGCCCA TCTGCTGCAG GCGATTTATG AAGGCGTGGT CTTCAGCCAT ATGACCCACC TCAACCGCAT GCGTGAACGC TTTACCGACG TTTGCGCCCT GCGCGTTACC GGCGGCCCGG CCCACTCCGA CGTCTGGATG CAGATGCTGG CGGACGTCAG CGGTTTACGC ATTGAGCTGC CGCAGGTGGA GGAGACCGGC TGCTTCGGCG CGGCGCTGGC TGCCCGCGTC GGCACCGGCG TATATCGCGA TTTCCGCGAG GCCCAACGCG ACCTGCAGCA CCCGGTGCGC ACGCTGCTGC CGGACATGAC CGCGCACGCC CTCTACCAGC GCAAATACCG CCAATACCAG GATTTGATTG AAGCACTACA GGGCTATCAC GCCCGTATTA AGGAGCATGC ATTATGA
|
Protein sequence | MSNYWLGLDC GGSWLKAGLY DGAGREVAVQ RLPLHALSPQ PGWVERDMTE LWQQCASVIS KLLTHTGVSG SQIRGLGISA QGKGLFLLDK SDRPLGKAIL SSDRRAMEIV QRWQKEAVPQ KLYPLTRQTL WTGHPVSLLR WVKENEPQRY AQIGSVMMTH DYLRWCLTGV KGCEESNISE SNLYNMATGQ YDPRLTEWLG ISEIDSALPP VVGSAEICGE ITAQAAAITG LAAGTPVVGG LFDVVSTALC AGIENESTLN AVMGTWAVTS GIAHGLRDHE AHPYVYGRYV NDGQYIVHEA SPTSSGNLEW FTAQWGDLSF DEINQAVASL PKAGSDLFFL PFLYGSNAGL EMTCGFYGMQ ALHTRAHLLQ AIYEGVVFSH MTHLNRMRER FTDVCALRVT GGPAHSDVWM QMLADVSGLR IELPQVEETG CFGAALAARV GTGVYRDFRE AQRDLQHPVR TLLPDMTAHA LYQRKYRQYQ DLIEALQGYH ARIKEHAL
|
| |
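The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be stripped before the sequence is used elsewhere. The sketch below shows one way to do that, along with a simple GC-fraction calculation and a check for the start and stop codons. The helper names are hypothetical, and the set of stop codons is an assumption based on the standard bacterial code (translation table 11).

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that separates the display blocks; uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if the sequence begins with ATG and ends with TAA, TAG, or TGA (assumed table 11 stops)."""
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# First two display blocks of the gene sequence above, used as a small demo.
demo = clean_sequence("ATGAGCAACT ACTGGCTGGG")
print(len(demo), gc_fraction(demo))  # 20 0.55
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length matches the listed gene length, which is a quick way to confirm nothing was lost in copying.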