Gene SeD_A4059 details

Gene Information

Locus tag: SeD_A4059
Symbol:
ID: 6873708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3901211
End bp: 3902707
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 61%
IMG OID: 642787008
Product: L-xylulose/3-keto-L-gulonate kinase
Protein accession: YP_002217635
Protein GI: 198242694
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
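
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the function name `check_gene_coordinates` is hypothetical.

```python
def check_gene_coordinates(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check reported lengths against the coordinates.

    Assumes 1-based inclusive coordinates and one trailing stop codon
    that is part of the gene but not of the protein (assumptions, not
    stated in the record).
    """
    span = end_bp - start_bp + 1      # inclusive span in base pairs
    codons = span // 3                # whole codons covered by the span
    return {
        "span_bp": span,
        "span_matches_length": span == gene_length_bp,
        "whole_codons": span % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
report = check_gene_coordinates(3901211, 3902707, 1497, 498)
print(report)
```

Under those assumptions the span is 3902707 - 3901211 + 1 = 1497 bp, which is 499 codons, or 498 amino acids once the stop codon is excluded.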


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.822702
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACT ACTGGCTGGG GTTAGATTGT GGTGGGAGTT GGCTAAAAGC CGGGTTGTAC 
GATGGCGCAG GCCGGGAAGT AGCGGTGCAA CGCCTGCCGC TGCACGCTTT AAGCCCGCAG
CCAGGCTGGG TTGAACGCGA TATGACCGAA CTGTGGCAAC AGTGCGGCTC GGTCATCAGC
AAACTGCTGA CGCACACGGG GGTGAGCGGC TCACAAATCC GCGGTCTGGG TATTTCCGCT
CAGGGTAAGG GCCTGTTCCT GTTAGATAAA AGCGATCGGC CATTAGGTAA AGCGATACTC
TCTTCCGACC GTCGCGCCAT GGAAATTGTC CAGCGCTGGC AAAAAGAAGC GGTTCCGCAA
AAACTCTACC CGCTGACTCG GCAAACCCTG TGGACCGGGC ATCCGGTCTC CCTTTTACGC
TGGGTAAAAG AGAATGAGCC GCAGCGCTAC TCGCAGATAG GCAGCGTCAT GATGACGCAT
GACTATCTGC GCTGGTGCTT AACCGGCGTG AAAGGCTGTG AGGAGAGCAA CATCTCCGAG
TCCAACCTCT ACAACATGGC GACGGGCCAG TACGACCCGC GTCTGACCGA GTGGCTGGGC
ATCAGTGAAA TCGACAGCGC GCTGCCCCCC GTGGTGGGTT CAGCCGAAAT CTGCGGGGAG
ATCACCGCTC AGGCAGCCGC CATCACCGGT CTGGCGGCGG GCACCCCCGT CGTCGGCGGC
CTGTTTGATG TGGTTTCCAC CGCCCTTTGC GCCGGTATTG AGGATGAATC AACGCTCAAT
GCGGTGATGG GCACCTGGGC CGTCACCAGC GGCATCGCTC ACGGCCTGAG CGACCATGAG
GCCCATCCTT ACGTCTATGG CCGCTACGTC AATGACGGGC AGTATATCGT TCACGAAGCC
AGCCCGACCT CCTCCGGCAA CCTCGAATGG TTTACCGCCC AGTGGGGCGA CCTCTCTTTT
GACGAGATCA ACCAGGCGGT CGCCAGCCTG CCGAAAGCCG GTAGCGACCT CTTTTTTCTG
CCGTTTCTCT ACGGCAGCAA TGCCGGACTG GAGATGACCT GCGGCTTTTA CGGCATGCAG
GCGCTGCACA CCCGCGCCCA CCTGCTGCAG GCGATTTATG AAGGCGTGGT CTTCAGCCAT
ATGACCCACC TCAACCGCAT GCGTGAACGC TTTACCGACG TTTGCGCCCT GCGCGTTACC
GGCGGCCCGG CCCACTCCGA CGTCTGGATG CAGATGCTGG CGGACGTCAG CGGTTTACGC
ATTGAGCTGC CGCAGGTGGA GGAGACCGGC TGCTTCGGCG CGGCGCTGGC TGCCCGCGTC
GGCACCGGCG TATATCGCGA TTTCCGCGAG GCCCAACGCG ACCTGCAGCA CCCGGTGCGC
ACGCTGCTGC CGGACATGAC CGCGCACGCC CTCTACCAGC GCAAATACCG CCAATACCAG
GATTTGATTG AAGCACTACA GGGCTATCAC GCCCGTATTA AGGAGCACGC ATTATGA
 
Protein sequence
MSNYWLGLDC GGSWLKAGLY DGAGREVAVQ RLPLHALSPQ PGWVERDMTE LWQQCGSVIS 
KLLTHTGVSG SQIRGLGISA QGKGLFLLDK SDRPLGKAIL SSDRRAMEIV QRWQKEAVPQ
KLYPLTRQTL WTGHPVSLLR WVKENEPQRY SQIGSVMMTH DYLRWCLTGV KGCEESNISE
SNLYNMATGQ YDPRLTEWLG ISEIDSALPP VVGSAEICGE ITAQAAAITG LAAGTPVVGG
LFDVVSTALC AGIEDESTLN AVMGTWAVTS GIAHGLSDHE AHPYVYGRYV NDGQYIVHEA
SPTSSGNLEW FTAQWGDLSF DEINQAVASL PKAGSDLFFL PFLYGSNAGL EMTCGFYGMQ
ALHTRAHLLQ AIYEGVVFSH MTHLNRMRER FTDVCALRVT GGPAHSDVWM QMLADVSGLR
IELPQVEETG CFGAALAARV GTGVYRDFRE AQRDLQHPVR TLLPDMTAHA LYQRKYRQYQ
DLIEALQGYH ARIKEHAL
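
The sequences in this record are printed in space-separated groups of 10 residues, 60 per row, so they need to be joined before any computation such as a GC percentage. The following is a minimal sketch of that cleanup; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the gene sequence is used here for brevity.

```python
def clean_sequence(block):
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGAGCAACT ACTGGCTGGG GTTAGATTGT GGTGGGAGTT GGCTAAAAGC CGGGTTGTAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence block through `clean_sequence` should give a string whose length can be compared with the Gene Length field, and `gc_percent` of that string can be compared with the GC content field.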