Gene SeD_A4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4043 
SymbolxylB 
ID6872036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3885161 
End bp3886615 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content57% 
IMG OID642786992 
Productxylulokinase 
Protein accessionYP_002217619 
Protein GI198245798 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR01312] D-xylulose kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.120116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATATCG GGATCGATCT TGGTACATCG GGTGTAAAAG CTATCCTGTT GAATGAGCAG 
GGCGATGTGC TGGCTACGCA TACTGAAAAA CTGACCGTAT CGCGTCCGCA TCCCTTATGG
TCGGAACAGG AACCAGAGCA GTGGTGGCAG GCGACGGATC GTGCGGTTAA AGGTTTAGGT
CGGCAACAGT CGTTAAGTGG CGTTCGGGCG TTGGGCATCG CCGGACAAAT GCATGGCGCG
ACATTGCTTG ATAGCCGTCA GCAGGTTCTG CGACCAGCGA TTTTATGGAA TGACGGACGC
TGTAGCGAAG AGTGCGCCTG GCTGGAAAAA CAGGTGCCGC AGTCGCGTGC GATAACCGGT
AATCTGATGA TGCCCGGTTT TACCGCGCCC AAATTAGTCT GGGTGCAGCA CCACGAACCG
GATATTTTCT ACCAGATAGA CAAGGTTCTG CTGCCGAAAG ATTTTCTGCG GCTGCGAATG
ACAGGCGTCT TTGCCAGCGA TATGTCGGAT GCGGCGGGAA CGATGTGGCT GGACGTGAAA
AAGCGCGACT GGAGCGACGT TATGCTCAAC GCCTGTCATT TAACCCGACA GCAGATGCCA
GCGTTATTTG AAGGTAGTGA CATTACCGGA ACGTTGCTGC CGGAGGTAGC CAGCGCATGG
GGAATGCCAA CAGTACCCGT GGTGGCGGGC GGCGGCGACA ATGCGGCAGG CGCGGTCGGC
GTAGGAATGA TCGATGCCGG ACAGGCGATG CTCTCGCTCG GAACATCGGG CGTCTATTTT
GCCGTCAGCG ACGGCTTTCT GAGTAAACCG GAGAGCGCAG TACACAGTTT TTGCCATGCG
CTGCCGGAAC GCTGGCATTT AATGTCTGTG ATGTTGAGCG CCGCCTCTTG TCTGGACTGG
GCGGCTAAAC TCACCGGGCA GGAGAACGTC CCGGCGCTGA TTGCCGCCGC ACAGCAGGCG
GATGAGCATG CCGATTCGAT CTGGTTTTTG CCGTACCTGT CCGGCGAGCG CACGCCGCAC
AATAATCCGC AGGCAAAAGG CGTTTTCTTT GGTTTAACCC ATCAGCATGG CCCGGCGGAA
CTGGCGCGGG CGGTACTGGA GGGCGTGGGA TATGCTCTGG CGGACGGTAT GGATGTGGTT
CACGCCTGCG GCGTCAAACC CGCCAGCGTG ACGCTTATCG GCGGCGGGGC GCGCAGCGAA
TACTGGCGTC AGATGCTATC TGATATTAGC GGGCTACAGC TTGATTATCG TACTGGCGGC
GATGTAGGGC CAGCGCTGGG CGCGGCGCGT CTGGCGCAAA TTGCGGTGAA TAAACAGACT
CCGCTCGCCG ATGTATTGCC GCAGTTGCCG CTGGAGCAGG CGCATTATCC TGATGCGCAG
CATCATGCGG TTTATCAACA ACGTCGTGAA ACCTTCCGTC GACTTTATCA GCAGCTGTTG
CCGTTGATGT CATAA
 
Protein sequence
MYIGIDLGTS GVKAILLNEQ GDVLATHTEK LTVSRPHPLW SEQEPEQWWQ ATDRAVKGLG 
RQQSLSGVRA LGIAGQMHGA TLLDSRQQVL RPAILWNDGR CSEECAWLEK QVPQSRAITG
NLMMPGFTAP KLVWVQHHEP DIFYQIDKVL LPKDFLRLRM TGVFASDMSD AAGTMWLDVK
KRDWSDVMLN ACHLTRQQMP ALFEGSDITG TLLPEVASAW GMPTVPVVAG GGDNAAGAVG
VGMIDAGQAM LSLGTSGVYF AVSDGFLSKP ESAVHSFCHA LPERWHLMSV MLSAASCLDW
AAKLTGQENV PALIAAAQQA DEHADSIWFL PYLSGERTPH NNPQAKGVFF GLTHQHGPAE
LARAVLEGVG YALADGMDVV HACGVKPASV TLIGGGARSE YWRQMLSDIS GLQLDYRTGG
DVGPALGAAR LAQIAVNKQT PLADVLPQLP LEQAHYPDAQ HHAVYQQRRE TFRRLYQQLL
PLMS