Gene Information |
Locus tag | EcSMS35_3887 |
Symbol | xylB |
ID | 6144582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3954075 |
End bp | 3955529 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618713 |
Product | xylulokinase |
Protein accession | YP_001745852 |
Protein GI | 170680828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1070] Sugar (pentulose and hexulose) kinases |
TIGRFAM ID | [TIGR01312] D-xylulose kinase |
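The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3954075
end_bp = 3955529

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1455, matching "Gene Length | 1455 bp"
print(protein_length)  # 484, matching "Protein Length | 484 aa"
```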
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTATATCG GGATAGATCT TGGCACCTCG GGCGTAAAAG TTATTTTGCT CAATGAGCAG GGTGAGGTGG TTGCTGCGCA AACGGAAAAG CTGACCGTTT CGCGCCCGCA TCCACTCTGG TCGGAACAAG ACCCGGAACA GTGGTGGCAG GCAACTGATC GCGCAATGAA AGCTCTGGGC GTTCAGCATT CTCTGCAAGA CGTTAAAGCA TTAGGTATTG CCGGACAGAT GCACGGTGCA ACCTTACTGG ATGCCCAGCA ACGGGTATTA CGCCCTGCCA TTTTGTGGAA CGACGGGCGC TGTGCGCAAG AGTGCACTTT GCTGGAAGCG CGAGTTCCGC AATCACGGGT GATTACCGGT AACCTGATGA TGCCCGGATT TACTGCGCCT AAATTGCTAT GGGTTCATCG GCATGAGCCG GAGATATTCC GTCAAATCGA TAAAGTATTA TTACCGAAAG ATTACTTGCG TCTGCGTATG ACGGGGGAGT TTGCCAGCGA TATGTCTGAC GCAGCTGGCA CTATGTGGCT GGATGTCGCA AAGCGTGACT GGAGCGATCT CATGCTGCAG GCTTGCCACT TATCTCGTGA CCAGATGCCC GCATTATACG AAGGCAGCGA AATTACTGGT GCTTTGCTAC CTGAAGTTGC GAAAGCGTGG GGTATGGCGA CGGTGCCAGT TGTCGCAGGC GGTGGCGATA ATGCAGCTGG TGCGGTTGGT GTGGGAATGG TTGATGCTAA TCAGGCAATG TTATCGCTGG GGACGTCGGG GGTCTATTTT GCTGTCAGCG AAGGATTCTT AAGCAAGCCA GAAAGCGCCG TACACAGCTT TTGCCATGCG CTCCCGCAGC GTTGGCATTT AATGTCTGTG ATGCTGAGTG CGGCTTCTTG CCTGGATTGG GCCGCAAAAT TAACCGGTCT GAGCAATGTT CCAGCTTTAA TCGCGGCTGC TCAACAGGCT GATGAAAGTA CCGAGCCTGT CTGGTTTCTG CCTTATCTTT CCGGTGAACG TACGCCACAC AACAATCCCC AGGCGAAGGG GGTTTTCTTT GGTTTGACCC ATCAACATGG TGCTAATGAA CTGGCGCGAG CAGTGCTGGA AGGCGTGGGT TATGCGCTGG CAGATGGCAT GGATGTCGTA CATGCCTGCG GCATTAAACC GCAAAGTGTT ACGCTGATTG GTGGCGGGGC TCGTAGTGAG TACTGGCGTC AGATGCTGGC GGATATCAGC GGTCAGCAGC TCGATTACCG CACAGGGGGA GATGTGGGGC CAGCACTGGG TGCAGCAAGG CTGGCGCAGA TTGCGACGAA TCCAGAGAAA TCGCTCATTG AATTGTTGCC ACAACTACCG TTAGAACAGT CGCATCTACC AGATGCGCAG CGTTATGCCG CTTATCAGCC ACGACGAGAA ACGTTCCGTC GCCTCTATCA GCAACTTCTG CCATTAATGG CGTAA
Protein sequence | MYIGIDLGTS GVKVILLNEQ GEVVAAQTEK LTVSRPHPLW SEQDPEQWWQ ATDRAMKALG VQHSLQDVKA LGIAGQMHGA TLLDAQQRVL RPAILWNDGR CAQECTLLEA RVPQSRVITG NLMMPGFTAP KLLWVHRHEP EIFRQIDKVL LPKDYLRLRM TGEFASDMSD AAGTMWLDVA KRDWSDLMLQ ACHLSRDQMP ALYEGSEITG ALLPEVAKAW GMATVPVVAG GGDNAAGAVG VGMVDANQAM LSLGTSGVYF AVSEGFLSKP ESAVHSFCHA LPQRWHLMSV MLSAASCLDW AAKLTGLSNV PALIAAAQQA DESTEPVWFL PYLSGERTPH NNPQAKGVFF GLTHQHGANE LARAVLEGVG YALADGMDVV HACGIKPQSV TLIGGGARSE YWRQMLADIS GQQLDYRTGG DVGPALGAAR LAQIATNPEK SLIELLPQLP LEQSHLPDAQ RYAAYQPRRE TFRRLYQQLL PLMA
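The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hedged illustration, not the tool that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it assumes the sequence is given 5' to 3' on the coding strand in space-separated blocks of ten. It uses the standard amino acid assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in which start codons it allows). Only the first 30 and last 15 bases of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon table in NCBI ordering (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 15 bases of the gene sequence listed above.
head = "ATGTATATCG GGATAGATCT TGGCACCTCG"
tail = "CCATTAATGG CGTAA"

print(translate(head))  # MYIGIDLGTS, the first ten residues of the protein
print(translate(tail))  # PLMA, the last four residues; TAA is the stop codon
```

Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field of the record.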