Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4295 |
Symbol | rhaB |
ID | 6144349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4398407 |
End bp | 4399876 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619116 |
Product | rhamnulokinase |
Protein accession | YP_001746240 |
Protein GI | 170683095 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1070] Sugar (pentulose and hexulose) kinases |
TIGRFAM ID | [TIGR02627] rhamnulokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0758535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCTATCG TTTTAACAAT GGCCTGCATA GCCAGAACGG TTATGTCACC TGGGATGTGG ATAGCCTGGA AAGTGCCATT CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CTACAGGGTA AGCGTGTGGG TCTGCCCGTT GCTTATCGCG ATAGCCGCAC CAATGGCCTG ATGACGCAGG CACAACAGCA ACTCGGCAAA CGCGATATTT ATCAGCGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAGTTG CGTGCGCTGA CGGAGCAACA ACCTGAGCTT ATTCCACACA TTGCTCACGC TCTGCTGATG CCGGATTACT TCAGTTATCG CCTGACCGGC AAGATGAACT GGGAGTACAC CAACGCCACG ACCACGCAAC TGGTCAATAT TAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT TGGGCACTGG ATTTGCCCGC AGGGTAATGA GATTCCGGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC GCGGTTATCG CCTCGCCGTT AAACGGTTCA CGCGCCGCTT ATCTCTCTTC TGGCACCTGG TCATTGATGG GCTTCGAAAG CCAGACACCA TTTACCAATG ACACGGCGCT GGCAGCCAAC ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA TGGCTGCTTC AGCGGGTGCT TCAGGAGCGG CAAATCAACG ATCTTCCGGC GCTTATCGCC GCGACACAGG CACTGCCGGC TTGCCGCTTC ACCATCAATC CCAATGACGA TCGCTTTATT AATCCTGACG AGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAACGGC GCAACCAATC CCGGAAAGTG ATGCTGAACT GGCGCGCTGT ATTGTCGACA GTCTGGCGCT GCTGTATGCC GATGTGTTGC ATGAGCTGGC GCAACTGCGC GGTGAAGATT TCTCACAACT GCATATTGTC GGCGGTGGCT GCCAGAACGC GCTGCTCAAC CAGTTATGTG CTGATGCCTG CGGTATTCGG GTGATCGCCG GGCCTGTTGA AGCCTCGACG CTCGGCAATA TCGGCATCCA GTTAATGACG CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCA GATTCACTCT ACACGACAGA CAAAGGAGCT TTGTGCATGA
|
Protein sequence | MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIYRFNN GLHSQNGYVT WDVDSLESAI RLGLNKVCEE GIRIDSIGID TWGVDFVLLD LQGKRVGLPV AYRDSRTNGL MTQAQQQLGK RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL WLLQRVLQER QINDLPALIA ATQALPACRF TINPNDDRFI NPDEMCSEIQ AACRETAQPI PESDAELARC IVDSLALLYA DVLHELAQLR GEDFSQLHIV GGGCQNALLN QLCADACGIR VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVAQIHS TRQTKELCA
|
| |