Gene Information |
Locus tag | EcSMS35_0065 |
Symbol | araB |
ID | 6147000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 73411 |
End bp | 75111 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641614966 |
Product | ribulokinase |
Protein accession | YP_001742182 |
Protein GI | 170683895 |
COG category | [C] Energy production and conversion |
COG ID | [COG1069] Ribulose kinase |
TIGRFAM ID | [TIGR01234] L-ribulokinase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.268824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.306332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC TGCGCCACCG GTGAAGAGAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAGAAAGGG CAATTTTGTG ATGCCCCAAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA ATGGAAGCGG CGCTGAAAAC CGTGCTTGCA GAGCTTAGCG CCGAGCAGCG CGCAGCTGTG GTCGGGATTG GCGTTGACAC AACCGGCTCG ACGCCCGCAC CGATTGACGC GGACGGTAAC GTCCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA GACCACACCG CGGTTGAAGA AGCGGAAGAG ATAACCCGCT TATGTCACAC GCCGGGCAAC GTTGACTACT CCCGCTATAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATA CTGCATGTGA CTCGACAGGA CAGCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG TGCGACTGGG TACCAGCTCT GCTTTCCGGT ACCACGCGCC CGCAGGATAT TCGTCGCGGA CGTTGCAGCG CCGGGCATAA GTCGCTATGG CATGAAAGCT GGGGCGGCCT GCCGCCAGCC AGTTTCTTTG ACGAACTGGA CCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT GATACCTGGA CTGCCGATAT TCCGGTGGGC ACGTTATGTC CGGAATGGGC ACAGCGTCTC GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT GGTGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTGA AAGGTATTTG CGGTCAGGTT GATGGCAGCG TGGTGCCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGCGAT ATCTACGCCT GGTTCGGTCG CGTACTCGGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT CCGGAACTGA AAGAGCAAAT CAACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA GCATGGGCCA AAAATCCGTC TCTGGATCAC CTACCGGTGG TGCTCGACTG GTTTAACGGC CGCCGCACAC CGAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT GAACCTCGCC ACCGACGCAC CGCTGCTGTT CGGCGGTTTA ATTGCCGCCA CCGCCTTTGG CGCACGCGCA ATTATGGAGT GCTTCACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC ACTGGGCGGC ATCGCGCGCA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG CAAATTGTTG CCTCTGACCA GTGCTGTGCG CTCGGTGCGG CGATTTTCGC TGCCGTCGCC GCGAAAGTGC ACGCAGATAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA ACCCTGCAAC CGCGCAGCGA ACAAGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC CAGGCCGTTC CGACTCTATA A
|
Protein sequence | MAIAIGLDFG SDSVRALAVD CATGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES MEAALKTVLA ELSAEQRAAV VGIGVDTTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK DHTAVEEAEE ITRLCHTPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDSAV AQSAASWIEL CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLG WPLEQLAAQH PELKEQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRTPNANQRL KGVITDLNLA TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ QWAMSAEQHY LPTSAPAQAA QAVPTL
|
| |
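
The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Under those assumptions, the record's values (73411, 75111, 1701 bp, 566 aa) agree with one another. The GC helper ignores the spaces that separate the 10-base blocks in the sequence rows.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Amino-acid count for an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(73411, 75111)   # Start bp, End bp from the record
    print(length)                        # gene length in bp
    print(protein_length(length))        # protein length in aa
    # First two 10-base blocks of the gene sequence, as formatted in the record
    print(gc_percent("ATGGCGATTG CAATTGGCCT"))
```

To check the reported GC content, the full Gene sequence value can be passed to `gc_percent` as a single string; the result is not stated here because it has not been recomputed from the record.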