Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0068 |
Symbol | araB |
ID | 6967927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 73175 |
End bp | 74875 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643384148 |
Product | ribulokinase |
Protein accession | YP_002268671 |
Protein GI | 209400593 |
COG category | [C] Energy production and conversion |
COG ID | [COG1069] Ribulose kinase |
TIGRFAM ID | [TIGR01234] L-ribulokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.876967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC TGCACCACCG GTGAAGAAAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAAAAAGGG CAATTTTGTG ATGCCCCGAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA ATGGAAGCGG CACTGAAAAC CGTGCTTGCA GAACTTAGCG TCGAACAGCG CGCAGCTGTG GTCGGGATTG GCGTTGACAG TACCGGCTCG ACGCCCGCAC CGATTGATGC CGACGGTAAC GTGCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA GACCACACTG CGGTTGAAGA AGCGGAAGAG ATTACCCGTT TGTGCCACGC GCCGGGCAAT GTTGACTACT CCCGCTATAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATC CTGCATGTGA CTCGCCAGGA CAGCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG TGCGACTGGG TGCCAGCTCT GCTTTCCGGT ACCACGCGCC CGCAGGATAT TCGTCGCGGA CGTTGCAGCG CCGGGCATAA GTCGCTATGG CACGAAAGCT GGGGCGGCCT GCCGCCAGCC AGTTTCTTTG ATGAGCTGGA TCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT GATACCTGGA CTGCCGATAT TCCGGTGGGC ACCTTATGCC CGGAATGGGC GCAGCGTCTC GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT GGTGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTGA AAGGTATTTG CGGTCAGGTT GATGGCAGCG TGGTGCCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGTGAT ATCTACGCCT GGTTCGGTCG CGTACTCGGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT CCGGAACTAA AAGAGCAAAT CGACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA GCATGGGCCA AAAATCCGTC TCTGGATCAC CTGCCGGTGG TGCTCGACTG GTTTAACGGC CGCCGCACAC CGAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT TAACCTCGCT ACCGACGCTC CGCTGCTGTT CGGCGGTTTG ATTGCTGCCA CCGCCTTTGG CGCACGCGCA ATCATGGAGT GCTTTACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC GCTGGGCGGC ATCGCGCGGA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG CAAATTGTTG CCTCTGACCA ATGCTGTGCG CTCGGTGCGG CGATTTTTGC TGCCGTCGCC GCGAAAGTGC ACGCAGACAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA ACCCTGCAAC CGCGCAGCGA ACAGGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC CAGGCCGTTC CGACTCTATA A
|
Protein sequence | MAIAIGLDFG SDSVRALAVD CTTGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES MEAALKTVLA ELSVEQRAAV VGIGVDSTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK DHTAVEEAEE ITRLCHAPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDSAV AQSAASWIEL CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLG WPLEQLAAQH PELKEQIDAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRTPNANQRL KGVITDLNLA TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ QWAMSAEQHY LPTSAPAQAA QAVPTL
|
| |