Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0067 |
Symbol | araB |
ID | 5593126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 70103 |
End bp | 71803 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640919255 |
Product | ribulokinase |
Protein accession | YP_001456850 |
Protein GI | 157159532 |
COG category | [C] Energy production and conversion |
COG ID | [COG1069] Ribulose kinase |
TIGRFAM ID | [TIGR01234] L-ribulokinase |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 67 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC TGCGCCAGCG GTGAAGAGAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAAAAAGGG CAATTTTGTG ATGCCCCGAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA ATGGAAGCGG CACTGAAAAC CGTGCTTGCA GAGCTTAGCG TCGAACAGCG CGCAGCTGTG GTCGGGATTG GCGTTGACAC AACCGGCTCG ACGCCCGCAC CGATTGATGC CGACGGTAAC GTGCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA GACCACACTG CGGTTGAAGA AGCGGAAGAG ATTACCCGTT TGTGCCACGC GCCGGGCAAC GTTGACTACT CCCGCTACAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATC CTGCATGTGA CTCGCCAGGA CACCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG TGCGACTGGG TGCCAGCTCT GCTTTCCGGT ACCACCCGCC CGCAGGATAT TCGTCGCGGA CGTTGCAGCG CCGGGCATAA GTCGCTATGG CACGAAAGCT GGGGTGGCTT GCCGCCAGCC AGTTTCTTTG ATGAGCTGGA CCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT GACACCTGGA CTGCCGATAT TCCGGTGGGC ACCTTATGCC CGGAATGGGC GCAGCGTCTC GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT GGCGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTGA AAGGTATTTG CGGTCAGGTT GATGGCAGCG TGGTACCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGTGAT ATCTACGCCT GGTTTGGTCG CGTACTCGGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT CCGGAACTGA AAGCGCAAAT CAACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA GCATGGGCCA AAAATCCGTC TCTGGATCAC CTGCCGGTGG TGCTCGACTG GTTTAACGGC CGCCGCACAC CGAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT TAACCTCGCT ACCGACGCTC CGCTGCTGTT CGGCGGTTTG ATTGCTGCCA CCGCCTTTGG CGCACGCGCA ATCATGGAGT GCTTTACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC GCTGGGCGGC ATCGCGCGGA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG CAAATTGTTG CCTCTGACCA GTGCTGTGCG CTCGGTGCGG CGATTTTTGC TGCCGTCGCC GCGAAAGTGC ACGCAGACAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA ACCCTGCAAC CGCGCAGCGA ACAGGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC CAGGCCGTTG CGACTCTATA A
|
Protein sequence | MAIAIGLDFG SDSVRALAVD CASGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES MEAALKTVLA ELSVEQRAAV VGIGVDTTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK DHTAVEEAEE ITRLCHAPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDTAV AQSAASWIEL CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLG WPLEQLAAQH PELKAQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRTPNANQRL KGVITDLNLA TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ QWAMSAEQHY LPTSAPAQAA QAVATL
|
| |