Gene EcHS_A0067 details


Gene Information

Locus tag: EcHS_A0067
Symbol: araB
ID: 5593126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 70103
End bp: 71803
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 58%
IMG OID: 640919255
Product: ribulokinase
Protein accession: YP_001456850
Protein GI: 157159532
COG category: [C] Energy production and conversion
COG ID: [COG1069] Ribulose kinase
TIGRFAM ID: [TIGR01234] L-ribulokinase
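
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(70103, 71803)
print(length)                  # 1701
print(protein_length(length))  # 566
```

Both results agree with the Gene Length and Protein Length values listed in the record.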


Plasmid Coverage information

Num covering plasmid clones: 67
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC 
TGCGCCAGCG GTGAAGAGAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAAAAAGGG
CAATTTTGTG ATGCCCCGAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA
ATGGAAGCGG CACTGAAAAC CGTGCTTGCA GAGCTTAGCG TCGAACAGCG CGCAGCTGTG
GTCGGGATTG GCGTTGACAC AACCGGCTCG ACGCCCGCAC CGATTGATGC CGACGGTAAC
GTGCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA
GACCACACTG CGGTTGAAGA AGCGGAAGAG ATTACCCGTT TGTGCCACGC GCCGGGCAAC
GTTGACTACT CCCGCTACAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATC
CTGCATGTGA CTCGCCAGGA CACCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG
TGCGACTGGG TGCCAGCTCT GCTTTCCGGT ACCACCCGCC CGCAGGATAT TCGTCGCGGA
CGTTGCAGCG CCGGGCATAA GTCGCTATGG CACGAAAGCT GGGGTGGCTT GCCGCCAGCC
AGTTTCTTTG ATGAGCTGGA CCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT
GACACCTGGA CTGCCGATAT TCCGGTGGGC ACCTTATGCC CGGAATGGGC GCAGCGTCTC
GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT
GGCGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT
CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTGA AAGGTATTTG CGGTCAGGTT
GATGGCAGCG TGGTACCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGTGAT
ATCTACGCCT GGTTTGGTCG CGTACTCGGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT
CCGGAACTGA AAGCGCAAAT CAACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA
GCATGGGCCA AAAATCCGTC TCTGGATCAC CTGCCGGTGG TGCTCGACTG GTTTAACGGC
CGCCGCACAC CGAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT TAACCTCGCT
ACCGACGCTC CGCTGCTGTT CGGCGGTTTG ATTGCTGCCA CCGCCTTTGG CGCACGCGCA
ATCATGGAGT GCTTTACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC GCTGGGCGGC
ATCGCGCGGA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG
CAAATTGTTG CCTCTGACCA GTGCTGTGCG CTCGGTGCGG CGATTTTTGC TGCCGTCGCC
GCGAAAGTGC ACGCAGACAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA
ACCCTGCAAC CGCGCAGCGA ACAGGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG
CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC
CAGGCCGTTG CGACTCTATA A
 
Protein sequence
MAIAIGLDFG SDSVRALAVD CASGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES 
MEAALKTVLA ELSVEQRAAV VGIGVDTTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK
DHTAVEEAEE ITRLCHAPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDTAV AQSAASWIEL
CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT
DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI
LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLG WPLEQLAAQH
PELKAQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRTPNANQRL KGVITDLNLA
TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL
QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ
QWAMSAEQHY LPTSAPAQAA QAVATL
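
The gene and protein sequences above are printed in space-separated blocks of 10 residues. As a rough illustration of how the two relate, the Python sketch below strips that formatting, computes a GC fraction, and translates DNA in frame 1 using the amino acid assignments of NCBI translation table 11. The helper names are hypothetical. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for the 10-residue blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCGATTG CAATTGGCCT CGATTTTGGC"
last_blocks = "CAGGCCGTTG CGACTCTATA A"
print(translate(first_blocks))  # MAIAIGLDFG
print(translate(last_blocks))   # QAVATL
```

The two outputs match the first and last residues of the protein sequence listed above. Passing the full gene sequence to gc_content would give a value to compare against the GC content field.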