Gene EcolC_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3594 
Symbol 
ID6066889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3929533 
End bp3931233 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content58% 
IMG OID641603012 
Productribulokinase 
Protein accessionYP_001726535 
Protein GI170021581 
COG category[C] Energy production and conversion 
COG ID[COG1069] Ribulose kinase 
TIGRFAM ID[TIGR01234] L-ribulokinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.560965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000207418 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC 
TGCGCCAGCG GTGAAGAGAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAAAAAGGG
CAATTTTGTG ATGCCCCGAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA
ATGGAAGCGG CACTGAAAAC CGTGCTTGCA GAGCTTAGCG TCGAACAGCG CGCAGCTGTG
GTCGGGATTG GCGTTGACAG TACCGGCTCG ACGCCCGCAC CGATTGATGC CGACGGTAAC
GTGCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA
GACCACACTG CGGTTGAAGA AGCGGAAGAG ATTACCCGTT TGTGCCACGC GCCGGGCAAT
GTTGACTACT CCCGCTATAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATC
CTGCATGTGA CTCGCCAGGA CAGCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG
TGCGACTGGG TGCCAGCTCT GCTTTCCGGT ACCACCCGCC CGCAGGATAT TCGTCGCGGA
CGTTGCAGCG CCGGGCATAA ATCTCTGTGG CACGAAAGCT GGGGCGGCTT GCCGCCAGCC
AGTTTCTTTG ATGAGCTGGA CCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT
GACACCTGGA CTGCCGATAT TCCGGTGGGC ACCTTATGCC CGGAATGGGC GCAGCGTCTC
GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT
GGCGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT
CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTTA AAGGTATTTG CGGTCAGGTT
GATGGCAGCG TGGTGCCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGTGAT
ATCTACGCCT GGTTCGGTCG CGTACTCAGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT
CCGGAACTGA AAGCGCAAAT CAACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA
GCATGGGCCA AAAATCCGTC TCTGGATCAC CTGCCGGTGG TGCTCGACTG GTTTAACGGT
CGTCGCTCGC CAAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT TAACCTCGCT
ACCGACGCTC CGCTGCTGTT CGGCGGTTTG ATTGCTGCCA CCGCCTTTGG CGCACGCGCA
ATCATGGAGT GCTTTACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC GCTGGGCGGC
ATCGCGCGGA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG
CAAATTGTTG CCTCTGACCA GTGCTGTGCG CTCGGTGCGG CGATTTTTGC TGCCGTCGCC
GCGAAAGTGC ACGCAGACAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA
ACCCTGCAAC CGCGCAGCGA ACAGGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG
CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC
CAGGCCGTTG CGACTCTATA A
 
Protein sequence
MAIAIGLDFG SDSVRALAVD CASGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES 
MEAALKTVLA ELSVEQRAAV VGIGVDSTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK
DHTAVEEAEE ITRLCHAPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDSAV AQSAASWIEL
CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT
DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI
LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLS WPLEQLAAQH
PELKAQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRSPNANQRL KGVITDLNLA
TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL
QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ
QWAMSAEQHY LPTSAPAQAA QAVATL