Gene EcSMS35_0065 details

Gene Information

Locus tag: EcSMS35_0065
Symbol: araB
ID: 6147000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 73411
End bp: 75111
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 58%
IMG OID: 641614966
Product: ribulokinase
Protein accession: YP_001742182
Protein GI: 170683895
COG category: [C] Energy production and conversion
COG ID: [COG1069] Ribulose kinase
TIGRFAM ID: [TIGR01234] L-ribulokinase
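
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed gene length), and that the last codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 73411
end_bp = 75111

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1701 bp) and Protein Length (566 aa) fields.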


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.268824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.306332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC 
TGCGCCACCG GTGAAGAGAT CGCCACCAGC GTAGAGTGGT ATCCCCGTTG GCAGAAAGGG
CAATTTTGTG ATGCCCCAAA TAACCAGTTC CGTCATCATC CGCGTGACTA CATTGAGTCA
ATGGAAGCGG CGCTGAAAAC CGTGCTTGCA GAGCTTAGCG CCGAGCAGCG CGCAGCTGTG
GTCGGGATTG GCGTTGACAC AACCGGCTCG ACGCCCGCAC CGATTGACGC GGACGGTAAC
GTCCTGGCGC TGCGCCCGGA GTTTGCCGAA AACCCGAACG CGATGTTCGT ATTGTGGAAA
GACCACACCG CGGTTGAAGA AGCGGAAGAG ATAACCCGCT TATGTCACAC GCCGGGCAAC
GTTGACTACT CCCGCTATAT TGGCGGTATT TATTCCAGCG AATGGTTCTG GGCAAAAATA
CTGCATGTGA CTCGACAGGA CAGCGCCGTG GCGCAATCTG CCGCATCGTG GATTGAGCTG
TGCGACTGGG TACCAGCTCT GCTTTCCGGT ACCACGCGCC CGCAGGATAT TCGTCGCGGA
CGTTGCAGCG CCGGGCATAA GTCGCTATGG CATGAAAGCT GGGGCGGCCT GCCGCCAGCC
AGTTTCTTTG ACGAACTGGA CCCGATCCTC AATCGCCATT TGCCTTCCCC GCTGTTCACT
GATACCTGGA CTGCCGATAT TCCGGTGGGC ACGTTATGTC CGGAATGGGC ACAGCGTCTC
GGCCTGCCTG AAAGCGTGGT GATTTCCGGC GGCGCGTTTG ACTGCCATAT GGGCGCAGTT
GGTGCAGGCG CACAGCCTAA CGCACTGGTA AAAGTTATCG GTACTTCCAC CTGCGACATT
CTGATTGCCG ACAAACAGAG CGTTGGCGAG CGGGCAGTGA AAGGTATTTG CGGTCAGGTT
GATGGCAGCG TGGTGCCTGG ATTTATCGGT CTGGAAGCAG GCCAATCGGC GTTTGGCGAT
ATCTACGCCT GGTTCGGTCG CGTACTCGGC TGGCCGCTGG AACAGCTTGC CGCCCAGCAT
CCGGAACTGA AAGAGCAAAT CAACGCCAGC CAGAAACAAC TGCTTCCGGC GCTGACCGAA
GCATGGGCCA AAAATCCGTC TCTGGATCAC CTACCGGTGG TGCTCGACTG GTTTAACGGC
CGCCGCACAC CGAACGCTAA CCAACGCCTG AAAGGGGTGA TTACCGATCT GAACCTCGCC
ACCGACGCAC CGCTGCTGTT CGGCGGTTTA ATTGCCGCCA CCGCCTTTGG CGCACGCGCA
ATTATGGAGT GCTTCACCGA TCAGGGGATC GCCGTCAATA ACGTGATGGC ACTGGGCGGC
ATCGCGCGCA AAAACCAGGT CATTATGCAG GCCTGCTGCG ACGTGCTGAA TCGCCCGCTG
CAAATTGTTG CCTCTGACCA GTGCTGTGCG CTCGGTGCGG CGATTTTCGC TGCCGTCGCC
GCGAAAGTGC ACGCAGATAT CCCATCAGCC CAGCAAAAAA TGGCCAGTGC GGTAGAGAAA
ACCCTGCAAC CGCGCAGCGA ACAAGCACAA CGCTTTGAAC AGCTTTATCG CCGCTATCAG
CAATGGGCGA TGAGCGCCGA ACAACACTAT CTTCCAACTT CCGCCCCGGC ACAGGCTGCC
CAGGCCGTTC CGACTCTATA A
 
Protein sequence
MAIAIGLDFG SDSVRALAVD CATGEEIATS VEWYPRWQKG QFCDAPNNQF RHHPRDYIES 
MEAALKTVLA ELSAEQRAAV VGIGVDTTGS TPAPIDADGN VLALRPEFAE NPNAMFVLWK
DHTAVEEAEE ITRLCHTPGN VDYSRYIGGI YSSEWFWAKI LHVTRQDSAV AQSAASWIEL
CDWVPALLSG TTRPQDIRRG RCSAGHKSLW HESWGGLPPA SFFDELDPIL NRHLPSPLFT
DTWTADIPVG TLCPEWAQRL GLPESVVISG GAFDCHMGAV GAGAQPNALV KVIGTSTCDI
LIADKQSVGE RAVKGICGQV DGSVVPGFIG LEAGQSAFGD IYAWFGRVLG WPLEQLAAQH
PELKEQINAS QKQLLPALTE AWAKNPSLDH LPVVLDWFNG RRTPNANQRL KGVITDLNLA
TDAPLLFGGL IAATAFGARA IMECFTDQGI AVNNVMALGG IARKNQVIMQ ACCDVLNRPL
QIVASDQCCA LGAAIFAAVA AKVHADIPSA QQKMASAVEK TLQPRSEQAQ RFEQLYRRYQ
QWAMSAEQHY LPTSAPAQAA QAVPTL
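
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC calculation assumes the plain definition (G plus C bases divided by total length). The demo input is only the first row of the gene sequence, so its result is not comparable to the GC content field of the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence from this record (60 bases, not the full gene).
first_row = "ATGGCGATTG CAATTGGCCT CGATTTTGGC AGTGATTCTG TGCGAGCTTT GGCGGTGGAC"
first_row_clean = clean_sequence(first_row)
first_row_gc = gc_percent(first_row)
```

To check the full gene, paste all rows of the gene sequence into a single string and pass it to `gc_percent`; the cleaned length should equal the Gene Length field.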