Gene EcSMS35_4120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4120 
SymbolrbsK 
ID6145739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4217078 
End bp4218022 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content52% 
IMG OID641618944 
Productribokinase 
Protein accessionYP_001746082 
Protein GI170681947 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID[TIGR02152] ribokinase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.444588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.199768 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCC CGAATATGCA AAACGCAGGC AGCCTCGTTG TTCTTGGCAG CATTAATGCT 
GACCACATTC TTAATCTTCA ATCTTTTCCT ACCCCAGGCG AAACCGTAAC CGGTAACCAC
TATCAGGTTG CATTTGGCGG CAAAGGCGCT AATCAGGCTG TGGCTGCTGG GCGTAGCGGT
GCGAATATCG CGTTTATTGC CTGCACTGGC GATGACAGCA TTGGTGAGAG CGTTCGCCAG
CAGCTCGCCA CTGATAACAT CGATATTTCT CCGGTCAGCG TGATCAAAGG CGAATCAACA
GGTGTGGCGC TGATTTTTGT TAATGGCGAA GGTGAGAATG TCATCGGTAT TCATGCCGGC
GCTAATGCTG CCCTTTCCCC GGCACTGGTG GAAGCGCAAC GTGAGCGTAT TGCCAACGCG
TCAGCATTAT TAATGCAGCT GGAATCACCA CTCGAAAGTG TGATGGCAGC GGCGAAAATC
GCCCATCAAA ATAAGACTAT CGTTGCGCTT AACCCGGCTC CGGCTCGCGA ACTTCCTGAC
GAACTGCTGG CACTGGTGGA CATTATTACG CCAAACGAAA CGGAAGCAGA AAAGCTCACC
GGTATTCGTG TTGAAAATGA TGAAGATGCA GCGAAGGCGG CGCAGGTACT GCATGAAAAA
GGTATCCGTA CTGTACTGAT TACTTTAGGA AGTCGTGGTG TATGGGCTAG CGTGAATGGT
GAAGGTCAGC GCGTTCCTGG ATTCCGGGTG CAGGCTGTCG ATACCATTGC TGCCGGAGAT
ACCTTTAACG GCGCGTTAAT CACGGCGTTG CTGGAAGAAA AACCATTGCC AGAGGCGATT
CGGTTTGCCC ATGCTGCCGC TGCGATTGCC GTAACACGTA AAGGCGCACA ACCTTCCGTA
CCGTGGCGTG AAGAGATCGA CGCATTTTTA GACAGGCAGA GGTGA
 
Protein sequence
MDIPNMQNAG SLVVLGSINA DHILNLQSFP TPGETVTGNH YQVAFGGKGA NQAVAAGRSG 
ANIAFIACTG DDSIGESVRQ QLATDNIDIS PVSVIKGEST GVALIFVNGE GENVIGIHAG
ANAALSPALV EAQRERIANA SALLMQLESP LESVMAAAKI AHQNKTIVAL NPAPARELPD
ELLALVDIIT PNETEAEKLT GIRVENDEDA AKAAQVLHEK GIRTVLITLG SRGVWASVNG
EGQRVPGFRV QAVDTIAAGD TFNGALITAL LEEKPLPEAI RFAHAAAAIA VTRKGAQPSV
PWREEIDAFL DRQR