Gene EcSMS35_0521 details


Gene Information

Locus tag: EcSMS35_0521
Symbol: gsk
ID: 6144471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 528334
End bp: 529638
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 52%
IMG OID: 641615415
Product: inosine kinase
Protein accession: YP_001742622
Protein GI: 170680689
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
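
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 528334
end_bp = 529638

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# One codon per three bases; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1305 0 434
```

Under these assumptions the result agrees with the listed gene length of 1305 bp and protein length of 434 aa.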


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0701891
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTC CCGGTAAACG TAAATCCAAA CATTACTTCC CCGTAAATGC ACGCGATCCG 
CTGCTTCAGC AATTCCAGCC AGAAAACGAA ACCAGCGCTG CCTGGGTAGT GGGTATCGAT
CAAACGCTGG TCGATATTGA AGCGAAAGTG GATGATGAAT TCATTGAGCG TTATGGATTA
AGCGCCGGGC ATTCACTGGT GATTGAGGAT GACGTAGCCG AAGCGCTTTA TCAGGAACTA
AAACAGAAAA ACCTGATTAC CCATCAGTTT GCGGGTGGCA CCATTGGCAA CACCATGCAC
AACTACTCGG TGCTCGCGGA CGACCGTTCG GTGCTGCTGG GTGTGATGTG CAGCAATATT
GAAATTGGCA GCTATGCCTA TCGTTACCTG TGTAATACCT CCAGCCGTAC CGATCTTAAC
TATCTACAAG GCGTGGATGG CCCGATTGGT CGTTGCTTTA CGCTGATTGG CGAGTCCGGG
GAACGTACCT TTGCTATCAG CCCCGGCCAC ATGAACCAGC TGCGGGCTGA AAGTATTCCG
GAAGATGTGA TTGCCGGAGC CTCGGCTCTG GTTCTCACCT CTTATCTGGT GCGTTGCAAG
CCGGGTGAAC CCATGCCGGA AGCAACTATG AAAGCCATTG AGTACGCGAA GAAATATAAC
GTACCGGTGG TGCTGACGCT GGGCACCAAG TTTGTCATTG CCGAGAATCC GCAGTGGTGG
CAGCAATTCC TCAAAGACCA CGTCTCTATC CTTGCGATGA ACGAAGATGA AGCCGAAGCG
TTGACCGGAG AAAGCGATCC GTTGTTGGCA TCTGACAAGG CGCTGGACTG GGTTGATCTG
GTGCTGTGCA CCGCCGGGCC AATCGGCTTG TATATGGCGG GCTTTACCGA AGACGAAGCG
AAACGTAAAA CCCAGCATCC GCTGCTGCCG GGTGCTATAG CGGAATTTAA CCAGTATGAG
TTTAGCCGCG CCATGCGCCA CAAGGATTGC CAGAATCCGC TGCGTGTCTA TTCGCACATT
GCGCCGTACA TGGGCGGGCC GGAAAAAATC ATGAATACCA ACGGAGCAGG AGATGGCGCA
CTGGCAGCGT TGCTGCATGA CATTACCGCC AACAGCTACC ACCGTAGCAA CGTACCAAAC
TCCAGCAAAC ATAAATTCAC CTGGTTAACT TATTCATCGT TAGCGCAGGT GTGTAAATAT
GCTAACCGTG TGAGCTATCA GGTACTGAAC CAGCATTCAC CTCGTTTAAC GCGCGGCTTG
CCGGAGCGTG AAGACAGCCT GGAAGAGTCT TACTGGGATC GTTAA
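
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical. The record's 52% figure refers to the full 1305 bp sequence, and only the first line is used here as input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = "ATGAAATTTC CCGGTAAACG TAAATCCAAA CATTACTTCC CCGTAAATGC ACGCGATCCG"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the complete sequence as one multi-line string to `gc_percent` gives a value that can be compared with the GC content field.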
 
Protein sequence
MKFPGKRKSK HYFPVNARDP LLQQFQPENE TSAAWVVGID QTLVDIEAKV DDEFIERYGL 
SAGHSLVIED DVAEALYQEL KQKNLITHQF AGGTIGNTMH NYSVLADDRS VLLGVMCSNI
EIGSYAYRYL CNTSSRTDLN YLQGVDGPIG RCFTLIGESG ERTFAISPGH MNQLRAESIP
EDVIAGASAL VLTSYLVRCK PGEPMPEATM KAIEYAKKYN VPVVLTLGTK FVIAENPQWW
QQFLKDHVSI LAMNEDEAEA LTGESDPLLA SDKALDWVDL VLCTAGPIGL YMAGFTEDEA
KRKTQHPLLP GAIAEFNQYE FSRAMRHKDC QNPLRVYSHI APYMGGPEKI MNTNGAGDGA
LAALLHDITA NSYHRSNVPN SSKHKFTWLT YSSLAQVCKY ANRVSYQVLN QHSPRLTRGL
PEREDSLEES YWDR
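
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative and is not the database's own tooling. It relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted coding sequence up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGAAATTTC CCGGTAAACG TAAATCCAAA"))  # MKFPGKRKSK
print(translate("TACTGGGATC GTTAA"))                  # YWDR
```

The two outputs match the first ten and last four residues of the protein sequence listed above.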