Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0521 |
Symbol | gsk |
ID | 6144471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 528334 |
End bp | 529638 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615415 |
Product | inosine kinase |
Protein accession | YP_001742622 |
Protein GI | 170680689 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0701891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTC CCGGTAAACG TAAATCCAAA CATTACTTCC CCGTAAATGC ACGCGATCCG CTGCTTCAGC AATTCCAGCC AGAAAACGAA ACCAGCGCTG CCTGGGTAGT GGGTATCGAT CAAACGCTGG TCGATATTGA AGCGAAAGTG GATGATGAAT TCATTGAGCG TTATGGATTA AGCGCCGGGC ATTCACTGGT GATTGAGGAT GACGTAGCCG AAGCGCTTTA TCAGGAACTA AAACAGAAAA ACCTGATTAC CCATCAGTTT GCGGGTGGCA CCATTGGCAA CACCATGCAC AACTACTCGG TGCTCGCGGA CGACCGTTCG GTGCTGCTGG GTGTGATGTG CAGCAATATT GAAATTGGCA GCTATGCCTA TCGTTACCTG TGTAATACCT CCAGCCGTAC CGATCTTAAC TATCTACAAG GCGTGGATGG CCCGATTGGT CGTTGCTTTA CGCTGATTGG CGAGTCCGGG GAACGTACCT TTGCTATCAG CCCCGGCCAC ATGAACCAGC TGCGGGCTGA AAGTATTCCG GAAGATGTGA TTGCCGGAGC CTCGGCTCTG GTTCTCACCT CTTATCTGGT GCGTTGCAAG CCGGGTGAAC CCATGCCGGA AGCAACTATG AAAGCCATTG AGTACGCGAA GAAATATAAC GTACCGGTGG TGCTGACGCT GGGCACCAAG TTTGTCATTG CCGAGAATCC GCAGTGGTGG CAGCAATTCC TCAAAGACCA CGTCTCTATC CTTGCGATGA ACGAAGATGA AGCCGAAGCG TTGACCGGAG AAAGCGATCC GTTGTTGGCA TCTGACAAGG CGCTGGACTG GGTTGATCTG GTGCTGTGCA CCGCCGGGCC AATCGGCTTG TATATGGCGG GCTTTACCGA AGACGAAGCG AAACGTAAAA CCCAGCATCC GCTGCTGCCG GGTGCTATAG CGGAATTTAA CCAGTATGAG TTTAGCCGCG CCATGCGCCA CAAGGATTGC CAGAATCCGC TGCGTGTCTA TTCGCACATT GCGCCGTACA TGGGCGGGCC GGAAAAAATC ATGAATACCA ACGGAGCAGG AGATGGCGCA CTGGCAGCGT TGCTGCATGA CATTACCGCC AACAGCTACC ACCGTAGCAA CGTACCAAAC TCCAGCAAAC ATAAATTCAC CTGGTTAACT TATTCATCGT TAGCGCAGGT GTGTAAATAT GCTAACCGTG TGAGCTATCA GGTACTGAAC CAGCATTCAC CTCGTTTAAC GCGCGGCTTG CCGGAGCGTG AAGACAGCCT GGAAGAGTCT TACTGGGATC GTTAA
|
Protein sequence | MKFPGKRKSK HYFPVNARDP LLQQFQPENE TSAAWVVGID QTLVDIEAKV DDEFIERYGL SAGHSLVIED DVAEALYQEL KQKNLITHQF AGGTIGNTMH NYSVLADDRS VLLGVMCSNI EIGSYAYRYL CNTSSRTDLN YLQGVDGPIG RCFTLIGESG ERTFAISPGH MNQLRAESIP EDVIAGASAL VLTSYLVRCK PGEPMPEATM KAIEYAKKYN VPVVLTLGTK FVIAENPQWW QQFLKDHVSI LAMNEDEAEA LTGESDPLLA SDKALDWVDL VLCTAGPIGL YMAGFTEDEA KRKTQHPLLP GAIAEFNQYE FSRAMRHKDC QNPLRVYSHI APYMGGPEKI MNTNGAGDGA LAALLHDITA NSYHRSNVPN SSKHKFTWLT YSSLAQVCKY ANRVSYQVLN QHSPRLTRGL PEREDSLEES YWDR
|
| |