Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0569 |
Symbol | gsk |
ID | 6969832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 569502 |
End bp | 570806 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384614 |
Product | inosine kinase |
Protein accession | YP_002269128 |
Protein GI | 209399882 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.858923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTC CCGGTAAACG TAAATCCAAA CATTACTTCC CCGTAAATGC ACGCGATCCG CTGCTTCAGC AATTCCAGCC AGAAAACGAA ACCAGCGCCG CCTGGGTAGT GGGTATCGAT CAAACGCTGG TCGATATTGA AGCGAAAGTG GATGATGAAT TCATTGAGCG TTATGGATTA AGCGCCGGGC ATTCACTGGT GATTGAGGAT GATGTAGCCG AAGCGCTTTA TCAGGAACTA AAACAGAAAA ACCTGATTAC CCATCAGTTT GCGGGTGGCA CCATTGGTAA CACCATGCAT AACTACTCGG TGCTCGCGGA CGACCGTTCG GTGCTGCTGG GCGTCATGTG CAGCAATATT GAAATTGGCA GCTATGCCTA TCGTTACCTG TGTAACACCT CCAGCCGTAC CGATCTTAAC TATCTACAAG GCGTGGATGG CCCGATTGGT CGTTGCTTTA CGCTGATTGG CGAGTCCGGG GAACGTACCT TTGCTATCAG CCCTGGCCAC ATGAACCAGC TGCGGGCTGA AAGTATTCCG GAAGATGTGA TTGCCGGAGC CTCGGCTCTG GTTCTCACCT CTTATCTGGT GCGTTGCAAG CCGGGTGAAC CCATGCCGGA AGCAACCATG AAAGCCATTG AGTACGCGAA GAAATATAAC GTACCGGTGG TGCTGACGCT GGGAACTAAG TTTGTCATTG CCGAGAATCC ACAGTGGTGG CAGCAATTCC TCAAAGACCA CGTCTCTATC CTTGCGATGA ACGAAGATGA AGCCGAAGCG TTGACCGGAG AAAGCGATCC GTTGTTGGCA TCTGACAAGG CGCTGGACTG GGTAGATCTG GTGCTGTGCA CCGCCGGGCC AATCGGCTTG TATATGGCGG GCTTTACCGA AGACGAAGCG AAACGTAAAA CCCAGCATCC GTTGCTGCCG GGCGCTATAG CCGAATTCAA CCAGTATGAG TTTAGCCGCG CCATGCGCCA CAAGGATTGT CAGAATCCGC TGCGTGTCTA TTCGCACATT GCGCCGTACA TGGGCGGGCC GGAAAAAATC ATGAACACCA ACGGAGCAGG GGATGGCGCA CTGGCAGCGT TGCTGCATGA CATTACCGCC AACAGCTACC ATCGTAGCAA CGTACCAAAC TCCAGCAAAC ATAAATTCAC CTGGTTAACT TATTCATCGT TAGCGCAGGT GTGTAAATAT GCTAACCGTG TGAGCTATCA GGTACTGAAC CAGCATTCAC CTCGTTTAAC GCGCGGCTTG CCGGAGCGTG AAGACAGCCT GGAAGAGTCT TACTGGGATC GTTAA
|
Protein sequence | MKFPGKRKSK HYFPVNARDP LLQQFQPENE TSAAWVVGID QTLVDIEAKV DDEFIERYGL SAGHSLVIED DVAEALYQEL KQKNLITHQF AGGTIGNTMH NYSVLADDRS VLLGVMCSNI EIGSYAYRYL CNTSSRTDLN YLQGVDGPIG RCFTLIGESG ERTFAISPGH MNQLRAESIP EDVIAGASAL VLTSYLVRCK PGEPMPEATM KAIEYAKKYN VPVVLTLGTK FVIAENPQWW QQFLKDHVSI LAMNEDEAEA LTGESDPLLA SDKALDWVDL VLCTAGPIGL YMAGFTEDEA KRKTQHPLLP GAIAEFNQYE FSRAMRHKDC QNPLRVYSHI APYMGGPEKI MNTNGAGDGA LAALLHDITA NSYHRSNVPN SSKHKFTWLT YSSLAQVCKY ANRVSYQVLN QHSPRLTRGL PEREDSLEES YWDR
|
| |