Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2905 |
Symbol | scrK |
ID | 6143957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2977437 |
End bp | 2978357 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617774 |
Product | aminoimidazole riboside kinase |
Protein accession | YP_001744929 |
Protein GI | 170682402 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00998461 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAA GAGTATGGGT ACTCGGTGAT GCGGTTGTTG ATTTATTACC CGAAAGCCAG GGGAGACTAC TACAGTGTCC TGGCGGGGCG CCTGCTAATG TTGCAGTCGG TATCGCAAGG CTGGGGGGGA AAAGTGCCTT TATTGGCAAA GTTGGCGATG ATCCTTTCGG TCGCTTTATG TATCAGACAC TGAGTACAGA AAATGTTGAT ACACATTATA TGTCTCTTGA TCCTCAACAA CGCACCTCAA TTGTGGCTGT AGGACTTGAT GAGCAAGGAG AAAGAAACTT TACCTTTATG GTACGCCCAA GTGCCGATCT TTTTTTACAA CCTGGTGACC TTCCTGCATT TGGGCCGGGT GAATGGCTCC ATCTTTGTTC CATTGCGCTC AGTGCAGAAC CTTCCCGAAG TACCGCATTT CTGGCTATGG AGAAAATACG TCAGGCTGGC GGAAACATCA GTTTTGATCC CAATATCCGC AGCGATCTCT GGCAGAGTGA AGCGCTATTA AGGAAATACC TTGATCGCGC ACTTTCGCTG GCGAATATCG CTAAATTGTC CGAAGAAGAG TTGCTATTCA TCAGTGGCGA AAGCCAGGTT CAGCAAGGCG CATATTCATT AGTACAACGT TATTCGTTGA CTTTATTGCT TATTACACAA GGAAAAAATG GCGTACTTGT GTATTTTCAG GGGCAGTTTA TCCACTATCC CGCCAAACCT GTTTCTGTCG TCGATACGAC CGGGGCAGGA GATGCTTTTG TCGCTGGATT ACTTGCAGGT CTGGCTGATT CTGGAATACC AACAAATACC AGACAGCTTG AACGAATCAT TGCACAAGCT CAGATTTGTG GTGCTCTGGC GACCACGGCT AAAGGCGCGA TAACCGCCTT ACCCCGACAA CACGATCTCC CTTCACAATA G
|
Protein sequence | MSARVWVLGD AVVDLLPESQ GRLLQCPGGA PANVAVGIAR LGGKSAFIGK VGDDPFGRFM YQTLSTENVD THYMSLDPQQ RTSIVAVGLD EQGERNFTFM VRPSADLFLQ PGDLPAFGPG EWLHLCSIAL SAEPSRSTAF LAMEKIRQAG GNISFDPNIR SDLWQSEALL RKYLDRALSL ANIAKLSEEE LLFISGESQV QQGAYSLVQR YSLTLLLITQ GKNGVLVYFQ GQFIHYPAKP VSVVDTTGAG DAFVAGLLAG LADSGIPTNT RQLERIIAQA QICGALATTA KGAITALPRQ HDLPSQ
|
| |