Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4550 |
Symbol | alsK |
ID | 6144175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4652478 |
End bp | 4653407 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619366 |
Product | D-allose kinase |
Protein accession | YP_001746478 |
Protein GI | 170680209 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.598137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAAAC AGCATAACGT CGTAGCGGGC GTGGATATGG GGGCAACGCA TATCCGCTTT TGTCTGCGGA CAGCAGAAGG TGAAACGCTA CACTGCGAAA AAAAGCGGAC CGCAGAAGTT ATTGCTCCCG ACCTGGTGTC GGGTATCGGC GAAATGATTG ACGAGCAACT CAGGCGCTTT AACGCTCGCT GTCGTGGTCT GGTGATGGGA TTTCCGGCGC TGGTCAGTAA AGATAAACGC ACCATTATTT CTACGCCTAA CCTGCCGTTA ACAGCGGCGG ATTTATATGA TCTCGCCGAT AAGCTCGAAA ATACGCTGAA TTGTCCGGTT GAGTTTTCCC GCGACGTTAA CCTGCAACTC TCCTGGGACG TAGTAGAAAA CAGCCTTACG CAACAACTGG TTCTGGCGGC CTATCTCGGT ACGGGGATGG GGTTCGCAGT GTGGATGAAC GGTGCGCCGT GGACGGGTGC ACACGGTGTG GCAGGCGAAC TGGGCCATAT CCCCCTGGGA GATATGACCC AACACTGCGC GTGTGGCAAT CCTGGGTGCC TGGAAACCAA TTGCTCTGGA ATGGCGCTAA GACGCTGGTA CGAACAACAG CCCCGAAATT ACCCATTGAG CGCTCTTTTC GTCCATGCGG AAAACGCCCC TTTCGTCCAG AGTCTGCTTG AAAACGCGGC ACGAGCCATT GCCACCAGCA TTAATCTGTT CGATCCCGAT GCGGTAATTC TGGGTGGTGG CGTGATGGAT ATGCCCACCT TCCCACGCGA GACTCTCATT GCCATGACCC AAAAGTACCT GCGCCGTCCA CTGCCGTATC AGGTCGTGCG CTTTATTGCC GCCTCATCTT CTGACTTTAA TGGCGCTCAG GGTGCAGCAA TATTGGCACA TCAACGTTTT TTGCCACAGT CCTGTGCTAA AGCCCCATGA
|
Protein sequence | MQKQHNVVAG VDMGATHIRF CLRTAEGETL HCEKKRTAEV IAPDLVSGIG EMIDEQLRRF NARCRGLVMG FPALVSKDKR TIISTPNLPL TAADLYDLAD KLENTLNCPV EFSRDVNLQL SWDVVENSLT QQLVLAAYLG TGMGFAVWMN GAPWTGAHGV AGELGHIPLG DMTQHCACGN PGCLETNCSG MALRRWYEQQ PRNYPLSALF VHAENAPFVQ SLLENAARAI ATSINLFDPD AVILGGGVMD MPTFPRETLI AMTQKYLRRP LPYQVVRFIA ASSSDFNGAQ GAAILAHQRF LPQSCAKAP
|
| |