Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4240 |
Symbol | |
ID | 6145834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4337198 |
End bp | 4338184 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619062 |
Product | serine/threonine protein kinase |
Protein accession | YP_001746186 |
Protein GI | 170680021 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00616333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.232925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACA GCGCTTTTAC TTTCCAGACA CTACACCCGG ATACCATCAT GGATGCTCTG TTTGAGCAAG GGATCCGGGT GGATTCCGGT CTTACCCCGC TTAACAGCTA TGAAAACCGT GTCTATCAAT TTCAGGACGA AGATCGTCGA CGTTTTGTCG TCAAATTTTA TCGTCCTGAA CGTTGGGCAG CCGATCAAAT CCTCGAAGAA CATCAATTTG CGTTGCAGTT GGTAAATGAT GAAGTTCCGG TCGCAGCACC TGTGGCCTTT AACGGTCAGA CTTTATTGAA TCATCAGGGA TTTTATTTCG CTGTTTTTCC AAGCGTCGGT GGTCGCCAGT TCGAAGCTGA TAATATCGAT CAGATGGAAG CGGTTGGGCG TTATTTAGGG CGTATGCACC AGACGGGGCG CAAACAGCTT TTTATCCATC GCCCGACCAT CGGTCTGAAT GAATATCTCA TTGAGCCTCG CAAGCTGTTT GAGGACGCTA CACTGATACC TTCCGGGTTG AAAGCGGCAT TTCTGAAAGC GACAGATGAG CTGATCGCCG CCGTTACAGC ACACTGGCGG GAAGATTTCA CCGTTCTGCG GCTACATGGA GACTGCCACG CTGGGAATAT TCTCTGGCGC GATGGTCCAA TGTTTGTTGA TCTGGATGAT GCACGTAATG GTCCGGCTAT TCAGGATTTG TGGATGTTGC TCAATGGCGA TAAAGCCGAG CAGCGGATGC AACTGGAAAC TATTATTGAG GCTTATGAAG AATTTAGCGA GTTCGACACC GCTGAAATCG GACTGATTGA ACCTTTACGC GCCATGCGTT TGGTTTATTA TCTTGCCTGG CTAATGCGGC GTTGGGCTGA TCCCGCGTTC CCGAAAAATT TCCCGTGGTT AACCGGGGAA GATTACTGGC TACGACAGAC GGCGACTTTT ATAGAACAGG CAAAAGTTCT ACAAGAACCC CCTTTGCAAT TAACACCTAT GTATTAA
|
Protein sequence | MNNSAFTFQT LHPDTIMDAL FEQGIRVDSG LTPLNSYENR VYQFQDEDRR RFVVKFYRPE RWAADQILEE HQFALQLVND EVPVAAPVAF NGQTLLNHQG FYFAVFPSVG GRQFEADNID QMEAVGRYLG RMHQTGRKQL FIHRPTIGLN EYLIEPRKLF EDATLIPSGL KAAFLKATDE LIAAVTAHWR EDFTVLRLHG DCHAGNILWR DGPMFVDLDD ARNGPAIQDL WMLLNGDKAE QRMQLETIIE AYEEFSEFDT AEIGLIEPLR AMRLVYYLAW LMRRWADPAF PKNFPWLTGE DYWLRQTATF IEQAKVLQEP PLQLTPMY
|
| |