Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1930 |
Symbol | hemK |
ID | 6144851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1950168 |
End bp | 1951001 |
Gene Length | 834 bp |
Protein Length | 277 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616806 |
Product | N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase |
Protein accession | YP_001743982 |
Protein GI | 170683182 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00345869 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0875835 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATATC AACACTGGTT ACGTGAAGCA ATAAGCCAAC TTCAGGCGAG CGAAAGCCCG CGGCGTGATG CTGAAATCCT GCTGGCGCAT GTTACCGGCA AAGGGCGTAC TTTTATTCTC GCCTTTGGTG AAACGCAGCT GACTGACGAA CAATGTCAGC AACTTGATGC GCTACTGACG CGTCGTCGCG ATGGTGAACC TATTGCTCAT TTAACCGGGG TGCGAGAATT CTGGTCGCTG CCGTTATTTG TTTCGCCAGC GACCTTAATT CCGCGCCCGG ATACGGAGTG TCTGGTGGAG CAGGCACTGG CGCGGTTGCC TGAACAGCCT TGCCGTATTC TCGATCTCGG GACGGGTACC GGGGCGATTG CGCTGGCGCT GGCTAGCGAG CGCCCGGACT GCGAAATTAC CGCTGTAGAT TGTATGCCTG ATGCTGTCTC TCTGGCGCAA CGTAATGCCC AGAATCTGGC GATCAAAAAT ATCCACATTC TGCAAAGTGA CTGGTTTAGC GCGCTAGCCG GGCAGCAGTT TGCGATGATT GTCAGCAATC CGCCGTATAT TGACGAGCAG GACCCACATC TTCAACAAGG CGATGTCCGC TTTGAGCCGC TCACTGCACT GGTTGCGGCT GACAGTGGAA TGGCCGACAT CGTGCATATC ATCGAACAGT CGCGTAACGC GCTGGTATCC GGCGGCTTTC TGCTTCTGGA ACATGGCTGG CAGCAGGGCG AAGCGGTGCG ACAGGCATTT ATCCACGCGG GATATCATGA CGTCGAAACC TGTCGGGACT ATGGTGATAA CGAGCGCGTA ACGCTCGGCC GCTATTATCA ATGA
|
Protein sequence | MEYQHWLREA ISQLQASESP RRDAEILLAH VTGKGRTFIL AFGETQLTDE QCQQLDALLT RRRDGEPIAH LTGVREFWSL PLFVSPATLI PRPDTECLVE QALARLPEQP CRILDLGTGT GAIALALASE RPDCEITAVD CMPDAVSLAQ RNAQNLAIKN IHILQSDWFS ALAGQQFAMI VSNPPYIDEQ DPHLQQGDVR FEPLTALVAA DSGMADIVHI IEQSRNALVS GGFLLLEHGW QQGEAVRQAF IHAGYHDVET CRDYGDNERV TLGRYYQ
|
| |