Gene EcSMS35_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1930 
SymbolhemK 
ID6144851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1950168 
End bp1951001 
Gene Length834 bp 
Protein Length277 aa 
Translation table11 
GC content55% 
IMG OID641616806 
ProductN5-glutamine S-adenosyl-L-methionine-dependent methyltransferase 
Protein accessionYP_001743982 
Protein GI170683182 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00345869 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0875835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATATC AACACTGGTT ACGTGAAGCA ATAAGCCAAC TTCAGGCGAG CGAAAGCCCG 
CGGCGTGATG CTGAAATCCT GCTGGCGCAT GTTACCGGCA AAGGGCGTAC TTTTATTCTC
GCCTTTGGTG AAACGCAGCT GACTGACGAA CAATGTCAGC AACTTGATGC GCTACTGACG
CGTCGTCGCG ATGGTGAACC TATTGCTCAT TTAACCGGGG TGCGAGAATT CTGGTCGCTG
CCGTTATTTG TTTCGCCAGC GACCTTAATT CCGCGCCCGG ATACGGAGTG TCTGGTGGAG
CAGGCACTGG CGCGGTTGCC TGAACAGCCT TGCCGTATTC TCGATCTCGG GACGGGTACC
GGGGCGATTG CGCTGGCGCT GGCTAGCGAG CGCCCGGACT GCGAAATTAC CGCTGTAGAT
TGTATGCCTG ATGCTGTCTC TCTGGCGCAA CGTAATGCCC AGAATCTGGC GATCAAAAAT
ATCCACATTC TGCAAAGTGA CTGGTTTAGC GCGCTAGCCG GGCAGCAGTT TGCGATGATT
GTCAGCAATC CGCCGTATAT TGACGAGCAG GACCCACATC TTCAACAAGG CGATGTCCGC
TTTGAGCCGC TCACTGCACT GGTTGCGGCT GACAGTGGAA TGGCCGACAT CGTGCATATC
ATCGAACAGT CGCGTAACGC GCTGGTATCC GGCGGCTTTC TGCTTCTGGA ACATGGCTGG
CAGCAGGGCG AAGCGGTGCG ACAGGCATTT ATCCACGCGG GATATCATGA CGTCGAAACC
TGTCGGGACT ATGGTGATAA CGAGCGCGTA ACGCTCGGCC GCTATTATCA ATGA
 
Protein sequence
MEYQHWLREA ISQLQASESP RRDAEILLAH VTGKGRTFIL AFGETQLTDE QCQQLDALLT 
RRRDGEPIAH LTGVREFWSL PLFVSPATLI PRPDTECLVE QALARLPEQP CRILDLGTGT
GAIALALASE RPDCEITAVD CMPDAVSLAQ RNAQNLAIKN IHILQSDWFS ALAGQQFAMI
VSNPPYIDEQ DPHLQQGDVR FEPLTALVAA DSGMADIVHI IEQSRNALVS GGFLLLEHGW
QQGEAVRQAF IHAGYHDVET CRDYGDNERV TLGRYYQ