Gene ECD_04215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04215 
SymbolhsdM 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4488575 
End bp4490164 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content56% 
IMG OID 
ProductDNA methylase M 
Protein accessionACT45996 
Protein GI253980326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATA ACGATCTGGT CGCGAAGCTG TGGAAACTGT GCGACAACCT GCGCGATGGC 
GGCGTTTCCT ATCAAAACTA CGTCAATGAA CTCGCCTCGC TGCTGTTTTT GAAAATGTGT
AAAGAAACCG GACAGGAAGC GGAATACCTG CCGGAAGGCT ACCGCTGGGA TGACCTGAAA
TCCCGCATCG GCCAGGAGCA GTTGCAGTTC TACCGTAACC TGCTGGTGCA TCTGGGCGCC
GACAATCAAA AGCTGGTGCA GGCGGTGTTC CAGAACGTCA ACACCACCAT TACCCAGCCG
AAACAGTTGA CCGAACTGGT CAGCAATATG GATTCACTGG ACTGGTACAA CGGCGCGCAC
GGTAAGTCAC GCGATGACTT CGGCGATATG TACGAAGGGC TGTTGCAGAA AAACGCCAAC
GAAACCAAGT CTGGCGCGGG CCAGTACTTC ACCCCACGTC CGCTGATTAA AACCATTATT
CATCTGCTGA AACCGCAGCC GCGTGAAGTG GTGCAGGACC CGGCAGCAGG TACAGCGGGC
TTTTTGATTG AAGCTGACCG CTACGTTAAG TCGCAGACTA ACGATCTGGA CGACCTTGAT
GGCGACACGC AGGATTTCCA GATCCACCGC GCGTTTATCG GCCTCGAACT GGTACCCGGC
ACCCGTCGTC TGGCGCTAAT GAACTGCCTG CTGCACGATA TCGAAGGCAA CCTCGACCAC
GGTGGCGCAA TCCGTCTGGG CAACACCCTG GGTAGCGACG GTGAAAACCT GCCGAAGGCG
CATATTGTCG CCACTAACCC GCCGTTTGGT AGCGCCGCAG GCACCAACAT TACCCGTACC
TTTGTTCACC CGACCAGCAA CAAACAATTG TGCTTTATGC AGCATATTAT CGAAACACTG
CACCCCGGCG GTCGTGCGGC GGTGGTGGTG CCGGATAACG TGCTGTTTGA AGGCGGCAAA
GGCACGGACA TCCGTCGTGA CCTGATGGAT AAGTGTCATC TGCACACTAT TCTGCGTCTG
CCGACCGGTA TTTTTTACGC GCAGGGCGTG AAGACGAACG TGCTGTTCTT TACCAAAGGG
ACGGTGGCGA ACCCGCATCA GGATAAGAAC TGTACCGATG ATGTGTGGGT GTACGACCTG
CGTACCAATA TGCCGAGCTT CGGCAAACGC ACGCCGTTTA CCGACGAGCA TCTGCAGCCG
TTTGAGCGCG TGTATGGCGA AGATCCGCAC GGTTTAAGCC CGCGTAGCGA AGGGGAATGG
AGTTTTAACG CCGAAGAGAC GGAAGTTGCC GACAGCGAAG AGAACAAAAA CACCGACCAG
CACCTGGCTA CCAGCCGCTG GCGTAAGTTC ACCCGCGAGT GGATCCGCAC CACGAAATCC
GATTCGCTGG ATATCTCCTG GCTGAAAGAT AAAGATAGCA TTGATGCCGA CAACCTGCCG
GAGCCGGATG TATTAGCGGC AGAAGCGATG GGCGAGCTGG TACAGGCGCT GGGCGAACTG
GATGCGCTGA TACGTGAACT GGGAGCGAGC GATGAGGCGG ATGCACAGCG TCAGTTGCTG
GAAGAAGCGT TTGGTGGGGT GAAGGAATGA
 
Protein sequence
MNNNDLVAKL WKLCDNLRDG GVSYQNYVNE LASLLFLKMC KETGQEAEYL PEGYRWDDLK 
SRIGQEQLQF YRNLLVHLGA DNQKLVQAVF QNVNTTITQP KQLTELVSNM DSLDWYNGAH
GKSRDDFGDM YEGLLQKNAN ETKSGAGQYF TPRPLIKTII HLLKPQPREV VQDPAAGTAG
FLIEADRYVK SQTNDLDDLD GDTQDFQIHR AFIGLELVPG TRRLALMNCL LHDIEGNLDH
GGAIRLGNTL GSDGENLPKA HIVATNPPFG SAAGTNITRT FVHPTSNKQL CFMQHIIETL
HPGGRAAVVV PDNVLFEGGK GTDIRRDLMD KCHLHTILRL PTGIFYAQGV KTNVLFFTKG
TVANPHQDKN CTDDVWVYDL RTNMPSFGKR TPFTDEHLQP FERVYGEDPH GLSPRSEGEW
SFNAEETEVA DSEENKNTDQ HLATSRWRKF TREWIRTTKS DSLDISWLKD KDSIDADNLP
EPDVLAAEAM GELVQALGEL DALIRELGAS DEADAQRQLL EEAFGGVKE