Gene B21_04181 details

Gene Information

Locus tag: B21_04181
Symbol: hsdM
ID: 8112902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4486664
End bp: 4488253
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 56%
IMG OID: 644850322
Product: hypothetical protein
Protein accession: YP_003001895
Protein GI: 251787591
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
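
The length fields in this record agree with its coordinates. The sketch below reproduces that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4486664, 4488253)
print(length)                  # 1590
print(protein_length(length))  # 529
```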


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAATA ACGATCTGGT CGCGAAGCTG TGGAAACTGT GCGACAACCT GCGCGATGGC 
GGCGTTTCCT ATCAAAACTA CGTCAATGAA CTCGCCTCGC TGCTGTTTTT GAAAATGTGT
AAAGAAACCG GACAGGAAGC GGAATACCTG CCGGAAGGCT ACCGCTGGGA TGACCTGAAA
TCCCGCATCG GCCAGGAGCA GTTGCAGTTC TACCGTAACC TGCTGGTGCA TCTGGGCGCC
GACAATCAAA AGCTGGTGCA GGCGGTGTTC CAGAACGTCA ACACCACCAT TACCCAGCCG
AAACAGTTGA CCGAACTGGT CAGCAATATG GATTCACTGG ACTGGTACAA CGGCGCGCAC
GGTAAGTCAC GCGATGACTT CGGCGATATG TACGAAGGGC TGTTGCAGAA AAACGCCAAC
GAAACCAAGT CTGGCGCGGG CCAGTACTTC ACCCCACGTC CGCTGATTAA AACCATTATT
CATCTGCTGA AACCGCAGCC GCGTGAAGTG GTGCAGGACC CGGCAGCAGG TACAGCGGGC
TTTTTGATTG AAGCTGACCG CTACGTTAAG TCGCAGACTA ACGATCTGGA CGACCTTGAT
GGCGACACGC AGGATTTCCA GATCCACCGC GCGTTTATCG GCCTCGAACT GGTACCCGGC
ACCCGTCGTC TGGCGCTAAT GAACTGCCTG CTGCACGATA TCGAAGGCAA CCTCGACCAC
GGTGGCGCAA TCCGTCTGGG CAACACCCTG GGTAGCGACG GTGAAAACCT GCCGAAGGCG
CATATTGTCG CCACTAACCC GCCGTTTGGT AGCGCCGCAG GCACCAACAT TACCCGTACC
TTTGTTCACC CGACCAGCAA CAAACAATTG TGCTTTATGC AGCATATTAT CGAAACACTG
CACCCCGGCG GTCGTGCGGC GGTGGTGGTG CCGGATAACG TGCTGTTTGA AGGCGGCAAA
GGCACGGACA TCCGTCGTGA CCTGATGGAT AAGTGTCATC TGCACACTAT TCTGCGTCTG
CCGACCGGTA TTTTTTACGC GCAGGGCGTG AAGACGAACG TGCTGTTCTT TACCAAAGGG
ACGGTGGCGA ACCCGCATCA GGATAAGAAC TGTACCGATG ATGTGTGGGT GTACGACCTG
CGTACCAATA TGCCGAGCTT CGGCAAACGC ACGCCGTTTA CCGACGAGCA TCTGCAGCCG
TTTGAGCGCG TGTATGGCGA AGATCCGCAC GGTTTAAGCC CGCGTAGCGA AGGGGAATGG
AGTTTTAACG CCGAAGAGAC GGAAGTTGCC GACAGCGAAG AGAACAAAAA CACCGACCAG
CACCTGGCTA CCAGCCGCTG GCGTAAGTTC ACCCGCGAGT GGATCCGCAC CACGAAATCC
GATTCGCTGG ATATCTCCTG GCTGAAAGAT AAAGATAGCA TTGATGCCGA CAACCTGCCG
GAGCCGGATG TATTAGCGGC AGAAGCGATG GGCGAGCTGG TACAGGCGCT GGGCGAACTG
GATGCGCTGA TACGTGAACT GGGAGCGAGC GATGAGGCGG ATGCACAGCG TCAGTTGCTG
GAAGAAGCGT TTGGTGGGGT GAAGGAATGA
 
Protein sequence
MNNNDLVAKL WKLCDNLRDG GVSYQNYVNE LASLLFLKMC KETGQEAEYL PEGYRWDDLK 
SRIGQEQLQF YRNLLVHLGA DNQKLVQAVF QNVNTTITQP KQLTELVSNM DSLDWYNGAH
GKSRDDFGDM YEGLLQKNAN ETKSGAGQYF TPRPLIKTII HLLKPQPREV VQDPAAGTAG
FLIEADRYVK SQTNDLDDLD GDTQDFQIHR AFIGLELVPG TRRLALMNCL LHDIEGNLDH
GGAIRLGNTL GSDGENLPKA HIVATNPPFG SAAGTNITRT FVHPTSNKQL CFMQHIIETL
HPGGRAAVVV PDNVLFEGGK GTDIRRDLMD KCHLHTILRL PTGIFYAQGV KTNVLFFTKG
TVANPHQDKN CTDDVWVYDL RTNMPSFGKR TPFTDEHLQP FERVYGEDPH GLSPRSEGEW
SFNAEETEVA DSEENKNTDQ HLATSRWRKF TREWIRTTKS DSLDISWLKD KDSIDADNLP
EPDVLAAEAM GELVQALGEL DALIRELGAS DEADAQRQLL EEAFGGVKE
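
Both sequences are displayed in space-separated blocks of 10 characters, so the grouping has to be removed before lengths or base composition can be checked against the Gene Information fields. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGAACAATA ACGATCTGGT CGCGAAGCTG TGGAAACTGT GCGACAACCT GCGCGATGGC"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Running the same two helpers over the full gene sequence would give values to compare with the Gene Length and GC content fields above.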