Gene CHU_3354 details


Gene Information

Locus tag: CHU_3354
Symbol: hsdM
ID: 4185049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 3833441
End bp: 3835027
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 37%
IMG OID: 638073343
Product: type I restriction-modification system, M subunit
Protein accession: YP_679933
Protein GI: 110639723
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
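
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3833441
end_bp = 3835027
gene_length_bp = 1587
protein_length_aa = 528

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the terminal stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the values in the record.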


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.501947
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0329699
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAAG ATCAGAAACG AATATTAGAA CAGCAGCTTT GGAATATTGC TAATACCCTT 
CGAGGTAAAA TGAATGCAGA TGAATTCCGT GATTATATTC TGGGATTTAT TTTTTATAAA
TATCTGAGTG AAAAAATGGA GATTTTTGCC AATGATATTC TTAAGCAAGA TAAAATAAGC
TTTCGCGAAA TAACTCCTAA ACTAAAACAA GGCAAGGAAT ATTTAGAAGC AATCAGAGAA
GAGGCTTTAG AAAAATTAGG TTATTTTTTA AAACCGGAAG AATTGTTTAG TGAGGTAGCG
AAGAGGGGTA GAGGTTCCAA TGACGAAGGA GAAAATTTTG ATGAAGCTAA AACAAATTTC
ATTTTAGAAG ATCTTCAAAA AATTTTAATC AATATTCAGC TAAGTACCAT GGGTACGGAT
AGTGAAGAGG ACTTCGATAA TCTTTTCGAG GACATGGATT TGAACAGTAC CAAACTGGGC
AAAACGCCAG ATGCACGAAA TGCGATTATT GCAAAAGTCC TGACTCACCT CGATAAGATA
GATTTTAAAT TAGAAGATTT AGAGTCGGAT GTATTGGGAG ACTCGTATGA ATATCTCATA
GGACAATTTG CAAGTGGTGC AGGGAAAAAG GCAGGTGAAT TTTATACTCC TCAGCAGGTT
TCTAAAATCC TCGCAAAGAT TGTTACTACA GAAAAACATA AACTAAAGTC TGTGTATGAT
CCTACATGTG GTTCGGGCTC TTTATTGCTT AGAGTAGCGA GAGAAGTAAA AGATGTTGCA
AAGTTTTACG GACAGGAGAT GAATCGTACT ACGTATAACC TTGCACGAAT GAATATGATC
CTGCATGGTG TGCATTACAG AAAATTTGAT ATAAAGCAGG AAGACACGCT TGAGCATCCA
CAGCACATGG GGCAACAGTT TGAAGCTATC GTAGCCAATC CACCTTTTTC AGCGCAATGG
AGCGCGAACC CATTGCATTT AAGTGATGAT CGTTTCAGTC AATATGGTAA ACTGGCTCCG
GCAAGTAAAG CAGATTATGC GTTTGTACAG CACATGGTAC ATCATTTGGC GGAGAATGGG
ATCATGGCGC TTGTATTACC GCATGGCGTA TTGTTTAGAG GTGGAGCAGA ACAACATATC
CGTAAATATT TGATCGAACA GAAAAATTAT CTTGATGCAG TGATCGGCTT GCCGGGAAAT
ATTTTTTATG GAACAAGTAT TCCAACCTGT ATTCTGGTGA TTAAAAAATG TCGCGAAATG
CCGGATAATA TTTTATTCAT TGATGCCAGC AAAGAATTTG AAAAAGTAAA AACGCAGAAT
ATTTTAAGGG AAAAACATAT TGATAAAATT GTTGATACAT ACCGTAGCAG AAAGGAAATA
GAAAAGTACA GTCACTGTGC TTCGTTAAAG GAAATCGCTG AGAATGATTT CAACCTCAAT
ATCCCACGAT ACGTAGACAC GTTTGAAGAG GAAGAAGAAA TAGATATACA GGCAGTGATG
GCTGAAATAA AAAATCTTGA AGCCAAGCGT ACGGATCTGG ATAAACAGAT TGATGTGTAT
ATGAAAGAAC TTGGATTGGT ATTTTAA
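
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with whitespace between 10-base blocks; `gc_percent` is a hypothetical helper name, and the example runs only on the first line of the sequence rather than the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used as a small illustration.
first_line = "ATGTCAGAAG ATCAGAAACG AATATTAGAA CAGCAGCTTT GGAATATTGC TAATACCCTT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1587-base sequence to the same function gives the gene-wide figure.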
 
Protein sequence
MSEDQKRILE QQLWNIANTL RGKMNADEFR DYILGFIFYK YLSEKMEIFA NDILKQDKIS 
FREITPKLKQ GKEYLEAIRE EALEKLGYFL KPEELFSEVA KRGRGSNDEG ENFDEAKTNF
ILEDLQKILI NIQLSTMGTD SEEDFDNLFE DMDLNSTKLG KTPDARNAII AKVLTHLDKI
DFKLEDLESD VLGDSYEYLI GQFASGAGKK AGEFYTPQQV SKILAKIVTT EKHKLKSVYD
PTCGSGSLLL RVAREVKDVA KFYGQEMNRT TYNLARMNMI LHGVHYRKFD IKQEDTLEHP
QHMGQQFEAI VANPPFSAQW SANPLHLSDD RFSQYGKLAP ASKADYAFVQ HMVHHLAENG
IMALVLPHGV LFRGGAEQHI RKYLIEQKNY LDAVIGLPGN IFYGTSIPTC ILVIKKCREM
PDNILFIDAS KEFEKVKTQN ILREKHIDKI VDTYRSRKEI EKYSHCASLK EIAENDFNLN
IPRYVDTFEE EEEIDIQAVM AEIKNLEAKR TDLDKQIDVY MKELGLVF