Gene Hmuk_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0450 
Symbol 
ID8409949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp430420 
End bp431778 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content53% 
IMG OID645018773 
ProductDNA-cytosine methyltransferase 
Protein accessionYP_003176291 
Protein GI257386518 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.618512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG AACGGCTTAC AGCTATCGAC CTGTTCTGTG GAGCCGGCGG ACTTTCTCAG 
GGGCTTCACG ATGCGGGATT TGAGACACTC TGGGGCATTG ATCACGAGGA GAATACCAAG
CCAACGTACG AGGCAAACCA CGACTGTGAG ATGACGGTCG GAGATATCCG GGAAGAAGAA
CCACCGGATC TCGGACTGGA GGAGGGAGAA CTCGATCTCG TCGCAGGTGG GCCGCCCTGT
CCGACGTTTT CACTTGTGGG CCGGAGCAAA ATCAATTCTA TCGAGGGCCG GGACAACCAG
AGCGACGATC GTCATCTACT GTACGAAGAC TTCCTCCGTT TCGTAGATCA CTACCAGCCC
AAAGCCTTCC TGATGGAGAA CGTCGAAGGG ATGCTCTCGG CAGAAAATGA AGACGGAAAG
CCGGTCGTCG ATACCATCAA GGAACAGATG CGTGGAGAGC GAGAAGTTGC AGATCTTGAC
CTCGATCTAA ATTACAGTGT TCGAGTCCAG CTGCTGGATT CAGCAGACTA CGGGATCCCC
CAGCACAGAA AACGTCTCTT CTTCATCGGT AACCGTATCG GCGAAGAGAA TCCGGACATG
ACTGAGTGGG AAACCCACCG GAAGCCGAAG AACGAGGAAG AGAAGAAAAT CAAATACAAG
GAAGACCCAT CAGAGCGATC TGAGGAAGAC CAGTCCACAT TGCACGGGTT TGTGGATGAA
GATGGGACCG AAGAGTTCCC TGCCTTCCGC AAAAACAGAC AGAGCAAGGA GCCATGGAAC
ACAGTAGCTG ATGGCATTCT CGATCTTCCT CCTGTTTCCC CCAGTGGAGA TACACCACCG
ACCAAAGCAG AAGAATACGA GATTGGTCCC GTCTCAGAAT TCCAGTACTG GGCACGGAAT
CTCAGCGAGG AACAGGACTG GGAAGATCAG CCCCTTCTAA ATCACGAGTG CCGGGGACAC
AACATGCGTG ACCTCACCCT CTACAAGCTC CTCGGGGAGG GTACCTCGTA CATCATCGGA
GACATCCCAG AAGAGCACCA GCCGTATCGG ACCGACATCT TCCCGGACAA GCTAAAGAAG
CAGAACCCGA AAGAACCTGC AACAACGATT GTGGCCCATC TCTACAAGGA CGGGCATATG
TTCATCCACC CGAACGAGGC TCGATCGATT ACGGTGCGGG AAGCTGCTCG ACTTCAGTCT
TTCAAAGACA CCTTCGAGTT CCCGGTCTCA CGTACACACG CATTCAAACA GGTCGGTAAT
GCCGTCCCAC CACTTCTGGC ACAGGCTTTA GCCACTGCTA TCCGAACAGA AATCTTCCAT
TCCCCAGTAG AGAGAACTGA ACCTCGAGAG GCTTATTAA
 
Protein sequence
MSEERLTAID LFCGAGGLSQ GLHDAGFETL WGIDHEENTK PTYEANHDCE MTVGDIREEE 
PPDLGLEEGE LDLVAGGPPC PTFSLVGRSK INSIEGRDNQ SDDRHLLYED FLRFVDHYQP
KAFLMENVEG MLSAENEDGK PVVDTIKEQM RGEREVADLD LDLNYSVRVQ LLDSADYGIP
QHRKRLFFIG NRIGEENPDM TEWETHRKPK NEEEKKIKYK EDPSERSEED QSTLHGFVDE
DGTEEFPAFR KNRQSKEPWN TVADGILDLP PVSPSGDTPP TKAEEYEIGP VSEFQYWARN
LSEEQDWEDQ PLLNHECRGH NMRDLTLYKL LGEGTSYIIG DIPEEHQPYR TDIFPDKLKK
QNPKEPATTI VAHLYKDGHM FIHPNEARSI TVREAARLQS FKDTFEFPVS RTHAFKQVGN
AVPPLLAQAL ATAIRTEIFH SPVERTEPRE AY