Gene Hmuk_1040 details


Gene Information

Locus tag: Hmuk_1040
Symbol:
ID: 8410559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 992296
End bp: 994917
Gene Length: 2622 bp
Protein Length: 873 aa
Translation table: 11
GC content: 63%
IMG OID: 645019376
Product: MCM family protein
Protein accession: YP_003176874
Protein GI: 257387101
COG category: [L] Replication, recombination and repair
COG ID: [COG1241] Predicted ATPase involved in replication control, Cdc46/Mcm family
TIGRFAM ID: [TIGR01443] intein C-terminal splicing region; [TIGR01445] intein N-terminal splicing region
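
The coordinate and length fields in this record are arithmetically related, and the following minimal Python sketch shows one way to cross-check them. It assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the reported protein length excludes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(992296, 994917)
print(length)                     # 2622
print(protein_length_aa(length))  # 873
```

Both results agree with the Gene Length and Protein Length fields, which suggests the assumed coordinate convention matches the one used for this entry.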


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.304437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0782008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACCG GCGTCGACAA CACCGAACTC ACCGACGCGT TCGAGGAGTT CTACCGGGAC 
TACTACCGCA ACGAGATCGG TGAACTCGCC CAGAAGTACC CCAACGACCA GAAGTCGCTG
TGGGTCGACT GGGACGACCT CTATCGCTTC GACCCCGACC TCGCGGACGA CGTGCGCAAC
CGTCCCGAGC AGATGCAAGA CTACGCCGAG GAGGCGTTGC GCCTGTACGA CCTGCCGGTC
GACGTGAAGC TCGGCCAGGC CCACGTCCGC TTTCACGACC TCCCCGAGTC CGAGGACATC
CGTGCGATTC GCCACGAGCA CCACGGGATG CTCATCGCCG TCCAGGGGAT CGTCCGCAAG
GCCACCGACG TTCGGCCAAA GGTCACGAAC GCCGCCTTCG AGTGCCAGCG CTGTGGCACC
CTCACCCGGA TTCCGCAGGT CGCCGGTGAC TTTCAGGAGC CCCACGAGTG CCAGGGCTGT
GAGCGCCAGG GCCCCTTCCG CCTGAACATG GACCAGTCGG AGTTCGTCGA CGCCCAGAAG
ATCCGTGTCC AGGAGTCCCC CGAAGGACTG CGTGGCGGCG AGACCCCCCA GGCCATCGAC
GTGAACATCG AGGACGATAT CACCGGCGAG GTGACCGCCG GCGACCACGT CCGCGTGACC
GGCGTCCTCA AACTCGACCA GCAGGGCGAC GACCGCAGCC AGTCGCCGAT GTTCGACCTC
TACATGGACG GGATCGACGT CTCCATCGAA GACGAGCAGT TCGAGGACAT GGACATCACC
GAGGAGGACA AGAAAGAGAT CATCGAACTC TCCAACGAGG ACGACCTCTA CGACAAGATG
GTCGGTGCGA TCGCGCCCTC GATCTACGGC TACGAACGCG AGAAACTCGC GATGATGCTC
CAGCTGTTCT CCGGGGTGAC CAAACACCTC CCGGACGGCT CTCGAATCCG TGGCGACCTC
CACATGTTGC TGATCGGTGA TCCGGGTACG GGGAAGTGCC TCAGCGGTGA CACTCATGTG
ACGCTCGGTG ATGGCTCTGA GGTTCCGATT CGAACGCTTG TGGAAGACAA CCTCGACGAC
CCAAAACCCG TCGATGATGG CGTTTGGGAC ACTGTCGACT TCGAGGTGCC ATCGTTGCAG
GAAGATGGGA CTATTTCCCA GCAGCAGGCA ACGAAGGTCT GGAAGCGTGA GGCACCGGAG
CAGCTGTATC GGATTCGGAC GGCGACGGGA CGAGAACTCG ATATTACTCC GTCACACCCG
TTGTTTGTGC AGTCTAACGG TCGGTTCGAG GCTGTGAAAG CCGAACAGCT GACAGCCGGG
CAGATGATCG CGTGCAAGGG CAATAACGAC GAGACCGAAC ACGGGCAGAG CACCGTCGCA
GCGGACGGTG GGGTAGTCAC TGCCCAAACT GATCGAATCG AATCCATCGA GCCAGTCGAA
CCGGAGGACG AGTGGGTTTA CGACCTCGAA GTCGGGGGAA CACACAACTA CGTCTCCAAC
GGCGTCGTTT CCCACAACTC GCAGATGCTC TCCTACATCC AGAACATCGC ACCACGGTCT
GTCTACACCT CCGGGAAGGG ATCCAGCAGC GCAGGCCTTA CCGCAGCGGC TGTGAGAGAC
GACTTCGGCG ACGGCCAGCA GTGGACGCTG GAGGCGGGCG CGCTCGTGCT CGCGGACCTC
GGGATCGCCG CTGTCGACGA GCTCGACAAG ATGAATCCCG ACGACCGCTC CGCGATGCAC
CAGGCCCTCG AACAGCAGGA GATCTCCATC AACAAGGCCG GGATCAACGC GACGCTGAAG
TCCCGGTGTT CCCTGCTGGG GGCGGCCAAC CCCAAGTACG GCCGGTTCGA CCAGTTCGAG
CCCATCGGCG AGCAGATCGA TCTCGAACCC GCGCTGGTCT CACGGTTCGA CCTCATTTTT
ACGGTGACCG ACGAGCCCGA CGAGGAGGAA GACCGGAACC TGGCGAGTCA CATCATCCAG
ACGAACTACG CGGGGGAACT CCACACCCAT CGCGTGGAGA ATCCCACCTC GGACTACAGC
CAGGAGCAGG TCGACGCCGT CACCGAGGAG GTCGCGCCGA CGATCGAGCC GGACCTGTTG
CGAAAGTACG TCGCTCACGC GAAGACGAGT TGCTTCCCGA CGATGACCGA GGAGGCAAAG
ACCGAGATCG AGGACTTCTA CGTCGATCTG CGGGTCCAGG GGACCGACGA GGACGCCGCG
GTGCCGGTGA CGGCCCGAAA GCTGGAGGCG CTGGTCCGTC TCTCCGAGGC GTCGGCGCGG
ATCCGACTCT CGGACACGGT CGAGAAAGAG GACGCCGAGC GGGCGACGAC GATCGCTCGC
TACTGCATGG AGCAGATCGG CGTCGATCCC GAGACGGGCG AGTTCGACGC CGACGTTGTC
GAGACCGGCA CCTCGAAGAG CCAGCGCGAC CGGATACAGA ACCTCAAGGG AATCATCTCC
GACATCGAGG AGGAGTACGA CGAGGGCGCG CCGGTCGACG TGGTCGTCGA GCGAGCGGAG
GAGGTCGGAA TCGAGGAGTC CAAGGCCGAA CACGAGATCG AGAAGCTCAA GCAGAAAGGC
GAGGTGTACG AACCACGGAC CGATCACCTG AGGACGACGT AG
 
Protein sequence
MATGVDNTEL TDAFEEFYRD YYRNEIGELA QKYPNDQKSL WVDWDDLYRF DPDLADDVRN 
RPEQMQDYAE EALRLYDLPV DVKLGQAHVR FHDLPESEDI RAIRHEHHGM LIAVQGIVRK
ATDVRPKVTN AAFECQRCGT LTRIPQVAGD FQEPHECQGC ERQGPFRLNM DQSEFVDAQK
IRVQESPEGL RGGETPQAID VNIEDDITGE VTAGDHVRVT GVLKLDQQGD DRSQSPMFDL
YMDGIDVSIE DEQFEDMDIT EEDKKEIIEL SNEDDLYDKM VGAIAPSIYG YEREKLAMML
QLFSGVTKHL PDGSRIRGDL HMLLIGDPGT GKCLSGDTHV TLGDGSEVPI RTLVEDNLDD
PKPVDDGVWD TVDFEVPSLQ EDGTISQQQA TKVWKREAPE QLYRIRTATG RELDITPSHP
LFVQSNGRFE AVKAEQLTAG QMIACKGNND ETEHGQSTVA ADGGVVTAQT DRIESIEPVE
PEDEWVYDLE VGGTHNYVSN GVVSHNSQML SYIQNIAPRS VYTSGKGSSS AGLTAAAVRD
DFGDGQQWTL EAGALVLADL GIAAVDELDK MNPDDRSAMH QALEQQEISI NKAGINATLK
SRCSLLGAAN PKYGRFDQFE PIGEQIDLEP ALVSRFDLIF TVTDEPDEEE DRNLASHIIQ
TNYAGELHTH RVENPTSDYS QEQVDAVTEE VAPTIEPDLL RKYVAHAKTS CFPTMTEEAK
TEIEDFYVDL RVQGTDEDAA VPVTARKLEA LVRLSEASAR IRLSDTVEKE DAERATTIAR
YCMEQIGVDP ETGEFDADVV ETGTSKSQRD RIQNLKGIIS DIEEEYDEGA PVDVVVERAE
EVGIEESKAE HEIEKLKQKG EVYEPRTDHL RTT
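
The sequences above are laid out in space-separated groups of ten characters, so they need to be normalized before any downstream use. The sketch below is one possible way to do that in Python and to compute a GC fraction from the cleaned nucleotide string. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first line of the gene sequence, not on the full record.

```python
def normalize_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already normalized DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above, used as a small illustration.
first_line = "ATGGCTACCG GCGTCGACAA CACCGAACTC ACCGACGCGT TCGAGGAGTT CTACCGGGAC"
dna = normalize_sequence(first_line)
print(len(dna))                    # 60
print(round(gc_fraction(dna), 3))  # GC fraction of this single line only
```

Applying the same two functions to the complete gene sequence block would be a way to compare the result against the GC content field of the record.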