Gene Hmuk_0763 details


Gene Information

Locus tag: Hmuk_0763
Symbol:
ID: 8410277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 731208
End bp: 735329
Gene Length: 4122 bp
Protein Length: 1373 aa
Translation table: 11
GC content: 68%
IMG OID: 645019098
Product: DNA polymerase II, large subunit DP2
Protein accession: YP_003176601
Protein GI: 257386828
COG category: [L] Replication, recombination and repair
COG ID: [COG1933] Archaeal DNA polymerase II, large subunit
TIGRFAM ID: [TIGR00354] DNA polymerase, archaeal type II, large subunit
            [TIGR01443] intein C-terminal splicing region
            [TIGR01445] intein N-terminal splicing region
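
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields.
START_BP = 731208
END_BP = 735329
GENE_LENGTH_BP = 4122
PROTEIN_LENGTH_AA = 1373


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


print(span_length(START_BP, END_BP))            # 4122
print(expected_protein_length(GENE_LENGTH_BP))  # 1373
```

Both results match the Gene Length and Protein Length fields: 4122 bp is 1374 codons, one of which is the stop codon.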


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.107961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.449591
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGAGG TCGACGAACG CTACTTCGAC CGCCTGGAAT CCCAGCTCGA CGACGCCTTC 
GAGGTGGCCC AGGCGGCCAA GCGACGCGGC GGCGATCCCG TCCCCGAGGT CGAGATCCCG
ACGGCCAGAG ACATGGCCGA CCGCGTCGAG AACATCCTCG GGATCGACGG CGTCGCCGAA
CGCGTCCGCG AACTCGAAGG CGAGATGAGC AGGGAGGAGG CCGCACTGGA GCTGGTCTCC
GACTTCGTGG AGGGCGAGGT CGGGGACTAC GACTCCAAGG CCGGCAAGGT CGAGGGTGCC
GTTCGAACGG CCGTCGCCCT GTTGACCGAG GGGGTCGTCG CCGCACCGAT CGAGGGGATC
GACCGCGTCG AGATCCTCGA AAACGACGAC GGCACCGAGT TCGTCAACGT CTACTACGCC
GGCCCGATCC GCTCGGCCGG CGGGACCGCA CAGGCGCTCT CGGTTCTCGT GGCCGACTAC
GCTCGCGCGC TGCTGGGCAT CGAGGAGTAC GAGGCCCGCG ACGAGGAGAT CAACCGCTAC
GCCGAGGAGA TCGAACTGTA CGACAAGGAG ACGGGCCTCC AGTACTCGCC CAAGGAGAAG
GAGTCGAAGT TCATCGCCGA GCACATGCCG ATCATGCTGG ACGGCGAGGC GACCGGCGAC
GAGGAGGTCT CGGGCTTTCG GGACCTCGAA CGGGTCGACA CCAACTCCGC CCGCGGTGGG
ATGTGTCTCG TCGCCGCCGA GGGCATCGCG CTGAAGGCTC CGAAGATCCA GCGCTACACC
CGCAACTTAG AGGAGGTCGA CTGGCCGTGG CTGCAAGACC TCATCGACGG AACGATCGGC
AAAGACGGCG ACGAGGACGG CGAGGAGGAC GCGGCCGACG AGGGAGACGA CGGAGACGAG
GACGAGGGCG ACGAGACCTC TACGAGCGAG GACGACGGCC CGCCGCGTGC CGATCCCAGT
ACGAAGTTCC TCCGTGACCT CATCGCCGGC CGACCGGTCT TCTCACATCC GAGCGAACCC
GGTGGCTTTC GCCTGCGGTA CGGCCGGGCG CGCAACCACG GCAACGCGAC TGCCGGCGTC
CACCCCGCGA CGATGCACCT GGTCGACGAC TTCCTCGCGA CCGGCACCCA GATCAAGACC
GAACGGCCCG GCAAGGCCGC TGGCGTGGTC CCGGTCGACA CCATCGAGGG GCCGACGGTC
AAGCTCGCCA ACGGGGACGT GCGCCGGGTC GACGACCCGG CCGAGGCAAT CGAGGTCCGC
AACGGCGTCG AGAAGGTGCT GGATCTGGGC GAGTACCTCG TCAACTACGG CGAGTTCGTC
GAGAACAACC ACCCGCTCGT GCCGGCCTCC TACACCGTCG AGTGGTGGCG ACAGGAGTTC
GACGACGCCG GTGGCGACGT GCAGGCCCTG CAAGACGACG TTCACGTCGA CCTGGCAGAT
CCGACTGCCG AGGAGGCGCT GCGGTGGGCG ACCGACTACG ACGCCCCCCT GCACCCCAAG
TACACCTACT GCTGGCACGA CGTGACCGTC GACGCCGTCG GCGCGCTGGC GGCCGCCGTC
GAAGACGCCG AGAGGGCCGA GACGGACGGC GCGGTCGCGC CGACGCCGTC GCCCGACCGG
GGCACCGACG GCGACCTCGT CCTCCCGAAC CGCGAGCCCG TCGCCCAGAC GCTCGAACAC
CTGCTCGTCG AGCACACTCA GCGCGAGGAC ACCATCGTCG TCCCCGACTG GGAGCCGCTG
GTCCGGACGC TGGGCTTCGA CGCCGCTCTC GAACGCGAGT GGTCGCTCGG GGACCTCTCG
GAGCACGCTC GAACGTACGC GGACGGCGAC AACGCCATCG AGGCGATCAA CGAGATCGCT
CCCTTCCGGC TCCGCGAGCG CGCCCCGACC CGGATCGGCA ACCGGATGGG TCGCCCCGAG
AAATCAGAGG AGCGAGAGCT GTCGCCGGCC GTCCACACGC TGTTTCCCAT CGACGAGGCC
GGCGGCGCTC AGCGCTCGGT CGCCGACGCC ACCAAACACG CCGAGAAGAT GAACGACCAG
CAGGGCCTCG TCGAGGTCGA GGTCGGTCGC CGCCGCTGTC GGGACTGTGG CACCGAGACC
TTTCGGGGCC GCTGTCCGGA CTGCAACGGC GTCACCGACG CCGTCTACGT CTGTCCCGAC
TGCGACTCGG AGGTCGAGCC CGACGAGTCG GGCCGCGCCG AGTGTGCCCA CTGCGAGACG
ATGGCCTACC CCACCCAGTA CGAGGCCATC GACCTCGGCG AGGAGTTCCG CGACGCGCTG
GGTGCCGTCG GCGAACGCGA GACGGCCTTC GACATCGTCA AAGGCGTCGA GGGGCTGACC
TCCAAGGAGA AGATTCCCGA ACCGATGGAG AAGGGGATCC TCCGGGCGAA ACACGACGTG
TCGGCGTTCA AAGACGGCAC CGTCCGCTAC GACATGACGG ACCTGCCGGT CACCGCGGTC
CGGGCGTCGG AACTCGACGT GAGCGTCGAC CAGCTCCGCG GGCTGGGCTA CGAGGCGGAC
ATCCACGGCG ACCCGCTGCG CCACGAGGAC CAGCTCGTCG AGCTCAGAGT CCAGGACGTG
GTGCTCTCCG ACGGCGCGGC CGAGCACATG CTCCAGACCG CCGCGTTCGT CGACGACCTC
TTAGAGCAGT ACTACGGGCT CGACCGGTTC TACGAACTCG ACGACCGCGA GGACCTCGTC
GGCGAACTCG TCTTCGGGAT GGCACCCCAC ACCAGCGCCG CGACGGTCGG GCGCGTCGTC
GGCTTCACCT CCGCGGCGGT CGGGTACGCG CATCCGTACT TCCACGCCGC CAAGCGCCGC
AACTGCTTCC ACCCCGAATC GCGACTCTGG TACGAGGACG AGGCTGGCGA CTGGCAGTAC
GGCCCTATCG CCGATCTCGT CGAGGAGCGA CTCGACGACC CGCGCGAGGA CGACTTCGGG
ACCCTGGTCG AGGAACTGGA CGGCGAGGTC ACGGTGCCCT CGGTCGACGC CGACGGTCGC
CCGTGTCGCA AGCCCGTCGA GGCGGTCTCG AAACATCACG CTCCCGATCA CATGGTCCGC
ATCGAGGTCG GTGACCGCTC GCTGCGGGTA ACTCCCGACC ACACGATGCT CCGGCGGAGC
GGTGACGGGC TCGAAGAAGC ACCGGCCAGC GAACTCTCGG CCGGCGACGA ACTCCCGGCC
TACGACGGCG GCGAGACGAC CGTGATGACG GCGAGCGATG CCCCACAGCC CTCGGCGGGT
GCCGACGACG GCGTTCCGTT CGACGAAGTG GTGTCGGTCG AGTACGTCGA CAGCGACACG
AACCACGTCT ACTGTCTCAC CGTCGCCGAC ACCCACCGGG TCGCCATCGA GGGCACCTAC
TCCGGCCAGT GCGACGGCGA CGAGGACTGC GTGATGCTGC TGATGGACGG CCTCCTGAAC
TTCTCGAAGA CGTTCCTGCC GAACCAGCGG GGCGGTCGGA TGGACGCACC CTTGGTGATG
TCCTCCCGCA TCGACCCCAG CGAGATCGAC GACGAGGCCC ACAACATGGA CATCATGCGG
GAGTACCCCC GGGAGTTCTA CGAGGCCACC CGCGAGATGG CCGACCCCGA GGATGTCGAG
GACGTCATGA CCATCGCCGA GGAGACGCTG GGCACCGACC ACGAGTACAC CGGCTTCGAT
CACACCCACG ACACCACCGA CATCGCGCTC GGCCCGGACC TCTCGGCCTA CAAGACGCTC
GGTTCGATGA CCGACAAGAT GGACGCCCAG CTCGATCTCT CGCGGACGCT GCGGGCTGTC
GACGAGACCG ACGTGGCCGA ACGGATCATC GAGTACCACT TCCTGCCGGA CCTGATCGGC
AACCTCCGGG CCTTCTCCCG CCAGGAAGTC CGCTGTCTCG ACTGCGGCGA GAAGTACCGC
CGGATGCCCC TGACCGGCGA CTGCCGGGAG TGTGGCGGCC GGGTCAACCT CACGGTCCAC
GAAGGGTCGG TCAACAAGTA CATCGACACG GCGACGATGG TCGCCGAGGA GTTCGACTGC
CGAGAGTACA CCAAACAGCG CCTCGAAATC CTCGACAAGT CGATCAAGCG GGTGTTCGAG
AACGACAAAA ACAAGCAGAG CGGGATTGCG GACTTCATGT AG
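
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. Pasting the full sequence in its place would give the gene-wide value to compare with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGCGCGAGG TCGACGAACG CTACTTCGAC CGCCTGGAAT CCCAGCTCGA CGACGCCTTC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```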
 
Protein sequence
MREVDERYFD RLESQLDDAF EVAQAAKRRG GDPVPEVEIP TARDMADRVE NILGIDGVAE 
RVRELEGEMS REEAALELVS DFVEGEVGDY DSKAGKVEGA VRTAVALLTE GVVAAPIEGI
DRVEILENDD GTEFVNVYYA GPIRSAGGTA QALSVLVADY ARALLGIEEY EARDEEINRY
AEEIELYDKE TGLQYSPKEK ESKFIAEHMP IMLDGEATGD EEVSGFRDLE RVDTNSARGG
MCLVAAEGIA LKAPKIQRYT RNLEEVDWPW LQDLIDGTIG KDGDEDGEED AADEGDDGDE
DEGDETSTSE DDGPPRADPS TKFLRDLIAG RPVFSHPSEP GGFRLRYGRA RNHGNATAGV
HPATMHLVDD FLATGTQIKT ERPGKAAGVV PVDTIEGPTV KLANGDVRRV DDPAEAIEVR
NGVEKVLDLG EYLVNYGEFV ENNHPLVPAS YTVEWWRQEF DDAGGDVQAL QDDVHVDLAD
PTAEEALRWA TDYDAPLHPK YTYCWHDVTV DAVGALAAAV EDAERAETDG AVAPTPSPDR
GTDGDLVLPN REPVAQTLEH LLVEHTQRED TIVVPDWEPL VRTLGFDAAL EREWSLGDLS
EHARTYADGD NAIEAINEIA PFRLRERAPT RIGNRMGRPE KSEERELSPA VHTLFPIDEA
GGAQRSVADA TKHAEKMNDQ QGLVEVEVGR RRCRDCGTET FRGRCPDCNG VTDAVYVCPD
CDSEVEPDES GRAECAHCET MAYPTQYEAI DLGEEFRDAL GAVGERETAF DIVKGVEGLT
SKEKIPEPME KGILRAKHDV SAFKDGTVRY DMTDLPVTAV RASELDVSVD QLRGLGYEAD
IHGDPLRHED QLVELRVQDV VLSDGAAEHM LQTAAFVDDL LEQYYGLDRF YELDDREDLV
GELVFGMAPH TSAATVGRVV GFTSAAVGYA HPYFHAAKRR NCFHPESRLW YEDEAGDWQY
GPIADLVEER LDDPREDDFG TLVEELDGEV TVPSVDADGR PCRKPVEAVS KHHAPDHMVR
IEVGDRSLRV TPDHTMLRRS GDGLEEAPAS ELSAGDELPA YDGGETTVMT ASDAPQPSAG
ADDGVPFDEV VSVEYVDSDT NHVYCLTVAD THRVAIEGTY SGQCDGDEDC VMLLMDGLLN
FSKTFLPNQR GGRMDAPLVM SSRIDPSEID DEAHNMDIMR EYPREFYEAT REMADPEDVE
DVMTIAEETL GTDHEYTGFD HTHDTTDIAL GPDLSAYKTL GSMTDKMDAQ LDLSRTLRAV
DETDVAERII EYHFLPDLIG NLRAFSRQEV RCLDCGEKYR RMPLTGDCRE CGGRVNLTVH
EGSVNKYIDT ATMVAEEFDC REYTKQRLEI LDKSIKRVFE NDKNKQSGIA DFM
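
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the set of permitted start codons; alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same assignments; only its start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence, copied as displayed.
print(translate("ATGCGCGAGG TCGACGAACG CTACTTCGAC"))  # MREVDERYFD
print(translate("GACTTCATGT AG"))                     # DFM
```

The two fragments reproduce the first ten and last three residues of the protein sequence above, and the final codon TAG is the stop codon.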