Gene Hmuk_2894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2894 
Symbol 
ID8412446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2775797 
End bp2779834 
Gene Length4038 bp 
Protein Length1345 aa 
Translation table11 
GC content62% 
IMG OID645021240 
ProductDNA polymerase B exonuclease 
Protein accessionYP_003178706 
Protein GI257388933 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.182169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC AGGGCCAGCA GGACCTCGGC GCGTTCGTCG ACAGCGACGA CACTGACGAC 
GAGCGCGACG TTGCCCAAGA GGCAGCGGCG GTCGCCGGCA CGAGCGCGTC GACCGCCGAC
GTGGTCGACG CCAGCGACCG GCTCTTCGAG GACGTCGAGG AGACGGTTTC GATGGCCGTC
ACCCAGATCG ACTACACCAT CGAGGGGCGC GGCGACGACG AGTATCCAGT CATCCACGTT
TTCGGCCGAA CTGAGTCCGA GGACGCCGTC CACGCCCGCG TCTACGACTT TCGACCGTAC
TTCTACGCGC CCGCAGAGAA CGTCACGGAG GAGCGACTGC GAGGCTACGA CAGCATCACT
GGCTGGGAGG AGACCGACGC AGACGGCGAT CCCTACGAGT CGATCCGAGG CGAGCGACTC
GTCAAGATCT TCGGTCGGAC GCCCCGTGAC GTGGGACAGA TCCGCGACGA GTTCGACCAC
TACGAGGCCG ACATCCTCTT TCCCAACCGA TTTCTCATCG ACAAGGGACT CACCAGCGGT
GTCGAAGTCC CGCTCAGAGA GGACGACGAC GGCAGTCTCC GGGTCCATCA CGCGGAGGTC
GCGCCCGCGA ACGTGCAGGC GACACCGCGG GTGAACGTCT TCGACATCGA GGTCGACGAC
CGCTCGGGGT TCCCCGAGGA CGGCGAGGAG CCGATCGTCT GCCTGACCAG TCACGACTCC
TACCGCGACG AGTACGTCCT CTGGCTGTAC GAGGCCCCCG ACGGGATCGA AGGGCCGGAC
CATCTCCCCG ACTACGAGCC TATCGAGGGC GAGATCGACG CCGAGATTCG AAGATTCACC
GAGGAGGAGG CGATGCTGGA GGCCTTCGTC GACTACATCG ACGAGACGGA CCCGGACATC
CTCAGCGGGT GGAACTTCGA CGACTTCGAC GCGCCGTACT TCCTCGACCG ACTGGAAGAA
CTGCAGGGCC CACACCACGA CTACGACCTC AGTGTCGACC GCCTCTCGCG GGTGGACGAG
GTCTGGCGCT CGGGCTGGGG CGGCCCCGAC ATCAAGGGGC GAGTCGTCTT CGACCTGCTG
TACGCCTACC AGCGCACCCA GTTCAGCGAA CTCGACTCCT ACCGGCTCGA CGCGGTCGGC
GAGGTCGAAC TGGGCGTCGG CAAGGAACGC TATCCCGGCG ACATCGGCGA CCTCTGGGAA
GACGACCCCG AGCGCCTGCT GGAGTACAAC CTCCGGGACG TGGAGCTCTG CGTCGAACTC
GACCGCGATC AGGACATCGT CGCCTTCTGG GACGAGGTCC GCACCTTCGT CGGCTGTAAG
CTCGAAGACG CCACCACGCC CGGTGACGCC GTCGACATGT ACGTCCTGCA CAAACTCCAC
GGCGAGTTCG CACTCCCCTC GAAGGGCCAG CAAGAGAGCG AGGACTACGA GGGCGGGGCC
GTGTTCGACC CCATCACCGG TGTCAGGGAG AACGTGACGG TGCTGGACCT GAAGAGCCTC
TATCCGATGT GCATGGTGAC GACCAACGCG AGCCCCGAGA CGAAGGTCGA CCCGGAGGCC
TACGACGGCG ACACCTACCG CGCGCCCAAC GGGACCCACT TCCGGAAGGA ACCGGACGGC
GCGATCCGGG AGATGGTCGA CGAACTCCTA TCAGAACGCG AGGAGAAAAA GTCGCTTCGG
AACGATCACG ACCCGAAAGA GGACGCCTAC GAGCGCTACG ACCGGCAACA AGCGGCGGTG
AAGGTGATCA TGAACTGCTT TACGCCGGAC ACCGAGGTCG TCACGCCCGA GGGCGTTCGC
GACATCACGG ATCTCGAAGT CGGCGACGAA GTGTACTCTC TCGATCCGGA AACGATGCGC
ATGGAGGTCA AGCCGGTCGT CGAGACCCAC GACTACCCCG ACTATCGGGG CGACCTCGTC
GACATCGAGA CGAGTAAGAT CGACTTCCGA GTGACGCCGA ACCACCGGAT GCTCGTCCGG
AAGAACGAGA CAAACGGGAT CACCGAAGAC GAGTACGGGT TCGTCGAAGC GGGCGAACTC
GATCGAGCGA CGAACTACGA ACTGCCCCAC GACTGGGACG GTCCCGACGG TGAGCGGCTC
AACGAGGTGG ATCTCGTCGA CATCCTTGAG GGTGCGTTCG AGGTATGGTG TGACAACGAC
GACCACGGAC ACACTCTTGC CGCAGAGGTT GGGTACCGTC CGGACAAGAT GCAAAAGCAG
GGTGAAGACG GGACTGGCTA CGTGTTTGAC GCCGAATCGT TCCGGGAGCA CCGTGCGTAT
ATTGATGAGA GCTGTTCAGG GTTTTACGTC CACTCTGAGC GCGGTCGCAA GTGGATTCCA
CGGTTTTACG ACGGTGACGA CTTCCTCGAA CTGCTGGCGT GGTACATCAC CGAAGGAAGC
ATCTACACAT CTGCGGATAA GCAGTTTGGT GAGCATTTCC GAGGTAGTTC AACAACAGTG
AACATTGCAC AGGATGCGGT CCCGGACGGA GGATCTGGTG ACGACCACAC ACGGATAGGC
AGCCTCCTCG ACGGCATGGG GTTCGATGCG TACGTCGACG AGAAAGGGTA CCAGTTCACT
TCGAAACTGC TTGGTGATCT CTTCGAACGG CTCTGTGGTA GCGACAGCTT CGAGAAGCGG
ATTCCCGAGT TCGTGTTCGA AACAAGCCAA GCACAGAAAC GGACCTTCCT TGATACGCTT
ATCGCTGGCG ACGGGGACTG GCAGACCAAC AGTTGGCGCT ACAGCACTGC AAGTGAACGG
CTCCGCGACG ATGTCTTACG ACTCTGTGCT CATCTCGGAC TCACCGCCAG CTACAATGAA
GACAGTGGAA CGTACCGCAT CTACGTCACC GAAGACAGCA AGAACACGCT CCGGATGCAC
CGGAGTGGCG GTGAAAGCAC GGCTGAAAAC GGTGTTCACT GTGTCACGGT TGAGGACAAC
CACACTCTGC TCGCCGGGCG GAACGGGAAA TTCCAGTTCG TCGGTCAGTC GCTCTATGGT
GTCCTCGGAT GGGATCGGTT CCGTCTATAC GACAAGGAGA TGGGGGCGGC CGTCACGGCG
ACCGGTCGCA AGGTGATCGA TTACACCGAC GAAGTCGTCG CCAGAGAGGG GTACGAGGTC
GTCTACGGGG ATACTGACAG CGTCATGCTA CAGGTCGGAG ACATCGGCCC GGACGACGTC
GAAGGCGACG TTGTGGTCAC CGACGAGATG CGGGCAAAAC ACCCCGAGAT GGACGACGGC
GAACTGGAAA CCGTCGCGGC GACGATCCAG AAGGGGTTCG AACTCGAAGA GACGATCAAC
GAGGCCTACG ACGATTTTGC CCTCGAAGAA CTCAACGCGC AGTTTCACCG CTTCCAGATC
GAGTTCGAGA AGCTCTATCG GCGCTTCTTC CAGGCGGGCA AGAAGAAGCG CTACGCCGGC
CACATCGTCT GGAAGGAGGG CAAAGACGTC GACGACATCG ACATCACCGG CTTCGAGTAC
CAGCGTTCGG ACATCGCACC GATCACCAAG CGCGTCCAGA AGGAGGTCAT CGACATGATC
GTCCACGGCG AGGACGCCGA CACGATCAAG GAGTACGTCC ACGACGTGAT CGAGGACTAC
CAGGCCGGCA ACGTCGACCC CGAAGACGTG GGTATCCCCG GCGGCATCGG CCAGAAACTC
GACAGCTACG ACACCGACAC GGCCCAGGTG CGCGGGGCGA AGTACGCCAA CATGCTGCTG
GGGACGAACT TCCAGAGCGG CTCGAAACCC AAACGGCTCT ACCTGGATCG AGTCCATGAC
GACTTCTTCC AGCGGATCGA GGCCGAGAAG GGACTCGATC CCGCCGAGGA TCCGCTGTAC
GGCGAGTTCC GCCGCGACCC GGACGTGATC TGTTTCGAGT TCGCCGACCA GATTCCCGAA
GAGTTCGAGG TCGACTGGGA GAAGATGCTC GACAAGACGC TGAAGGGTCC GATCGCCAGG
ATCCTCGAAG CGATGGGTAT CTCGTGGGAC GAGGTCAAGT CCGGCCAGGA ACAGACCGGC
CTCGGCCAGT TCATGTGA
 
Protein sequence
MSEQGQQDLG AFVDSDDTDD ERDVAQEAAA VAGTSASTAD VVDASDRLFE DVEETVSMAV 
TQIDYTIEGR GDDEYPVIHV FGRTESEDAV HARVYDFRPY FYAPAENVTE ERLRGYDSIT
GWEETDADGD PYESIRGERL VKIFGRTPRD VGQIRDEFDH YEADILFPNR FLIDKGLTSG
VEVPLREDDD GSLRVHHAEV APANVQATPR VNVFDIEVDD RSGFPEDGEE PIVCLTSHDS
YRDEYVLWLY EAPDGIEGPD HLPDYEPIEG EIDAEIRRFT EEEAMLEAFV DYIDETDPDI
LSGWNFDDFD APYFLDRLEE LQGPHHDYDL SVDRLSRVDE VWRSGWGGPD IKGRVVFDLL
YAYQRTQFSE LDSYRLDAVG EVELGVGKER YPGDIGDLWE DDPERLLEYN LRDVELCVEL
DRDQDIVAFW DEVRTFVGCK LEDATTPGDA VDMYVLHKLH GEFALPSKGQ QESEDYEGGA
VFDPITGVRE NVTVLDLKSL YPMCMVTTNA SPETKVDPEA YDGDTYRAPN GTHFRKEPDG
AIREMVDELL SEREEKKSLR NDHDPKEDAY ERYDRQQAAV KVIMNCFTPD TEVVTPEGVR
DITDLEVGDE VYSLDPETMR MEVKPVVETH DYPDYRGDLV DIETSKIDFR VTPNHRMLVR
KNETNGITED EYGFVEAGEL DRATNYELPH DWDGPDGERL NEVDLVDILE GAFEVWCDND
DHGHTLAAEV GYRPDKMQKQ GEDGTGYVFD AESFREHRAY IDESCSGFYV HSERGRKWIP
RFYDGDDFLE LLAWYITEGS IYTSADKQFG EHFRGSSTTV NIAQDAVPDG GSGDDHTRIG
SLLDGMGFDA YVDEKGYQFT SKLLGDLFER LCGSDSFEKR IPEFVFETSQ AQKRTFLDTL
IAGDGDWQTN SWRYSTASER LRDDVLRLCA HLGLTASYNE DSGTYRIYVT EDSKNTLRMH
RSGGESTAEN GVHCVTVEDN HTLLAGRNGK FQFVGQSLYG VLGWDRFRLY DKEMGAAVTA
TGRKVIDYTD EVVAREGYEV VYGDTDSVML QVGDIGPDDV EGDVVVTDEM RAKHPEMDDG
ELETVAATIQ KGFELEETIN EAYDDFALEE LNAQFHRFQI EFEKLYRRFF QAGKKKRYAG
HIVWKEGKDV DDIDITGFEY QRSDIAPITK RVQKEVIDMI VHGEDADTIK EYVHDVIEDY
QAGNVDPEDV GIPGGIGQKL DSYDTDTAQV RGAKYANMLL GTNFQSGSKP KRLYLDRVHD
DFFQRIEAEK GLDPAEDPLY GEFRRDPDVI CFEFADQIPE EFEVDWEKML DKTLKGPIAR
ILEAMGISWD EVKSGQEQTG LGQFM