Gene Hlac_0150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0150 
Symbol 
ID7401671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp156012 
End bp160112 
Gene Length4101 bp 
Protein Length1366 aa 
Translation table11 
GC content66% 
IMG OID643707213 
ProductDNA polymerase II large subunit 
Protein accessionYP_002564825 
Protein GI222478588 
COG category[L] Replication, recombination and repair 
COG ID[COG1933] Archaeal DNA polymerase II, large subunit 
TIGRFAM ID[TIGR00354] DNA polymerase, archaeal type II, large subunit
[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCGG ACGACGAGCG CTACTTCGCT CAGATCGAAG ACCGGCTCGA CGAGGCGTGG 
GACGTGGCCG AGGCCGCCAA GGAGCAGGGT CACGACCCGA AACCGAAGAT CGAGATCCCG
ATCGCCCGCG ATATGGCCGA CCGGGTCGAG AACATTCTCG GAATCGACGG GGTCGCGGAG
CGCGTCCGCG AGTTGGAAGG CGAGATGTCC CGCGAGGAGG CGGCCCTCGA ACTCGTCACC
GACTTCGTCG AGGGGACCGT CGGCGACTAC GACTCCCGGG CCGGCAAGGT CGAAGGCGCG
GTCCGGACCG CGGTCGCGCT GCTCACCGAG GGGGTCGTCG CCGCGCCGAT CGAGGGGATC
GACCGGGTCG AGCTGCTGGA AAACGACGAC GGCACCGAAT TCGTCAACGT CTACTACGCC
GGTCCGATCC GGTCTGCGGG CGGGACCGCG CAGGCGCTGT CCGTGCTCGT CGCCGACTAC
GCCCGATCGT TGCTCGACAT CGACGAGTAC AAGCCGCGAG ACGTAGAGGT CGAGCGCTAC
GCCGAGGAGA TCTCGTTGTA CGACAAGGAG ACAGGCCTCC AGTACTCGCC GAAGGACAAG
GAGTCGAAGT TCATCGCACA GCACATGCCC GTCATGCTCG ACGGGGAAGC GACGGGCGAC
GAGGAGGTCT CCGGGTTCCG CGATCTGGAA CGCGTCGACA CCAACTCCGC GCGCGGCGGG
ATGTGTCTCG TGGCCGCCGA AGGGATCGCG CTGAAGGCCC CGAAGATCCA GCGGTACACC
CGCGACCTCG ACGAGGTCGA CTGGCCGTGG CTGCAAGATC TGATCGACGG AACGATCGGC
AAGGACGGCG GCGGCGCGAG CGACGAGGCT CCCGAGAATG CCGGCGCAGA GGGAGACGAC
GACAGCGAGG ACACCGCGGG CAACGAGAGC GACGGGGACG ACGCGGATGA TGCCGATTCG
GAAACCGATG GCCCTGCTGG CCCCCTCCGT CCGAAGTCCT CACAGAAGTT CCTCCGGGAC
CTGATCGCAG GCCGTCCCGT CTTTACCCAC CCGAGTGAGG CGGGTGGATT CCGACTTCGA
TACGGTCGCG CGCGCAACCA CGGGTTCGCG ACCGGCGGGG TCCACCCGGC GACCATGCAC
CTCGTCGACG ACTTCCTCGC GGCGGGCACC CAGATCAAGA CGGAGCGTCC GGGGAAAGCC
CACGGGGTCA TCCCCGTCGA CTCCATCGAG GGGCCGACCG TGCGACTCGC GAACGGCGAG
GTCCGGCGGA TCGACGACCC TGAGGAGGCG AAGGAGATCC GGAACGGCGT CGAGAAAGTG
CTCGATCTCG GCGAGTATCT CGTCAACTAC GGCGAGTTCG TCGAGAACAA CCACCCGCTC
GCGCCCGCCT CGTACGTCTA CGAGTGGTGG GTTCAGGAGT TTGAGACCGC CGGTGCCGAC
GTGCAGGCCA TGCGGGACGA CCCCCACGTC GACCTTGAAC ACCCCGATGT CGACGAGGCG
CTCGCGTGGG CGCGCGAGTA CGACTGTCCG CTTCACCCGG AGTACACGTA CCTCTGGCAC
GACGTGTCGG TCGATGCGTT CGAGGCGCTC GCAGATGCGG TTGCCGCCGG CGACGTGGTC
GAGGGCGTCC TCGCGATCGA GCGCACGGAG ACGACGCAGC ACGCGCTGGA GTCGCTGCTG
GTCCAACACG TCGCGACGCC GGACGCGCTC CGGATCCCCA CGTGGCGCCC GCTGGCGCGG
TCGCTCGGAA TCGACGACGG GCTCCGAAAG ACGTGGAGCG CGGACGACCT CTCGGCGGCG
GCCCGCGAGT GGGATGACGG TGAGAACGCG GTGAAAGCCA TCAACGAAGT CGCTCCCTTC
GAGATTCGCG AGCGGGCGCC GACCCGGATC GGCAACCGGA TGGGCCGCCC GGAGAAGTCG
GAGAGCCGTG ACCTCTCGCC GGCGGTCCAC ACCCTGTTCC CGATCAACGA GGCCGGCGGC
CCCCAGCGCG ACGTGGCGGA GGCCGCGGGG ACGATGGACG ACTCGGGCCA CCGAGGGCGG
CTGGACCTGG AGGTCTCCGA CCGGGTGTGC CCCGACTGTG GCGAGCACAC GTACCGGGCG
CAGTGCCCGG ACTGCGGGGT CCACACCGAC CTCCACTACG AGTGCGGCGA CTGCGGCACC
GTCTGCGAGC CCGACGAGGC CGGCCGCGTC GAGTGCCCGC GCTGCGAGTG GGAGGTGACG
GCGGCGACCT ACCAGGAGGT CGACCTCAAC GACGCGTACC GCTCCGCGCT GGAGACCGTC
GGCGAGCGGG AATCCGCCTT CGAGATTCTG AAGGGCGTTC AGGGGCTCAC CTCGCGGAAC
AAGACCCCCG AGCCGATCGA GAAGGGGGTC CTCCGCGCGA AAAATGGTGT CACATCGTTC
AAGGACGGCA CCGTCAGGTA CGACATGACC GACCTCCCGG TCACGTCGGT CCGCGCCGAG
GAGCTCGACA TCACCGCCGA CCACCTCCGA GAGATCGGTT ACGAGACAGA TATCGACGGC
GAACCCCTGC GCCACGACGA TCAACTGGTG GAGCTTCGGG TGCAGGACAT CGTCCTCTCC
GACGGCGCTG CCGAGCACAT GCTCAAGACT GCGGGCTTCG TCGACGACCT CTTAGAGCAG
TTCTACGGGC TCGACCGCTT CTACGAGTTC AACGAGCGCG ACGACCTCGT CGGCGAGCTC
GTCTTCGGGA TGGCGCCGCA CACGAGCGCC GCAACTGTCG GGAGAGTGGT GGGTTTCACG
TCGGCCGCCG TGGGATACGC TCATCCGTAC TTTCACGCCG CCAAGCGGCG CAACTGCTTC
CACCCGGAGA CGGAGATTGA GTATCGAGAC GACGCAGGGT GGCACCGCGA GACCATCGAG
AAATTCGTTG AGGAGCGACT GAATGATCCC CAGACCGACG ACTTCGGAAC ACTCGTCAAC
GAACTCGATG GCAACGTCGA GGTTCCTTCG ATCGACGAAC ACGGAAACAA GTCGACACAG
CCGGTGACTG CCGTCTCGAA ACACCTGAGT CAGGACCATC TCGTCCAGGT CGAGACCCAG
CGCGGACGTT CGATTCGCGT GACACCCGAC CACACGATGC TGCGAGTCGC GGATGGAGGC
GTTCAAAAGG TCGCTGCAAA CGAACTGGAG ATCGGCGATT CGGTCCCGAC GACAACTCTT
CGTCACGAGA TGTCGGTGGG CGCTACTGTC GACGTGACGA CGGACGGGGG GATCGAGGCG
GACACAGTTG CCTCGGTCGA TTTCCTCGAA AGCGACATCG AACACACTTA CAATCTCACA
GTCGCCGAGA CGCACACGCT CGCGGCGAAC GATTTACTGG TCGCTCAGTG TGACGGAGAC
GAGGATTGCG TTATGCTCCT CATGGACGGA CTTCTCAACT TCTCGAAGAC GTTCTTACCG
GACCAACGCG GCGGTCGTAT GGACGCGCCC CTCGTGATGT CCTCGCGGAT CGACCCGTCG
GAGATCGACG ACGAGGCGCA CAACGTCGAC ATCGCGCGGG AGTACCCTCG CGAGTTCTAC
GACGCGACGC TGGAGATGGC CGATCCGGAG TCGGTCGAGG ACCTGATTCA GATCGGCGAG
GACACGCTCG GGACCGACGA GGAGTACCAC GGGTTCGGCC ACACCCACGA CACGACGGAC
ATCGCGATGG GACCCGATCT GTCGGCGTAC AAGACGCTCG GCGACATGAT GGAGAAGATG
GACGCCCAGC TGGAGCTGGC CCGGAAGCTG CGCGCGGTCG ACGAGACGGA CGTGGCCGAG
CGCGTGATCG AGTACCACTT CCTGCCGGAC ATCATCGGGA ACCTCCGGGC GTTCTCGCGG
CAGAAGACGC GGTGTCTCGA CTGCGGCGAG AAGTACCGAC GGATGCCGCT ATCTGGCGAC
TGCCGCGAGT GCGGCGGGCG GGTGAACCTC ACCGTCCACG AGGGGTCGGT GAGCAAGTAC
GTCGACACGG CGATCGAGGT CGCCGATCGG TTCGGCTGCC GGCCCTACAC GAAACAGCGG
TTGAAGGTGT TAGACCAGTC GCTGGAGTCG ATCTTCGAGG ACGACACCAA CAAGCAGTCG
GGTATCGCGG ACTTCATGTG A
 
Protein sequence
MRPDDERYFA QIEDRLDEAW DVAEAAKEQG HDPKPKIEIP IARDMADRVE NILGIDGVAE 
RVRELEGEMS REEAALELVT DFVEGTVGDY DSRAGKVEGA VRTAVALLTE GVVAAPIEGI
DRVELLENDD GTEFVNVYYA GPIRSAGGTA QALSVLVADY ARSLLDIDEY KPRDVEVERY
AEEISLYDKE TGLQYSPKDK ESKFIAQHMP VMLDGEATGD EEVSGFRDLE RVDTNSARGG
MCLVAAEGIA LKAPKIQRYT RDLDEVDWPW LQDLIDGTIG KDGGGASDEA PENAGAEGDD
DSEDTAGNES DGDDADDADS ETDGPAGPLR PKSSQKFLRD LIAGRPVFTH PSEAGGFRLR
YGRARNHGFA TGGVHPATMH LVDDFLAAGT QIKTERPGKA HGVIPVDSIE GPTVRLANGE
VRRIDDPEEA KEIRNGVEKV LDLGEYLVNY GEFVENNHPL APASYVYEWW VQEFETAGAD
VQAMRDDPHV DLEHPDVDEA LAWAREYDCP LHPEYTYLWH DVSVDAFEAL ADAVAAGDVV
EGVLAIERTE TTQHALESLL VQHVATPDAL RIPTWRPLAR SLGIDDGLRK TWSADDLSAA
AREWDDGENA VKAINEVAPF EIRERAPTRI GNRMGRPEKS ESRDLSPAVH TLFPINEAGG
PQRDVAEAAG TMDDSGHRGR LDLEVSDRVC PDCGEHTYRA QCPDCGVHTD LHYECGDCGT
VCEPDEAGRV ECPRCEWEVT AATYQEVDLN DAYRSALETV GERESAFEIL KGVQGLTSRN
KTPEPIEKGV LRAKNGVTSF KDGTVRYDMT DLPVTSVRAE ELDITADHLR EIGYETDIDG
EPLRHDDQLV ELRVQDIVLS DGAAEHMLKT AGFVDDLLEQ FYGLDRFYEF NERDDLVGEL
VFGMAPHTSA ATVGRVVGFT SAAVGYAHPY FHAAKRRNCF HPETEIEYRD DAGWHRETIE
KFVEERLNDP QTDDFGTLVN ELDGNVEVPS IDEHGNKSTQ PVTAVSKHLS QDHLVQVETQ
RGRSIRVTPD HTMLRVADGG VQKVAANELE IGDSVPTTTL RHEMSVGATV DVTTDGGIEA
DTVASVDFLE SDIEHTYNLT VAETHTLAAN DLLVAQCDGD EDCVMLLMDG LLNFSKTFLP
DQRGGRMDAP LVMSSRIDPS EIDDEAHNVD IAREYPREFY DATLEMADPE SVEDLIQIGE
DTLGTDEEYH GFGHTHDTTD IAMGPDLSAY KTLGDMMEKM DAQLELARKL RAVDETDVAE
RVIEYHFLPD IIGNLRAFSR QKTRCLDCGE KYRRMPLSGD CRECGGRVNL TVHEGSVSKY
VDTAIEVADR FGCRPYTKQR LKVLDQSLES IFEDDTNKQS GIADFM