Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0150 |
Symbol | |
ID | 7401671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 156012 |
End bp | 160112 |
Gene Length | 4101 bp |
Protein Length | 1366 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707213 |
Product | DNA polymerase II large subunit |
Protein accession | YP_002564825 |
Protein GI | 222478588 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1933] Archaeal DNA polymerase II, large subunit |
TIGRFAM ID | [TIGR00354] DNA polymerase, archaeal type II, large subunit [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCCGG ACGACGAGCG CTACTTCGCT CAGATCGAAG ACCGGCTCGA CGAGGCGTGG GACGTGGCCG AGGCCGCCAA GGAGCAGGGT CACGACCCGA AACCGAAGAT CGAGATCCCG ATCGCCCGCG ATATGGCCGA CCGGGTCGAG AACATTCTCG GAATCGACGG GGTCGCGGAG CGCGTCCGCG AGTTGGAAGG CGAGATGTCC CGCGAGGAGG CGGCCCTCGA ACTCGTCACC GACTTCGTCG AGGGGACCGT CGGCGACTAC GACTCCCGGG CCGGCAAGGT CGAAGGCGCG GTCCGGACCG CGGTCGCGCT GCTCACCGAG GGGGTCGTCG CCGCGCCGAT CGAGGGGATC GACCGGGTCG AGCTGCTGGA AAACGACGAC GGCACCGAAT TCGTCAACGT CTACTACGCC GGTCCGATCC GGTCTGCGGG CGGGACCGCG CAGGCGCTGT CCGTGCTCGT CGCCGACTAC GCCCGATCGT TGCTCGACAT CGACGAGTAC AAGCCGCGAG ACGTAGAGGT CGAGCGCTAC GCCGAGGAGA TCTCGTTGTA CGACAAGGAG ACAGGCCTCC AGTACTCGCC GAAGGACAAG GAGTCGAAGT TCATCGCACA GCACATGCCC GTCATGCTCG ACGGGGAAGC GACGGGCGAC GAGGAGGTCT CCGGGTTCCG CGATCTGGAA CGCGTCGACA CCAACTCCGC GCGCGGCGGG ATGTGTCTCG TGGCCGCCGA AGGGATCGCG CTGAAGGCCC CGAAGATCCA GCGGTACACC CGCGACCTCG ACGAGGTCGA CTGGCCGTGG CTGCAAGATC TGATCGACGG AACGATCGGC AAGGACGGCG GCGGCGCGAG CGACGAGGCT CCCGAGAATG CCGGCGCAGA GGGAGACGAC GACAGCGAGG ACACCGCGGG CAACGAGAGC GACGGGGACG ACGCGGATGA TGCCGATTCG GAAACCGATG GCCCTGCTGG CCCCCTCCGT CCGAAGTCCT CACAGAAGTT CCTCCGGGAC CTGATCGCAG GCCGTCCCGT CTTTACCCAC CCGAGTGAGG CGGGTGGATT CCGACTTCGA TACGGTCGCG CGCGCAACCA CGGGTTCGCG ACCGGCGGGG TCCACCCGGC GACCATGCAC CTCGTCGACG ACTTCCTCGC GGCGGGCACC CAGATCAAGA CGGAGCGTCC GGGGAAAGCC CACGGGGTCA TCCCCGTCGA CTCCATCGAG GGGCCGACCG TGCGACTCGC GAACGGCGAG GTCCGGCGGA TCGACGACCC TGAGGAGGCG AAGGAGATCC GGAACGGCGT CGAGAAAGTG CTCGATCTCG GCGAGTATCT CGTCAACTAC GGCGAGTTCG TCGAGAACAA CCACCCGCTC GCGCCCGCCT CGTACGTCTA CGAGTGGTGG GTTCAGGAGT TTGAGACCGC CGGTGCCGAC GTGCAGGCCA TGCGGGACGA CCCCCACGTC GACCTTGAAC ACCCCGATGT CGACGAGGCG CTCGCGTGGG CGCGCGAGTA CGACTGTCCG CTTCACCCGG AGTACACGTA CCTCTGGCAC GACGTGTCGG TCGATGCGTT CGAGGCGCTC GCAGATGCGG TTGCCGCCGG CGACGTGGTC GAGGGCGTCC TCGCGATCGA GCGCACGGAG ACGACGCAGC ACGCGCTGGA GTCGCTGCTG GTCCAACACG TCGCGACGCC GGACGCGCTC CGGATCCCCA CGTGGCGCCC GCTGGCGCGG TCGCTCGGAA TCGACGACGG GCTCCGAAAG ACGTGGAGCG CGGACGACCT CTCGGCGGCG GCCCGCGAGT GGGATGACGG TGAGAACGCG GTGAAAGCCA TCAACGAAGT CGCTCCCTTC GAGATTCGCG AGCGGGCGCC GACCCGGATC GGCAACCGGA TGGGCCGCCC GGAGAAGTCG GAGAGCCGTG ACCTCTCGCC GGCGGTCCAC ACCCTGTTCC CGATCAACGA GGCCGGCGGC CCCCAGCGCG ACGTGGCGGA GGCCGCGGGG ACGATGGACG ACTCGGGCCA CCGAGGGCGG CTGGACCTGG AGGTCTCCGA CCGGGTGTGC CCCGACTGTG GCGAGCACAC GTACCGGGCG CAGTGCCCGG ACTGCGGGGT CCACACCGAC CTCCACTACG AGTGCGGCGA CTGCGGCACC GTCTGCGAGC CCGACGAGGC CGGCCGCGTC GAGTGCCCGC GCTGCGAGTG GGAGGTGACG GCGGCGACCT ACCAGGAGGT CGACCTCAAC GACGCGTACC GCTCCGCGCT GGAGACCGTC GGCGAGCGGG AATCCGCCTT CGAGATTCTG AAGGGCGTTC AGGGGCTCAC CTCGCGGAAC AAGACCCCCG AGCCGATCGA GAAGGGGGTC CTCCGCGCGA AAAATGGTGT CACATCGTTC AAGGACGGCA CCGTCAGGTA CGACATGACC GACCTCCCGG TCACGTCGGT CCGCGCCGAG GAGCTCGACA TCACCGCCGA CCACCTCCGA GAGATCGGTT ACGAGACAGA TATCGACGGC GAACCCCTGC GCCACGACGA TCAACTGGTG GAGCTTCGGG TGCAGGACAT CGTCCTCTCC GACGGCGCTG CCGAGCACAT GCTCAAGACT GCGGGCTTCG TCGACGACCT CTTAGAGCAG TTCTACGGGC TCGACCGCTT CTACGAGTTC AACGAGCGCG ACGACCTCGT CGGCGAGCTC GTCTTCGGGA TGGCGCCGCA CACGAGCGCC GCAACTGTCG GGAGAGTGGT GGGTTTCACG TCGGCCGCCG TGGGATACGC TCATCCGTAC TTTCACGCCG CCAAGCGGCG CAACTGCTTC CACCCGGAGA CGGAGATTGA GTATCGAGAC GACGCAGGGT GGCACCGCGA GACCATCGAG AAATTCGTTG AGGAGCGACT GAATGATCCC CAGACCGACG ACTTCGGAAC ACTCGTCAAC GAACTCGATG GCAACGTCGA GGTTCCTTCG ATCGACGAAC ACGGAAACAA GTCGACACAG CCGGTGACTG CCGTCTCGAA ACACCTGAGT CAGGACCATC TCGTCCAGGT CGAGACCCAG CGCGGACGTT CGATTCGCGT GACACCCGAC CACACGATGC TGCGAGTCGC GGATGGAGGC GTTCAAAAGG TCGCTGCAAA CGAACTGGAG ATCGGCGATT CGGTCCCGAC GACAACTCTT CGTCACGAGA TGTCGGTGGG CGCTACTGTC GACGTGACGA CGGACGGGGG GATCGAGGCG GACACAGTTG CCTCGGTCGA TTTCCTCGAA AGCGACATCG AACACACTTA CAATCTCACA GTCGCCGAGA CGCACACGCT CGCGGCGAAC GATTTACTGG TCGCTCAGTG TGACGGAGAC GAGGATTGCG TTATGCTCCT CATGGACGGA CTTCTCAACT TCTCGAAGAC GTTCTTACCG GACCAACGCG GCGGTCGTAT GGACGCGCCC CTCGTGATGT CCTCGCGGAT CGACCCGTCG GAGATCGACG ACGAGGCGCA CAACGTCGAC ATCGCGCGGG AGTACCCTCG CGAGTTCTAC GACGCGACGC TGGAGATGGC CGATCCGGAG TCGGTCGAGG ACCTGATTCA GATCGGCGAG GACACGCTCG GGACCGACGA GGAGTACCAC GGGTTCGGCC ACACCCACGA CACGACGGAC ATCGCGATGG GACCCGATCT GTCGGCGTAC AAGACGCTCG GCGACATGAT GGAGAAGATG GACGCCCAGC TGGAGCTGGC CCGGAAGCTG CGCGCGGTCG ACGAGACGGA CGTGGCCGAG CGCGTGATCG AGTACCACTT CCTGCCGGAC ATCATCGGGA ACCTCCGGGC GTTCTCGCGG CAGAAGACGC GGTGTCTCGA CTGCGGCGAG AAGTACCGAC GGATGCCGCT ATCTGGCGAC TGCCGCGAGT GCGGCGGGCG GGTGAACCTC ACCGTCCACG AGGGGTCGGT GAGCAAGTAC GTCGACACGG CGATCGAGGT CGCCGATCGG TTCGGCTGCC GGCCCTACAC GAAACAGCGG TTGAAGGTGT TAGACCAGTC GCTGGAGTCG ATCTTCGAGG ACGACACCAA CAAGCAGTCG GGTATCGCGG ACTTCATGTG A
|
Protein sequence | MRPDDERYFA QIEDRLDEAW DVAEAAKEQG HDPKPKIEIP IARDMADRVE NILGIDGVAE RVRELEGEMS REEAALELVT DFVEGTVGDY DSRAGKVEGA VRTAVALLTE GVVAAPIEGI DRVELLENDD GTEFVNVYYA GPIRSAGGTA QALSVLVADY ARSLLDIDEY KPRDVEVERY AEEISLYDKE TGLQYSPKDK ESKFIAQHMP VMLDGEATGD EEVSGFRDLE RVDTNSARGG MCLVAAEGIA LKAPKIQRYT RDLDEVDWPW LQDLIDGTIG KDGGGASDEA PENAGAEGDD DSEDTAGNES DGDDADDADS ETDGPAGPLR PKSSQKFLRD LIAGRPVFTH PSEAGGFRLR YGRARNHGFA TGGVHPATMH LVDDFLAAGT QIKTERPGKA HGVIPVDSIE GPTVRLANGE VRRIDDPEEA KEIRNGVEKV LDLGEYLVNY GEFVENNHPL APASYVYEWW VQEFETAGAD VQAMRDDPHV DLEHPDVDEA LAWAREYDCP LHPEYTYLWH DVSVDAFEAL ADAVAAGDVV EGVLAIERTE TTQHALESLL VQHVATPDAL RIPTWRPLAR SLGIDDGLRK TWSADDLSAA AREWDDGENA VKAINEVAPF EIRERAPTRI GNRMGRPEKS ESRDLSPAVH TLFPINEAGG PQRDVAEAAG TMDDSGHRGR LDLEVSDRVC PDCGEHTYRA QCPDCGVHTD LHYECGDCGT VCEPDEAGRV ECPRCEWEVT AATYQEVDLN DAYRSALETV GERESAFEIL KGVQGLTSRN KTPEPIEKGV LRAKNGVTSF KDGTVRYDMT DLPVTSVRAE ELDITADHLR EIGYETDIDG EPLRHDDQLV ELRVQDIVLS DGAAEHMLKT AGFVDDLLEQ FYGLDRFYEF NERDDLVGEL VFGMAPHTSA ATVGRVVGFT SAAVGYAHPY FHAAKRRNCF HPETEIEYRD DAGWHRETIE KFVEERLNDP QTDDFGTLVN ELDGNVEVPS IDEHGNKSTQ PVTAVSKHLS QDHLVQVETQ RGRSIRVTPD HTMLRVADGG VQKVAANELE IGDSVPTTTL RHEMSVGATV DVTTDGGIEA DTVASVDFLE SDIEHTYNLT VAETHTLAAN DLLVAQCDGD EDCVMLLMDG LLNFSKTFLP DQRGGRMDAP LVMSSRIDPS EIDDEAHNVD IAREYPREFY DATLEMADPE SVEDLIQIGE DTLGTDEEYH GFGHTHDTTD IAMGPDLSAY KTLGDMMEKM DAQLELARKL RAVDETDVAE RVIEYHFLPD IIGNLRAFSR QKTRCLDCGE KYRRMPLSGD CRECGGRVNL TVHEGSVSKY VDTAIEVADR FGCRPYTKQR LKVLDQSLES IFEDDTNKQS GIADFM
|
| |