Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3690 |
Symbol | |
ID | 8744316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3799874 |
End bp | 3803542 |
Gene Length | 3669 bp |
Protein Length | 1222 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646514277 |
Product | DNA polymerase II, large subunit DP2 |
Protein accession | YP_003405225 |
Protein GI | 284166946 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1933] Archaeal DNA polymerase II, large subunit |
TIGRFAM ID | [TIGR00354] DNA polymerase, archaeal type II, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAGG CAGACGAACG CTACTTCGAG CGGCTCGAGT CCCAGTTAGA CGAGGCCTTC GACGTCGCCG AACGAGCCAA GGAGCGCGGC GCGGACCCGA AACCCGAGGT CGAGATTCCG ACCGCGCGGG ACATGGCCGA CCGCGTCGAG AACATCCTCG GGATCGACGG CGTCGCCGAG CGCGTCCGCG AACTCGAGGG GGAGATGAGC CGCGAGGAGG CCGCCCTCGA ACTCGCAGAG GACTTCGCCG AGGGCCGGGT CGGCGACTAC GAGTCCAAGG CCGGGAAGGT CGAGGGCGCG GTCCGCACCG CGGTCGCCCT GCTGACCGAG GGGGTCGTCG CGGCGCCCAT CGAGGGGATC GACAAGGTCG AGATCTTAGA GAACGACGAC GGGACGGAGT TCGTCAACGT CTACTACGCC GGCCCGATCC GCTCGGCGGG CGGCACGGCG CAGGCCCTCT CGGTGCTCGT CGCCGACTAC ACCCGCGCCC TCGTGGGGAT CGAGCAGTTC AGCGCCCGCG ACGAGGAGAT CGAGCGCTAC GCCGAGGAGA TCGCCCTCTA CGACAAGGAG ACCGGTCTCC AGTACACGCC CAAGGACAAG GAGACGAAGT TCATCGCCAA GCACATGCCG ATCATGTTGG ACGGCGAGGC CACCGGCGAC GAGGAGGTCT CGGGCTTTCG CGACTTGGAG CGGGTCGACA CCAACAGCGC CCGCGGCGGG ATGTGTCTGG TCATGGCCGA GGGGATCGCG CTCAAGGCGC CGAAGATCCA GCGCTACACC CGCAACCTGG ACGAGATCGA CTGGCCGTGG CTCCAGGACC TGATCGATGG CACCTACTAC GACGACGCGG CCGACGAAGG GGACGGCGAG GACGACGCCG ACGAGGATGA GGCTGCCGAC GAGAGCGAGG GCGAGGACCC CGCGGACGGC GACGACGCCG ACGAACCGCA GGGTCCCCCG CGCGTCGAGG AGTCCACGAA GTTCCTCCGG GACCTGATCG CCGGCCGCCC CGTCTTCTCT CACCCCTGCG CGGAGGGCGG GTTCCGACTG CGCTACGGTC GCGCGCGCAA CCACGGCTTC GCGACCGCCG GCGTCCACCC CGCCGCGATG CATCTGGTCG ACGACTTCCT CGCGACCGGG ACCCAAATCA AGACCGAACG CCCCGGCAAG GCGGCGGGGG TCGTCCCCGT CGACTCCATC GAGGGGCCGA CGGTCAAACT CGCCAACGGT GACGTCCGCC GGATCGACGA CCCCGAGGAC GCCCTCGAGA TCAGAAACGG CGTCGAGAAG ATTCTGGACC TGGGCGAGTA CCTCGTCAAC TACGGCGAGT TCGTCGAGAA CAACCACCCG CTCGCGCCCG CCTCCTACAC CTACGAGTGG TGGGTCCAGG ACTTAGCGGC CGCCGGCGCC GACGTCCAGG CCCTCGAGGA CGACCCCCGA ATCGACCTCG AGTTCCCCGA GCCCGAGGAA GCCCTCGAGT GGGCCGTCGA GTACGACGCG CCGCTCCACC CCGAGTACAC CTACCTCTGG CACGACATTT CGGTCGACGC CTTCTGTGAC CTCGCGGCGG CGGTCGCCGA GGGACGGATC GAGCAGGACG GCGACGGCAG CGTGAACGGC AACGGGGACG ACAGCATCCT CGTCCTCGCA TACGCCGACG TCGTCGCCGA CGCCCTCGAG ACGATTGTCA TCGAGCACCG CCAGCGCCCC GACGCGGACC GCATCGAAAT CGACGACTGG CGGCCGTTCG TCCGCACCGT CGGCTGCGAA CCGCGGCGAG CCGTCGCCGA CGGCGCCGCC TTAGATCTCG ACGCCGACCG TGACGGAGGG GAGGAGGAAC CGAGCATCGA ACTCGAGCGC ACGTGGTCCG CGGACGACCT CTCGGAGCGG GCCCGCAACT GGGGCCGCGA GGACGAACCC GACGGCGCCA ACGCCATCGA GGCTGTCAAC GAAGTCGCAC CGTTTCAGGT GCGCGAGCGC GCCCCCACGC GGATCGGCAA CCGGATGGGA CGCCCGGAGA AGTCAGAGAG CCGCGACCTC AGCCCGCCCG TGCACACGCT GTTCCCGATC GGCGAGGCCG GCGGCGCACA GCGCAACGTC GCCGACGCCG CCAAGCACGC CGAGACGATG TCCGACACGC CGGGCGTCGT CGAACTCCAA GTCGGCCGCC AGCGCTGTCC CGACTGCGCG ACGGAGACGT TCAAGAACCG CTGTCCGGAC TGCGACGCGC GGACCGAACC CGACTACCGC TGTCCCGACT GCGACGAGTC CCTCGAGCCC GACGACGCCG GCCGCGTCGA GTGCGACCGC TGTGAACGCG AGGGAACCTG CGTCGAGAAC CGCGAGGTCG ACGTCAACGA CGAGTTCCGC TCGGCCCTCG AGTCGGTCGG GGAACGCGAG AACGCCTTCG ACATCCTGAA AGGCGTCAAG GGGTTGACCT CGTCGAACAA GATCCCCGAA CCCATCGAGA AGGGGATCCT GCGCGCGAAA CACGACGTCT CGGCGTTCAA GGATGGCACC GTCCGCTACG ACATGACCGA CCTCCCGGTC ACGTCCGTCC GCGCCAGCGA ACTCGACGTC GACGTCGGCC AGCTACAGGC GCTGGGATAC GAGGAGGATA TCCACGGCGA GCCGCTGACC CACGAGGACC AGCTCGTGGA GCTGAAAGTA CAGGATATCG TCCTCTCGGA CGGCGCCGCC GAGCACATGC TCCAGACCGC CGACTTCATC GACGATCTCT TGGAGCAGTA CTACGGCCTC GAGCCGTTCT ACGAGTTCGA GGATCGGCAG GAACTGGTCG GAGAGCTGGT GTTCGGGATG GCACCCCACA CGAGCGCGGC AACTGTCGGG AGAGTTATCG GTTTCACGAG CGCGGCAGTC GGATACGCTC ATCCGTACTT TCACGCCGCG AAACGGCGCA ACTGCGACGG TGACGAAGAT TGCGTGATGC TGCTACTCGA CGGACTTCTC AACTTCAGTA AGTCTTTCCT GCCCGACCAG CGCGGGGGGA AGATGGACGC CCCGCTCGTC ATGTCCTCCC GCATCGACCC CTCGGAGATC GACGACGAGG CCCACAACAT GGACGTCGTC TCGCAGTATC CCCGCGAGTT CTACCTCGCG ACCCGCGAGC AGGCCGATCC CGAGGAGGTC GACGTCCAGA TCGCCGAGGA GAACCTCGGC ACCGACCTCG AATACACCGG CTTCGAACAC ACCCACGACA CCACCGACAT CGCGATGGGG CCCGACCTCT CGGCGTACAA GACGTTGGGC TCGATGATGG ACAAGATGGA CGCCCAGCTC GAGCTCTCGC GGAAACTCGA GGCCGTCGAC GAGACTGACG TCGCCGAGCG GGTCATCGAG TACCACTTCC TGCCGGACCT GATCGGGAAC CTGCGGGCCT TCTCCCGACA GGAGACCCGC TGTCTCGACT GCGGCGAGAA GTTCCGCCGA ATGCCCCTGA CCGGCGACTG CCGCGAATGC GGCGGCCGCG TCAACCTCAC CGTCCACAAG GGCTCGGTCA ACAAGTACAT GCAGACCGCG ATCAAGGTCG CCGACGAGTA CGATTGTCGC GACTACACGA AACAGCGGTT AGAGGTGCTC GAGCGCTCGC TCGAGAGCAT CTTCGAGAAC GACAAGAACA AGCAGAGTGG GATTGAAGAC TTCATGTGA
|
Protein sequence | MREADERYFE RLESQLDEAF DVAERAKERG ADPKPEVEIP TARDMADRVE NILGIDGVAE RVRELEGEMS REEAALELAE DFAEGRVGDY ESKAGKVEGA VRTAVALLTE GVVAAPIEGI DKVEILENDD GTEFVNVYYA GPIRSAGGTA QALSVLVADY TRALVGIEQF SARDEEIERY AEEIALYDKE TGLQYTPKDK ETKFIAKHMP IMLDGEATGD EEVSGFRDLE RVDTNSARGG MCLVMAEGIA LKAPKIQRYT RNLDEIDWPW LQDLIDGTYY DDAADEGDGE DDADEDEAAD ESEGEDPADG DDADEPQGPP RVEESTKFLR DLIAGRPVFS HPCAEGGFRL RYGRARNHGF ATAGVHPAAM HLVDDFLATG TQIKTERPGK AAGVVPVDSI EGPTVKLANG DVRRIDDPED ALEIRNGVEK ILDLGEYLVN YGEFVENNHP LAPASYTYEW WVQDLAAAGA DVQALEDDPR IDLEFPEPEE ALEWAVEYDA PLHPEYTYLW HDISVDAFCD LAAAVAEGRI EQDGDGSVNG NGDDSILVLA YADVVADALE TIVIEHRQRP DADRIEIDDW RPFVRTVGCE PRRAVADGAA LDLDADRDGG EEEPSIELER TWSADDLSER ARNWGREDEP DGANAIEAVN EVAPFQVRER APTRIGNRMG RPEKSESRDL SPPVHTLFPI GEAGGAQRNV ADAAKHAETM SDTPGVVELQ VGRQRCPDCA TETFKNRCPD CDARTEPDYR CPDCDESLEP DDAGRVECDR CEREGTCVEN REVDVNDEFR SALESVGERE NAFDILKGVK GLTSSNKIPE PIEKGILRAK HDVSAFKDGT VRYDMTDLPV TSVRASELDV DVGQLQALGY EEDIHGEPLT HEDQLVELKV QDIVLSDGAA EHMLQTADFI DDLLEQYYGL EPFYEFEDRQ ELVGELVFGM APHTSAATVG RVIGFTSAAV GYAHPYFHAA KRRNCDGDED CVMLLLDGLL NFSKSFLPDQ RGGKMDAPLV MSSRIDPSEI DDEAHNMDVV SQYPREFYLA TREQADPEEV DVQIAEENLG TDLEYTGFEH THDTTDIAMG PDLSAYKTLG SMMDKMDAQL ELSRKLEAVD ETDVAERVIE YHFLPDLIGN LRAFSRQETR CLDCGEKFRR MPLTGDCREC GGRVNLTVHK GSVNKYMQTA IKVADEYDCR DYTKQRLEVL ERSLESIFEN DKNKQSGIED FM
|
| |