Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1458 |
Symbol | |
ID | 8742049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1516818 |
End bp | 1519301 |
Gene Length | 2484 bp |
Protein Length | 827 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512034 |
Product | DNA topoisomerase I |
Protein accession | YP_003403017 |
Protein GI | 284164738 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01057] DNA topoisomerase I, archaeal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCTGA TAATCACGGA GAAGGACAAC GCCGCGCGAC GGATCGCCGA CATTCTGAGC GGCGGGACCT ACGACTCGAG TCGCGAAAAC GGCGTCAACG TCTACGAGTG GGGCGGCAAG CGTTGCGTGG GGCTGTCGGG CCACGTCGTC GGCGTCGACT TCCCGGACGA GTACTCGGAC TGGCGCGACG TCGAACCCGT CGAACTCATC GACGCGAGCG TCGAGAAGAC GGCGACGAAG GAGAACATCG TCGCGACGCT GCGCATCCTC GCGCGAAAGG CCACCCGCGT CACCATCGCG ACCGACTACG ACCGCGAGGG CGAACTCATC GGCAAGGAGG CCTACGACAT CGTCCGCGAC GTCGACGAGG AGGTCCCTAT CCGCCGCGTT CGGTTCTCTT CGATCACGGA AAACGAGGTC CAGAGCGCCT TCGACGACCC GGACGACCTC GACTTCGATC TGGCGGCCGC GGGCGAGGCC CGCCAGATCA TCGACCTCGT CTGGGGCGCC GCCCTGACCC GCTTCCTCTC GCTGTCGGCG GGTCAGCTCG GCAACGACTT CATCTCCGTC GGGCGAGTGC AGTCGCCGAC GCTGAAGCTG ATCGTCGACC GCGAGCGCGA GATTCAGGCC TTCGATCCCG AGACCTACTG GGAGCTGTTC GGCGACTTAA CCAAGGAAGA CACCACGTTC GAGGCCCAGT ACTTCTACCG CGACGAGGAC GACAACGAGG CCGAGCGCGT CTGGGAGGAG GCCGTCGCCG ACGAGGTCTA CGAGACCCTC GCCGAGCGCG ATAGCGCGAC CGTCGTCGAC GTCAACCGCC GGACGCGGAC GGACACGCCG CCCGAGCCGT TCAACACTAC CCAGTTCATC CGCGCGGCCG GCGCTATCGG CTACTCCGCC AAGCGGGCGA TGTCGATCGC CGAGGATCTC TACACCGCCG GCTACATCAC ATACCCGCGG ACCGACAACA CCGTCTACCC CGACGATCTG GATCCCGAGG AACTGCTCGA CGACTTCGTC AGCCATCCGA CGCTCGGCGA GTCCGCCGAG TCGCTGCTCG AGGCCGACGA GATCGTTCCC ACCGAGGGCG ACGAGGAGAC GACCGACCAC CCGCCGATCC ACCCGACCGG CGAAATCCCG AGCCGCGGCG GCGACGTGAG CGACGACGAG TGGGAGGTGT ACGAACTCGT CGTCCGTCGG TTCTACGCGA CCGTCGCCGA CGCCGCCGTC TGGGAACACC TCAAGGTTGT CACCGAGGTC GACGACTACC GCATGAAGTC CAACGGCAAG CGACTCGTCG AGCCCGGCTA CCACGACGTC TACCCCTACT TCAGCACGTC CGAGAACTAC GTCCCCGACG TCACCGAGGG CGAGGAGCTC GCGCTGACCG ACGTCGAACT CGAGGAGAAG GAGACCCAGC CGCCCCGACG CTACGGCCAG TCGCGGCTCA TCGAGACCAT GGAGGACATG GGGATCGGGA CGAAGTCGAC CCGACACAAC ACCCTCGAGA AACTGTACGA CCGGGGCTAC ATCGAGAGCG ACCCGCCGCG GCCGACCAAG CTCGCGATGG CCGTCGTCGA CGCGGCCGAG AACTACGCCG ACCGCGTCGT CAGCGAGGAG ATGACGGCCC AGCTAGAGCA GGACATGGAC GCCATCGCCA GCGGCGAGGC GACGCTGGAC GACGTCACCG ACGAGTCCCG CGAGATGCTA GAAGAGATCT TCGCGAACCT CGCCGACTCG CGCGACGAGA TCGGTGACCA CCTCCGCAAG TCGCTCAAGG ACGACAAGCG GCTGGGTCCC TGCCCCGAGT GCGGCGAGGA CCTGCTCGTC CGGCGCAGCC GCCACGGCTC CTACTTCGTC GGCTGCGACG GCTATCCGGA CTGCGAGAAC ACCCTTCCGC TGCCCTCGAC GGGCAAGCCG CTCATCCTCG AGAGCGAGTG CGAGGACCAC GGTTTGAACG AGGTCAAGAT GCTCGCCGGG CGGCAGACGT TCGTCCACGG CTGTCCGCTC TGTAAGGCCG AAGACGCCGG CGAAGGGCCC GTGCTGGGAA CCTGTCCCGA ATGCGGAGAC GAACACGACG GCGAACTCGC CGTCAAGACC CTCCAGAGCG GTTCTCGGCT GGTGGGCTGT ACGCGCTACC CCGACTGCGA GTACTCGCTG CCGCTGCCCC GGCGCGGCGA GATCGAGGTC ACCGACGAGC GCTGCGACGA ACACGGCCTG CCCGAACTGG TCGTCCACAG CGGCGACGAC CCCTGGGAAC TGGGCTGTCC GATCTGCAAC TACCAGGAGT TTCAGGCCCG CGAGAGCGAC TCCGGCTCCG ACCTCGAGGC GCTGGACGGC GTCGGCGCCA AGACCGTCGA GAAACTCGCG GACGCGGGCA TCGAGAGCCT GGACGATCTG ACCGAGGCCG ATCCGGACGC GGTCGCCGAG GACGTCGACG GCGTCAGCGC CGATCGAGTC CGAACCTGGC AGGCGAAGGC GTAG
|
Protein sequence | MELIITEKDN AARRIADILS GGTYDSSREN GVNVYEWGGK RCVGLSGHVV GVDFPDEYSD WRDVEPVELI DASVEKTATK ENIVATLRIL ARKATRVTIA TDYDREGELI GKEAYDIVRD VDEEVPIRRV RFSSITENEV QSAFDDPDDL DFDLAAAGEA RQIIDLVWGA ALTRFLSLSA GQLGNDFISV GRVQSPTLKL IVDREREIQA FDPETYWELF GDLTKEDTTF EAQYFYRDED DNEAERVWEE AVADEVYETL AERDSATVVD VNRRTRTDTP PEPFNTTQFI RAAGAIGYSA KRAMSIAEDL YTAGYITYPR TDNTVYPDDL DPEELLDDFV SHPTLGESAE SLLEADEIVP TEGDEETTDH PPIHPTGEIP SRGGDVSDDE WEVYELVVRR FYATVADAAV WEHLKVVTEV DDYRMKSNGK RLVEPGYHDV YPYFSTSENY VPDVTEGEEL ALTDVELEEK ETQPPRRYGQ SRLIETMEDM GIGTKSTRHN TLEKLYDRGY IESDPPRPTK LAMAVVDAAE NYADRVVSEE MTAQLEQDMD AIASGEATLD DVTDESREML EEIFANLADS RDEIGDHLRK SLKDDKRLGP CPECGEDLLV RRSRHGSYFV GCDGYPDCEN TLPLPSTGKP LILESECEDH GLNEVKMLAG RQTFVHGCPL CKAEDAGEGP VLGTCPECGD EHDGELAVKT LQSGSRLVGC TRYPDCEYSL PLPRRGEIEV TDERCDEHGL PELVVHSGDD PWELGCPICN YQEFQARESD SGSDLEALDG VGAKTVEKLA DAGIESLDDL TEADPDAVAE DVDGVSADRV RTWQAKA
|
| |