Gene Information |
Locus tag | Pars_1359 |
Symbol | |
ID | 5054064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1221280 |
End bp | 1223088 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468905 |
Product | DNA topoisomerase |
Protein accession | YP_001153574 |
Protein GI | 145591572 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | |
|
|
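The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1221280
END_BP = 1223088
GENE_LENGTH_BP = 1809
PROTEIN_LENGTH_AA = 602


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1809
print(expected_protein_length(GENE_LENGTH_BP))  # 602
```

Under these assumptions the span length matches the listed Gene Length and the derived codon count matches the listed Protein Length.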
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.013738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATACTAA TCGTGGCGGA GAAGCGCTCG GTGGCGCATG CCATTGCAAA GTTCCTCGGC GGGCGGTACA AACTGGAGAA GATACAAGGC GTGGCGGCCT ACCGCTTTAA CTACGGCGGA AGAGAAGCCG TTGCGCTGGG GCTAAGCGGC CACCTTATGG ACTTCGACTT CACGGCTAGG CAGAACGTGT GGACGTGGAT ACCGCCAGAG GAGCTCTTCG CATCGCAACC CCTCATAGTT TACAGACCGG AGACTATGAA ATATATCAGG GCCCTTAGAA CCCTCGCCGC AAGGGCGCAC GAGGTTTACC TTGCACTGGA CGCCGACGTG GAGGGCGAGG CCATAGCCTA CGAGGCGGCT CTCTTGGTGC GACTTGTGAA CCCCCGCGCC AAGATTTACC GCGTCCGCTT CAACGCCGTG ACTCAGAGGG AGATAACAAA CGCCTTCAGA AACCCCACCC ACATCAACTT GAGGATGGTG GAGAAGGTAT TCACCAGGAT GCAAGTTGAC CTCACCCTAG GTGCCGTCTT CACCCGGTTC ATAACACTCG CCGTGAGACA CTCGCTTGAT AGGGGACAGT TCCTCAGCTA CGGGCCGTGT CAAACGCCCG TCCTGGGCAT CGTGGTCACC CGCGAATTAC AGAGGAGGAA TTTCAAGCCG GAAAAGTACT ACGTCGTGAA GGCCCTAGTG GAGATAGGCG GCCACAGAAT AGAGATGTCT GCTGACGTGA GGTTTAAGAC TAGAAAGGAG GCAGAGGAGG CGGCGGCAAC TATTAACCGC GGCGTTGTAA AAGCCGCCGT GTACAGGCCA CACCACGTCA ATCCGCCAGT GCCGCTCGAG ACTGTGGAGC TGGAGAGGAG GGCAAGCCGG TGGCTTGGGA TAAACTCGAA GCGGACCCTG GACATAGCAG AGGAGCTCTA CAGAGCAGGC TACATATCTT ACCCCCGCAC AGAGACCACC ATATACCCAC CGACGCTGGA TCTCAGAGAA GTACTACAAG AACTAGCCAG CGGCCACCTT GGCTCCTACG CCGACGAGCT GATGAGGCGC GGTTTCAGGC CCACTCGCGG GGATTCAGAC GACAGAGCAC ACCCGCCTAT ATACCCCACC AGAGCCGCTA CTAAAGGCGA AGTCGCAAAG GCCTTCGGCA AACTCGCCCC CCAAGCATGG GCTATCTACG ACTTCGTGGT GCGGCACTTC CTAGCCACCC TCAGCCCACC GGCGGTGGTA GAAAAACAGA AAATCATAGT CTCTTTCGGC AAACTCGAAA TGGAGGCAGA GGGACAGCTG GTCGTCGACG AGGGCTACTG GCGCATTTAC CCATGGGAGA GGCAGAGCAG TAAGCCCCTG CCTCGCGTAA GCCCCGGAGA CCCAGCAAGG GCGGTGAAGG TAGATGTGGT AGAGCGGGAG ACCGAGCCGC CGCCCCAGAT GACCGAGTCG GAGCTACTGG CACTGATGAA GAAGTACGGC ATAGGCACCG ACGCCACTAT GCAGGACCAC ATACACACGA ACGTCAGGAG GGGCTACATG AAAATCACAA AAGGGAAGTG CATCCCCACT GACCTCGGCA TAGCGCTAGC CACGTCGCTC TTCCAGTTCG CCCCCCAACT CATAGAGCCA ACTGTAAGGG CTAAAATAGA GAAAGCGCTT AACTCCATAG TCACAGACGG CACCCCCCCA GCCAGGCTCA TCTACGAAAT AAAAAAAGAA TTCGAAGAGT ACTACAAAGC GCTCAAGGCT AGGAAAGAGG AGATAAAGAA GGCGCTTGAA ACGGCTTTAA ACTCATCTCG GAACAGCCAA 
CGTGGATAG
|
Protein sequence | MILIVAEKRS VAHAIAKFLG GRYKLEKIQG VAAYRFNYGG REAVALGLSG HLMDFDFTAR QNVWTWIPPE ELFASQPLIV YRPETMKYIR ALRTLAARAH EVYLALDADV EGEAIAYEAA LLVRLVNPRA KIYRVRFNAV TQREITNAFR NPTHINLRMV EKVFTRMQVD LTLGAVFTRF ITLAVRHSLD RGQFLSYGPC QTPVLGIVVT RELQRRNFKP EKYYVVKALV EIGGHRIEMS ADVRFKTRKE AEEAAATINR GVVKAAVYRP HHVNPPVPLE TVELERRASR WLGINSKRTL DIAEELYRAG YISYPRTETT IYPPTLDLRE VLQELASGHL GSYADELMRR GFRPTRGDSD DRAHPPIYPT RAATKGEVAK AFGKLAPQAW AIYDFVVRHF LATLSPPAVV EKQKIIVSFG KLEMEAEGQL VVDEGYWRIY PWERQSSKPL PRVSPGDPAR AVKVDVVERE TEPPPQMTES ELLALMKKYG IGTDATMQDH IHTNVRRGYM KITKGKCIPT DLGIALATSL FQFAPQLIEP TVRAKIEKAL NSIVTDGTPP ARLIYEIKKE FEEYYKALKA RKEEIKKALE TALNSSRNSQ RG
|
| |
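The gene sequence begins with GTG while the protein sequence begins with M. That is consistent with translation table 11, in which GTG is one of the alternative initiation codons. The following is a minimal sketch, not the database's own pipeline, of translating a CDS under table 11 and computing a GC fraction. The codon table is built from the standard code, which table 11 shares for elongation. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 21 and last 12 bases of the record are used as inputs.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M
    and stopping at the first stop codon."""
    seq = clean(cds)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# First 21 bases of the gene sequence in this record.
print(translate("GTGATACTAA TCGTGGCGGA G"))  # MILIVAE
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CAACGTGGATAG"))             # QRG
```

The two outputs match the first seven and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.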