Gene Pars_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1359 
Symbol 
ID5054064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1221280 
End bp1223088 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content57% 
IMG OID640468905 
ProductDNA topoisomerase 
Protein accessionYP_001153574 
Protein GI145591572 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.013738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATACTAA TCGTGGCGGA GAAGCGCTCG GTGGCGCATG CCATTGCAAA GTTCCTCGGC 
GGGCGGTACA AACTGGAGAA GATACAAGGC GTGGCGGCCT ACCGCTTTAA CTACGGCGGA
AGAGAAGCCG TTGCGCTGGG GCTAAGCGGC CACCTTATGG ACTTCGACTT CACGGCTAGG
CAGAACGTGT GGACGTGGAT ACCGCCAGAG GAGCTCTTCG CATCGCAACC CCTCATAGTT
TACAGACCGG AGACTATGAA ATATATCAGG GCCCTTAGAA CCCTCGCCGC AAGGGCGCAC
GAGGTTTACC TTGCACTGGA CGCCGACGTG GAGGGCGAGG CCATAGCCTA CGAGGCGGCT
CTCTTGGTGC GACTTGTGAA CCCCCGCGCC AAGATTTACC GCGTCCGCTT CAACGCCGTG
ACTCAGAGGG AGATAACAAA CGCCTTCAGA AACCCCACCC ACATCAACTT GAGGATGGTG
GAGAAGGTAT TCACCAGGAT GCAAGTTGAC CTCACCCTAG GTGCCGTCTT CACCCGGTTC
ATAACACTCG CCGTGAGACA CTCGCTTGAT AGGGGACAGT TCCTCAGCTA CGGGCCGTGT
CAAACGCCCG TCCTGGGCAT CGTGGTCACC CGCGAATTAC AGAGGAGGAA TTTCAAGCCG
GAAAAGTACT ACGTCGTGAA GGCCCTAGTG GAGATAGGCG GCCACAGAAT AGAGATGTCT
GCTGACGTGA GGTTTAAGAC TAGAAAGGAG GCAGAGGAGG CGGCGGCAAC TATTAACCGC
GGCGTTGTAA AAGCCGCCGT GTACAGGCCA CACCACGTCA ATCCGCCAGT GCCGCTCGAG
ACTGTGGAGC TGGAGAGGAG GGCAAGCCGG TGGCTTGGGA TAAACTCGAA GCGGACCCTG
GACATAGCAG AGGAGCTCTA CAGAGCAGGC TACATATCTT ACCCCCGCAC AGAGACCACC
ATATACCCAC CGACGCTGGA TCTCAGAGAA GTACTACAAG AACTAGCCAG CGGCCACCTT
GGCTCCTACG CCGACGAGCT GATGAGGCGC GGTTTCAGGC CCACTCGCGG GGATTCAGAC
GACAGAGCAC ACCCGCCTAT ATACCCCACC AGAGCCGCTA CTAAAGGCGA AGTCGCAAAG
GCCTTCGGCA AACTCGCCCC CCAAGCATGG GCTATCTACG ACTTCGTGGT GCGGCACTTC
CTAGCCACCC TCAGCCCACC GGCGGTGGTA GAAAAACAGA AAATCATAGT CTCTTTCGGC
AAACTCGAAA TGGAGGCAGA GGGACAGCTG GTCGTCGACG AGGGCTACTG GCGCATTTAC
CCATGGGAGA GGCAGAGCAG TAAGCCCCTG CCTCGCGTAA GCCCCGGAGA CCCAGCAAGG
GCGGTGAAGG TAGATGTGGT AGAGCGGGAG ACCGAGCCGC CGCCCCAGAT GACCGAGTCG
GAGCTACTGG CACTGATGAA GAAGTACGGC ATAGGCACCG ACGCCACTAT GCAGGACCAC
ATACACACGA ACGTCAGGAG GGGCTACATG AAAATCACAA AAGGGAAGTG CATCCCCACT
GACCTCGGCA TAGCGCTAGC CACGTCGCTC TTCCAGTTCG CCCCCCAACT CATAGAGCCA
ACTGTAAGGG CTAAAATAGA GAAAGCGCTT AACTCCATAG TCACAGACGG CACCCCCCCA
GCCAGGCTCA TCTACGAAAT AAAAAAAGAA TTCGAAGAGT ACTACAAAGC GCTCAAGGCT
AGGAAAGAGG AGATAAAGAA GGCGCTTGAA ACGGCTTTAA ACTCATCTCG GAACAGCCAA
CGTGGATAG
 
Protein sequence
MILIVAEKRS VAHAIAKFLG GRYKLEKIQG VAAYRFNYGG REAVALGLSG HLMDFDFTAR 
QNVWTWIPPE ELFASQPLIV YRPETMKYIR ALRTLAARAH EVYLALDADV EGEAIAYEAA
LLVRLVNPRA KIYRVRFNAV TQREITNAFR NPTHINLRMV EKVFTRMQVD LTLGAVFTRF
ITLAVRHSLD RGQFLSYGPC QTPVLGIVVT RELQRRNFKP EKYYVVKALV EIGGHRIEMS
ADVRFKTRKE AEEAAATINR GVVKAAVYRP HHVNPPVPLE TVELERRASR WLGINSKRTL
DIAEELYRAG YISYPRTETT IYPPTLDLRE VLQELASGHL GSYADELMRR GFRPTRGDSD
DRAHPPIYPT RAATKGEVAK AFGKLAPQAW AIYDFVVRHF LATLSPPAVV EKQKIIVSFG
KLEMEAEGQL VVDEGYWRIY PWERQSSKPL PRVSPGDPAR AVKVDVVERE TEPPPQMTES
ELLALMKKYG IGTDATMQDH IHTNVRRGYM KITKGKCIPT DLGIALATSL FQFAPQLIEP
TVRAKIEKAL NSIVTDGTPP ARLIYEIKKE FEEYYKALKA RKEEIKKALE TALNSSRNSQ
RG