Gene Hlac_0259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0259 
Symbol 
ID7401185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp279524 
End bp282118 
Gene Length2595 bp 
Protein Length864 aa 
Translation table11 
GC content68% 
IMG OID643707322 
ProductDNA topoisomerase type IA central domain protein 
Protein accessionYP_002564934 
Protein GI222478697 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGCG GCCCCGAACT GATAATCACG GAGAAAGACA ACGCGGCGCG TCGCATCGCC 
GACATCCTGA GCGGCGAGTC CGCAACGGCG GAGCGGGAAA ACGACGTGAA CGTGTACAAG
TGGGGCGGCA AGCGCTGTAT CGGCCTCTCG GGTCACGTCG TCGGCGTCGA CTTCCCCGCC
GAGTACAACG ATTGGCGCGA CGTGGAACCC GTCGAACTGA TCGACGCACC GATCACCAAA
GAGCCCACCC AAGAGGGAAT CGTCGCGGCC CTCCGGAAAC TGGCGCGCAA CGCCTCCCGA
GTCGTCATCG CGACCGACTA CGACCGCGAG GGCGAGCTGA TCGGCAAGGA GGCGTACGAG
CTGGTGCGCG AGGTGAACGA GGACGCTCCC ATCGACCGCG TCCGGTTCTC CTCGATCACC
GACAACGAGG TAAACGAGGC GTTCGCGAAC CCCGACGAGC TCGACTTCGA TCTGGCCGCC
GCCGGGGAGG CCCGGCAGGT GATCGACCTG ACGTGGGGCG CGGCGCTCAC CCGCTTCCTC
TCGCTGTCCG CCCGCCAGCT CGGCGAGGAC TTCATCTCCG TGGGCCGGGT GCAGGGACCG
ACGCTCAAGC TGATCGTCGA CCGCGAGCGA GAGATCAAAG CGTTCGATCC CGAAGCGTAC
TGGGAGCTGT ACGGGAACCT GACCAAGTCC GGCGGCGACC CCTTCGAGGC GCGCTACTTC
TACCTGAACG ACGAGGGCAA CGAGGCCGAG CGCGTCTGGA ACGGCGACGT CGCGGAGGTC
CTCACGGAGG CGTTCGACGC GGCCGACGAG GCGGTCGTCG ACGACGTGCG CCGGCGGACC
CGCACCGACG ACCCGCCGAC CCCGTTCAAC ACCACGGCGT TCATCCGCGC CGCAGGCTCG
CTCGGACACT CCGCGCAACG CGCGATGTCG CTCGCTGAGG ACTTGTACAC GGCTGGCTAC
GTCACCTACC CCCGGACTGA TAACACGGTG TACCCAGAGG ATCTCGACCC CCGCGAGCTG
ATCGAGGAGC TGTCGGTCGC CTCGACGTTC GGGAAAGACG CGAAGAGCCT CCTCGAACAG
GAGGAGATCG AGCCCACCGA GGGCGACGAG GAGACGACCG ATCACCCGCC GATCCACCCG
ACCGGGGAGC TTCCCTCCGC TTCCGACCTC TCGGAGGACG AGTGGGAGGT GTACGAGCTG
ATCGTCCGCC GCTTCCTCGC GACCTGCGCC GAGCCCGCGA CGTGGGAGCG GCTCCGCGTC
GTCGCGCTCG CGAACGACGA GGCGACCGCG ATCGCCGAGC GCGCGGACGG GCTCGCCGCT
CTCCGCAATC CCGAGGAGGC GAGCGGCCCC GGCGACCTCG TCGCCGACGG CGGGCTCCGG
CTGAAGGCGA ACGGGAAGCG CCTGTTGGAG GCCGGCTACC ACGACGTGTA CCCGTACCGC
TCCAGCGACG AGCGGATCGT CCCCGACGTC GAGGTCGGCG AGACGCTCGC GCTCACCGAC
CGGCGGACGG ACGCGAAGGA GACTCAGCCG CCCCGTCGAT ACGGGCAGTC CCGGCTCATC
GAGGAGATGG AGAAGCGCGG CGTCGGCACG AAGGCGACCC GACACCGTAC CCTCGAAAAG
CTGTACGACC GCAACTACAT CGAGAGCGAC CCGCCGCGGC CGACCCGGCT CGCGGAGGCG
GTCGTCGAGG CCGCAGAAGA GTTCGCGGAA CACGTCGTGA GCGAGGAGAT GACCGCCCAG
CTCGAACGCG ACATGCAGGC GATCGCGGCC GGCGAGAAGG GGTACGACGA GGTGACCGAG
GCATCCCGCG AGCTGCTCAA CCGCGTGTTC GACGACCTCA CCGAGTCGCG CGAGGCGGTC
GGCGACCACC TCCAGAAGTC GCTGAAGGCG GATAAGACGC TCGGCCCCTG CCCGGAATGC
GGCTCCGACC TCCTCGTCCG AAAGTCTCGA CAGGGGTCGT ACTTCGTCGG CTGCGACGGC
TACCCGGACT GTGAGTACAC CCTTCCGCTC CCCTCCAGCG GGAAGCCGCT GCTCTTAGAC
GAGACCTGCG AGGAGCACGA ACTCCGGCAC GTGAAGATGC TCGCAGGCCG GAAGACGTTC
GTCCACGGCT GCCCGCAGTG CAAGGCCGAC GAGGCCGACG AGCAGGAAGA CGAGGTTATC
GGGGCGTGTC CGGAGTGTGG CGAGGAGCAC GGCGGAGAGT TAGCTATCAA GCGGCTCCGC
TCTGGCTCCC GACTCGTCGG CTGCACGCGC TACCCCGACT GCGACTACTC GCTGCCGCTC
CCCCGGCGCG GTGAGATCGA GGTCACCGAC GAAATCTGCG AGGAGCACGG CCTTCCGCAC
CTCCGGGTCC ACTCCGGCGA CGAGCCGTGG GAGCTCGGCT GTCCCATCTG CAACTACCGG
GAGTTCACCG CCCGGCAGGA GGGCTCGGAG CTCCAGACCG TCGAAGGGAT CGGCGAGAAG
ACCGCCGAGA AGCTGAAAGA CGCCGGCGTC GACGGCGTCG ACGACCTCAA GTCGATCGAT
CCGGACGAGT TGGCGGCCGA CGTCGACGGC GTGGGCGCGG ACACGGTCCG CGACTGGCAG
GCGAAGGCGG ACTAA
 
Protein sequence
MSRGPELIIT EKDNAARRIA DILSGESATA ERENDVNVYK WGGKRCIGLS GHVVGVDFPA 
EYNDWRDVEP VELIDAPITK EPTQEGIVAA LRKLARNASR VVIATDYDRE GELIGKEAYE
LVREVNEDAP IDRVRFSSIT DNEVNEAFAN PDELDFDLAA AGEARQVIDL TWGAALTRFL
SLSARQLGED FISVGRVQGP TLKLIVDRER EIKAFDPEAY WELYGNLTKS GGDPFEARYF
YLNDEGNEAE RVWNGDVAEV LTEAFDAADE AVVDDVRRRT RTDDPPTPFN TTAFIRAAGS
LGHSAQRAMS LAEDLYTAGY VTYPRTDNTV YPEDLDPREL IEELSVASTF GKDAKSLLEQ
EEIEPTEGDE ETTDHPPIHP TGELPSASDL SEDEWEVYEL IVRRFLATCA EPATWERLRV
VALANDEATA IAERADGLAA LRNPEEASGP GDLVADGGLR LKANGKRLLE AGYHDVYPYR
SSDERIVPDV EVGETLALTD RRTDAKETQP PRRYGQSRLI EEMEKRGVGT KATRHRTLEK
LYDRNYIESD PPRPTRLAEA VVEAAEEFAE HVVSEEMTAQ LERDMQAIAA GEKGYDEVTE
ASRELLNRVF DDLTESREAV GDHLQKSLKA DKTLGPCPEC GSDLLVRKSR QGSYFVGCDG
YPDCEYTLPL PSSGKPLLLD ETCEEHELRH VKMLAGRKTF VHGCPQCKAD EADEQEDEVI
GACPECGEEH GGELAIKRLR SGSRLVGCTR YPDCDYSLPL PRRGEIEVTD EICEEHGLPH
LRVHSGDEPW ELGCPICNYR EFTARQEGSE LQTVEGIGEK TAEKLKDAGV DGVDDLKSID
PDELAADVDG VGADTVRDWQ AKAD