Gene Hmuk_3010 details


Gene Information

Locus tag: Hmuk_3010
Symbol:
ID: 8412563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2896288
End bp: 2898879
Gene Length: 2592 bp
Protein Length: 863 aa
Translation table: 11
GC content: 69%
IMG OID: 645021357
Product: DNA topoisomerase I
Protein accession: YP_003178822
Protein GI: 257389049
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01057] DNA topoisomerase I, archaeal
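
The gene and protein lengths reported above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The function name cds_lengths is illustrative and is not part of the record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(2896288, 2898879)
print(gene_len, protein_len)  # 2592 863
```

The result agrees with the Gene Length (2592 bp) and Protein Length (863 aa) fields.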


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAACTGA TCATCACCGA AAAGGAGAAC GCCGCCCGCC GCATCGCGGA GATTCTCAGC 
GACGGCGCTG CGACCGCCGA GCGCGAGAAC GGCGTCAACG TCTACCGCTG GGGGACCAAA
CGCTGCATGG GGCTGTCGGG ACACGTCGTC GGCGTGGACT TCCCGCCGGA GTACTCCGAC
TGGCGCGACG TCGAGCCGGC GGAGCTGGTC GCGGCAGACG TGGTCAAGTC CGCGACCCAG
GAGAACATCG TCGCGACCCT GCGCAAGCTG GCCCGCAGGG CCGACGAGGC GACGATCGCG
ACGGACTACG ACCGCGAGGG GGAGCTGATC GGCAAGGAGG CGTACGAACT GATCCGCGAA
GAGACCGACG TGCCCGTCGA TCGCGTGCGG TTCTCCTCGA TCACCGACCG CGAGGTCCGG
GAGGCCTTCG CCAACCCCGA CGAGATCGAC TTCGACCTCG CCGCCGCCGG TGAGGCCCGA
CAGGTGATCG ACCTCGTGTG GGGCGCGGCG CTGACTCGGT TCCTCTCGCT GTCGGCCCGA
CAGCTCGGCG ACGACTTCAT CTCGGTCGGC CGCGTCCAGT CCCCGACGCT GAAGCTGATC
GTCGACCGCG AGCGCGAGAT CCAGGCGTTC GATCCCGAGG ACTACTGGGA GATATTCGGC
GACCTCCAGG GCGACGACGC CGCCTTCGAG GCTCAGTACT TCTACGACGA CGACGGCAGC
GAGGCCGAGC GCGTCTGGGA CGAGGCGACC GCCGAATCCG TCTACGAGGA CCTGGCCGCG
CTCGACGCCG CGACGGTCAC GTCCGTCCGC GAGCGGACCC GCACCGACGC GCCACCGGAG
CCGTTCAACA CCACCGCCTT CATCTCCGCC GCGGGCTCGC TTGGCTACTC GGCCCAGCGG
GCGATGAGCC TCGCAGAGGA GCTGTACACC GCCGGCTACA TGACCTACCC CCGGACGGAC
AACACGGTCT ATCCCGAGGA CCTGGAGCCC GACGAACTGC TGGACGAGTT CGTCGGCAGC
TCGACCTTCG GCGAGGACGC CGAGTCGCTG CTCGAACAGG ACGAGATCGT CCCCACCGAG
GGCGACGAGG AGACGACCGA CCACCCGCCG ATCCACCCGA CCGGGGAACT CCCCCCGCGG
GGCGAGCTGG ACGACGACGA GTGGGAGATC TACGAGCTCG TCGTCCGGCG CTTCTTCGCG
ACGGTCGCCG AGCCCGCGAA GTGGGCCCAC CTCCGGGTGG TCGCCGAGGC TGCTGGCCAC
TCGCTGAAGG CCAACGGCAA GCGCCTGCTC GAACCGGGCT ACCACGCCGT CTACCCCTAC
TCCTCGGCCA GCGAGAACGT CGTCCCGGCC GTCTCCGAGG GCGACGCGCT GACGATGAGC
GACGTGAGCC TCGACGCCAA ACAGACCCAG CCACCGCGTC GCTACGGGCA GTCCCGCCTC
ATCAAGAAGA TGGAAGACTT CGGGCTGGGC ACGAAGAGCA CGCGACACAA CACCATCGAG
AAGCTGTACG ACCGGGGGTA CATCGAGGGC GACCCGCCGC GGCCGACGAC CCTGGCCGAG
GCCGTCGTCG AGGCCGCAGA GGAGTTCGCC GACCGGATCG TCAGCGAGGA GATGACCGCC
CAGATGGAGG CCGACATGTC CGCGATCGCG AGCGGCGAGG CCACGCTGGA CGAGGTCACC
GCGGAGTCCC GCGAGATGCT CGATCGGGTG TTCGAGGAGC TGGCCGACTC CCGCGAGGAG
GTCGGCGACT ACCTCCAGGA GTCGCTGAAA GCCGACAAGA CGCTCGGTCC CTGTCCGGAG
TGTGGCGAGG ACCTGCTCGT GCGTCGCTCG CGACAGGGGT CGTACTTCGT CGGCTGTGAC
GGCTTCCCCG AGTGTCGCTT TACCCTCCCG CTGCCGTCGA CCGGCGAGCC GCAGGTCCTC
GACGACAGCT GCGAGGACCA CGGCCTCAAC CACGTGAAGA TGCTGGCCGG TCGCGACACC
TTCGTCCACG GCTGTCCCCG CTGCGAGGCC GAGAAGGCCG ACGAGACCGA CGACGAGGTG
ATCGGAGCCT GTCCCGAGTG TGGGGAGGGC GCTGCTCCGG AGGAGCAACG AGGCGAAGGC
GGTGATAGCG AGGCGCAAAG CGCCTCGGAC CATTCGAGCG GGCAAAGCCC GCGAGAAGAA
ACCGCCGAAG CGCACGGCGG GAAGCTGGCG ATCAAGCGCC TGCGCTCCGG CTCGCGACTC
GTGGGCTGTA CTCGCTATCC CGACTGCGAG TACTCGCTGC CGCTGCCGCG AAACGGCGAC
ATCGTGGTCG GCGAGGACCG CTGTGCGGAA CACGATCTGC CCGAGATCTC GATCGTCGAC
GACGAGGACG ACGACGATCC CTGGGAGCTT GGCTGTCCGA TCTGCAACTA CGCCGAGTAC
CAGGCCCGCA ACGCGATCGA CGACCTCGAA GATCTGAACG GCATCGGCGG CGCGACCGCC
GAGAAGCTCG CCGCCGTCGG CATCGACGAG CCCGCCGACC TCCGGGCGGT CGAACCCGAC
GAGATCGCCG CCAACGTCCA GGGCGTGTCG GCCAGCCAGA TCGAGGACTG GCAGGCCGAA
CTGGCGGACT GA
 
Protein sequence
MELIITEKEN AARRIAEILS DGAATAEREN GVNVYRWGTK RCMGLSGHVV GVDFPPEYSD 
WRDVEPAELV AADVVKSATQ ENIVATLRKL ARRADEATIA TDYDREGELI GKEAYELIRE
ETDVPVDRVR FSSITDREVR EAFANPDEID FDLAAAGEAR QVIDLVWGAA LTRFLSLSAR
QLGDDFISVG RVQSPTLKLI VDREREIQAF DPEDYWEIFG DLQGDDAAFE AQYFYDDDGS
EAERVWDEAT AESVYEDLAA LDAATVTSVR ERTRTDAPPE PFNTTAFISA AGSLGYSAQR
AMSLAEELYT AGYMTYPRTD NTVYPEDLEP DELLDEFVGS STFGEDAESL LEQDEIVPTE
GDEETTDHPP IHPTGELPPR GELDDDEWEI YELVVRRFFA TVAEPAKWAH LRVVAEAAGH
SLKANGKRLL EPGYHAVYPY SSASENVVPA VSEGDALTMS DVSLDAKQTQ PPRRYGQSRL
IKKMEDFGLG TKSTRHNTIE KLYDRGYIEG DPPRPTTLAE AVVEAAEEFA DRIVSEEMTA
QMEADMSAIA SGEATLDEVT AESREMLDRV FEELADSREE VGDYLQESLK ADKTLGPCPE
CGEDLLVRRS RQGSYFVGCD GFPECRFTLP LPSTGEPQVL DDSCEDHGLN HVKMLAGRDT
FVHGCPRCEA EKADETDDEV IGACPECGEG AAPEEQRGEG GDSEAQSASD HSSGQSPREE
TAEAHGGKLA IKRLRSGSRL VGCTRYPDCE YSLPLPRNGD IVVGEDRCAE HDLPEISIVD
DEDDDDPWEL GCPICNYAEY QARNAIDDLE DLNGIGGATA EKLAAVGIDE PADLRAVEPD
EIAANVQGVS ASQIEDWQAE LAD
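
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal version of that check, not the database's own method. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code, and it assumes the first codon (GTG here) is an initiator read as methionine. The names CODON_TABLE and translate_cds are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares; "*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a CDS, ignoring the whitespace used to group bases."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGAACTGA TCATCACCGA AAAGGAGAAC"))  # MELIITEKEN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function gives a translation that can be compared with the listed protein in full.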