Gene Hlac_0105 details

Gene Information

Locus tag: Hlac_0105
Symbol:
ID: 7401623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 107619
End bp: 110546
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 66%
IMG OID: 643707166
Product: DNA-directed RNA polymerase subunit A'
Protein accession: YP_002564781
Protein GI: 222478544
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02390] DNA-directed RNA polymerase subunit A'
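
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is taken to be a stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(107619, 110546)
print(length, protein_length(length))  # prints: 2928 975
```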


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.811343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAATGC AAACACCGAA AGTACTCGGC GGCATCGACT TCGGCCTCAT GGACCCGGAA 
ACGTACCGGG ACATGTCCGC GACGAAGGTG ATCACGGCCG ACACGTATAA CGACGACGGC
TACCCGATCG ACATGGGGCT GATGGACCCG CGGCTGGGCG TCATCGACCC CGGCTTAGAG
TGTCGGACCT GCGGTTCCCA CTCTGGCTCC TGTAACGGCC ACTTCGGCCA CATCGAGCTT
GCGGCCCCCG TCATTCACGT GGGCTTCACG AAGCTCATCC GACGACTGCT CCGCTCGACG
TGCCGGGAGT GCGGCAAGCT CGCCTTGGAC GAGGAGCAGC GCGACGAGTT CCGCGACCGC
TACGAGCGGG CGAAAGAGCT CGGCAACGAC GAACACGACG TGTTGAAGGC TGCCGTCCGG
CAGGCCCGGA AGGCGTCGAC GTGCCCGTTC TGCGGTGAGC CGCAGGCGGA CATCAAACAC
GAGAAACCGA CCACCTACTA CGAGGTTCAG GACGTGCTCT CGGGCGATTA CTCCGAGCGC
ATCGCGGCAG CGATGCAGCC CGATGAGGAG GAGGACGACC CTGGCACCTC GCCACAGGAG
CTCGCCGAGA AGACCGACAT CGACTTAGAG CGGATCAACG AGATCATGGC CGGCGAGTTC
CGTCCGCGCA AGGAGGACCG CCGCGCCATT GAGAAGGCGC TCGACATCGA CCTCACCGAA
GAGGACATGA ACAAGCTGAT GCCCTCGGAC ATCCGCGACT GGTTCGAGGA CATCCCGGAC
GAGGACCTGG AGGTGCTCGG CATCGACTCT GAGCACTCCC GGCCGGAGTG GATGATCCTG
ACGGTGCTTC CGGTCCCGCC GGTCACCACC CGACCGTCCA TCACGCTCGA CAACGGCCAG
CGCTCCGAGG ACGACCTCAC GCACAAGCTG GTCGACATCA TTCGGATCAA CCAGCGGTTC
ATGGAGAACC GCGAGGCAGG TGCCCCGCAG CTGATCATCG AGGACCTCTG GGAACTGCTC
CAGTACCACG TCACCACGTT CGTCGACAAC GAGATCAGCG GGACGCCGCC GGCGCGCCAC
CGCTCCGGCC GCCCGCTCAA GACCCTCAGC CAGCGGCTGA AGGGGAAGGA GGGCCGCTTC
CGTGGCTCGC TGTCCGGGAA ACGCGTCAAC TTCTCCGCCC GGACCGTCAT CTCGCCGGAC
CCGACGCTCT CGCTGAACGA GGTCGGCGTC CCGGACCGGG TCGCCATGGA GATGACGCAG
ACGCTCAACG TCACCGAGCG CAACGTCGAG GAGGCGCGTC AGTACGTCCG GAACGGCCCC
GAGGCCCATC CCGGCGCCAA CTACGTCCGT CGTCCCGACG GTCGACGGCT GAAGGTGACC
GAGAAGAACT GCGAGGAACT CGCCGAGAAG GTCGAAGCCG ACTGGGAGGT CAACCGTCAC
CTGGTCGACG GCGACATCGT GATCTTCAAC CGGCAGCCCT CGCTGCACCG GATGTCCATC
ATGGCCCACG AGGTCGTGGT GATGCCGTAC AAGACGTTCC GGCTCAACAC CGTCGTCTGT
CCGCCGTACA ACGCCGACTT CGACGGCGAC GAGATGAACA TGCACGCGCT CCAGAACGAG
GAGGCCCGCG CCGAGGCGCG CGTCCTCATG CGCGTCCAAG AGCAGATCCT CTCGCCGCGG
TTCGGCGGGA ACATCATCGG TGCCATCCAG GACCACATCT CCGGGACCTA CCTGCTCACG
CACTCGAATC CCGAGTTCAC GGAAACGCAG GCGCTGGACC TACTGCGCGC GACCCGAGTC
GACGAGCTGC CCGAGGCGGA CGGCGTCGAC GACGAGGGAC GCGAGTACTG GACCGGCCGG
ACGCTGTTCT CGGAGCTGCT GCCCGACGAC CTCTCGTTGC ACTTCTCGTC GTCGACTGGC
GATGACGTCA TCATCGAGGA CGGCCAGCTG ATCGAGGGGA CGATCGACGA GGACGCCGTC
GGCGCGTTCG GCGGCGAGGT CGTCGACACC CTCACCAAGG AGTACGGCGA GACGCGCTCG
CGCGTGTTCA TCAACGAGAT CGCGTCGCTG GCGATGCGCG CGATCATGCA CTTCGGGTTC
TCGATCGGGA TCGACGACGA GTCGATTCCG CCGGAGGCCG AAGAGCAGGT CGACGACGCC
ATCGAGAGCG CTTACGACCG CGTTCAGGAG CTGATCGCGA CGTACGAGGC CGGCGAGCTG
GAGTCGCTAC CCGGCCGCGG CGTCGACGAG ACGCTGGAGA TGAAGATCAT GCAGACGCTC
GGGAAGGCCC GCGACTCGGC CGGTGATATC GCCGACCAGC ACTTCGGCGA CGACAACCCG
GCGGTCGTGA TGGCCCGCTC CGGCGCGCGT GGGTCGATGC TGAATCTCAC GCAGATGGCC
GGCTCCGTCG GCCAGCAGGC GGTCCGCGGC GAGCGGATCA ACCGCGGCTA CGAGGATCGG
ACGCTCTCCC ACTACCGGCC GAACGATCTG TCCTCTGAGG CGCACGGCTT CGTGGAGAAC
TCCTACCGCA GCGGGCTCAC GCCCGAGGAG TTCTTCTTCC ACGCGATGGG CGGTCGCGAG
GGGCTGGTCG ACACGGCGGT CCGGACCTCG AAGTCCGGTT ACCTCCAGCG TCGGCTCATC
AACGCACTCT CCGAGCTGGA GGCGCAGTAC GACGGCTCGG TCCGGGACAC CTCGGGCCGG
ATCGTCCAGT TCGAGTTCGG CGAGGACGGT ACCTCGCCGG TGAAAGTCTC CTCGGGCGAA
GGCGACGGCA TCGACGTCGA CGACATCGTC GACCGCGTCG TCGACTCGGA GTTCGACTCC
GACGACGAGA AGGAGCGGTT CCTCGGCGAG CGCACGCCGC CGACGAACCT CTCGGAGCAT
TCTGGACCGG GCCTGAACAA GGCCTCGGGG GTGGAGTCGG ATGACTGA
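
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below is a hedged illustration of how a GC fraction could be computed from such a block. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value does not correspond to the whole-gene GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the first displayed line of the gene sequence only.
first_line = "ATGTCAATGC AAACACCGAA AGTACTCGGC GGCATCGACT TCGGCCTCAT GGACCCGGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block to `clean_sequence` in place of `first_line` would give the figure for the whole gene.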
 
Protein sequence
MSMQTPKVLG GIDFGLMDPE TYRDMSATKV ITADTYNDDG YPIDMGLMDP RLGVIDPGLE 
CRTCGSHSGS CNGHFGHIEL AAPVIHVGFT KLIRRLLRST CRECGKLALD EEQRDEFRDR
YERAKELGND EHDVLKAAVR QARKASTCPF CGEPQADIKH EKPTTYYEVQ DVLSGDYSER
IAAAMQPDEE EDDPGTSPQE LAEKTDIDLE RINEIMAGEF RPRKEDRRAI EKALDIDLTE
EDMNKLMPSD IRDWFEDIPD EDLEVLGIDS EHSRPEWMIL TVLPVPPVTT RPSITLDNGQ
RSEDDLTHKL VDIIRINQRF MENREAGAPQ LIIEDLWELL QYHVTTFVDN EISGTPPARH
RSGRPLKTLS QRLKGKEGRF RGSLSGKRVN FSARTVISPD PTLSLNEVGV PDRVAMEMTQ
TLNVTERNVE EARQYVRNGP EAHPGANYVR RPDGRRLKVT EKNCEELAEK VEADWEVNRH
LVDGDIVIFN RQPSLHRMSI MAHEVVVMPY KTFRLNTVVC PPYNADFDGD EMNMHALQNE
EARAEARVLM RVQEQILSPR FGGNIIGAIQ DHISGTYLLT HSNPEFTETQ ALDLLRATRV
DELPEADGVD DEGREYWTGR TLFSELLPDD LSLHFSSSTG DDVIIEDGQL IEGTIDEDAV
GAFGGEVVDT LTKEYGETRS RVFINEIASL AMRAIMHFGF SIGIDDESIP PEAEEQVDDA
IESAYDRVQE LIATYEAGEL ESLPGRGVDE TLEMKIMQTL GKARDSAGDI ADQHFGDDNP
AVVMARSGAR GSMLNLTQMA GSVGQQAVRG ERINRGYEDR TLSHYRPNDL SSEAHGFVEN
SYRSGLTPEE FFFHAMGGRE GLVDTAVRTS KSGYLQRRLI NALSELEAQY DGSVRDTSGR
IVQFEFGEDG TSPVKVSSGE GDGIDVDDIV DRVVDSEFDS DDEKERFLGE RTPPTNLSEH
SGPGLNKASG VESDD
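
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that NCBI translation table 11 assigns codons to amino acids in the same way as the standard code and differs from it only in the permitted start codons. The function name and the compact table layout are illustrative choices.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in blocks of ten.
print(translate("ATGTCAATGC AAACACCGAA AGTACTCGGC"))  # prints: MSMQTPKVLG
```

The ten residues printed match the first block of the protein sequence listed above.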