Gene Information |
Locus tag | Hlac_0105 |
Symbol | |
ID | 7401623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 107619 |
End bp | 110546 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707166 |
Product | DNA-directed RNA polymerase subunit A' |
Protein accession | YP_002564781 |
Protein GI | 222478544 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02390] DNA-directed RNA polymerase subunit A' |
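The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hlac_0105.
length_bp = gene_length(107619, 110546)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the result agrees with the listed Gene Length (2928 bp) and Protein Length (975 aa).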
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.811343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATGC AAACACCGAA AGTACTCGGC GGCATCGACT TCGGCCTCAT GGACCCGGAA ACGTACCGGG ACATGTCCGC GACGAAGGTG ATCACGGCCG ACACGTATAA CGACGACGGC TACCCGATCG ACATGGGGCT GATGGACCCG CGGCTGGGCG TCATCGACCC CGGCTTAGAG TGTCGGACCT GCGGTTCCCA CTCTGGCTCC TGTAACGGCC ACTTCGGCCA CATCGAGCTT GCGGCCCCCG TCATTCACGT GGGCTTCACG AAGCTCATCC GACGACTGCT CCGCTCGACG TGCCGGGAGT GCGGCAAGCT CGCCTTGGAC GAGGAGCAGC GCGACGAGTT CCGCGACCGC TACGAGCGGG CGAAAGAGCT CGGCAACGAC GAACACGACG TGTTGAAGGC TGCCGTCCGG CAGGCCCGGA AGGCGTCGAC GTGCCCGTTC TGCGGTGAGC CGCAGGCGGA CATCAAACAC GAGAAACCGA CCACCTACTA CGAGGTTCAG GACGTGCTCT CGGGCGATTA CTCCGAGCGC ATCGCGGCAG CGATGCAGCC CGATGAGGAG GAGGACGACC CTGGCACCTC GCCACAGGAG CTCGCCGAGA AGACCGACAT CGACTTAGAG CGGATCAACG AGATCATGGC CGGCGAGTTC CGTCCGCGCA AGGAGGACCG CCGCGCCATT GAGAAGGCGC TCGACATCGA CCTCACCGAA GAGGACATGA ACAAGCTGAT GCCCTCGGAC ATCCGCGACT GGTTCGAGGA CATCCCGGAC GAGGACCTGG AGGTGCTCGG CATCGACTCT GAGCACTCCC GGCCGGAGTG GATGATCCTG ACGGTGCTTC CGGTCCCGCC GGTCACCACC CGACCGTCCA TCACGCTCGA CAACGGCCAG CGCTCCGAGG ACGACCTCAC GCACAAGCTG GTCGACATCA TTCGGATCAA CCAGCGGTTC ATGGAGAACC GCGAGGCAGG TGCCCCGCAG CTGATCATCG AGGACCTCTG GGAACTGCTC CAGTACCACG TCACCACGTT CGTCGACAAC GAGATCAGCG GGACGCCGCC GGCGCGCCAC CGCTCCGGCC GCCCGCTCAA GACCCTCAGC CAGCGGCTGA AGGGGAAGGA GGGCCGCTTC CGTGGCTCGC TGTCCGGGAA ACGCGTCAAC TTCTCCGCCC GGACCGTCAT CTCGCCGGAC CCGACGCTCT CGCTGAACGA GGTCGGCGTC CCGGACCGGG TCGCCATGGA GATGACGCAG ACGCTCAACG TCACCGAGCG CAACGTCGAG GAGGCGCGTC AGTACGTCCG GAACGGCCCC GAGGCCCATC CCGGCGCCAA CTACGTCCGT CGTCCCGACG GTCGACGGCT GAAGGTGACC GAGAAGAACT GCGAGGAACT CGCCGAGAAG GTCGAAGCCG ACTGGGAGGT CAACCGTCAC CTGGTCGACG GCGACATCGT GATCTTCAAC CGGCAGCCCT CGCTGCACCG GATGTCCATC ATGGCCCACG AGGTCGTGGT GATGCCGTAC AAGACGTTCC GGCTCAACAC CGTCGTCTGT CCGCCGTACA ACGCCGACTT CGACGGCGAC GAGATGAACA TGCACGCGCT CCAGAACGAG GAGGCCCGCG CCGAGGCGCG CGTCCTCATG CGCGTCCAAG AGCAGATCCT CTCGCCGCGG TTCGGCGGGA ACATCATCGG TGCCATCCAG GACCACATCT CCGGGACCTA CCTGCTCACG CACTCGAATC CCGAGTTCAC GGAAACGCAG GCGCTGGACC TACTGCGCGC GACCCGAGTC 
GACGAGCTGC CCGAGGCGGA CGGCGTCGAC GACGAGGGAC GCGAGTACTG GACCGGCCGG ACGCTGTTCT CGGAGCTGCT GCCCGACGAC CTCTCGTTGC ACTTCTCGTC GTCGACTGGC GATGACGTCA TCATCGAGGA CGGCCAGCTG ATCGAGGGGA CGATCGACGA GGACGCCGTC GGCGCGTTCG GCGGCGAGGT CGTCGACACC CTCACCAAGG AGTACGGCGA GACGCGCTCG CGCGTGTTCA TCAACGAGAT CGCGTCGCTG GCGATGCGCG CGATCATGCA CTTCGGGTTC TCGATCGGGA TCGACGACGA GTCGATTCCG CCGGAGGCCG AAGAGCAGGT CGACGACGCC ATCGAGAGCG CTTACGACCG CGTTCAGGAG CTGATCGCGA CGTACGAGGC CGGCGAGCTG GAGTCGCTAC CCGGCCGCGG CGTCGACGAG ACGCTGGAGA TGAAGATCAT GCAGACGCTC GGGAAGGCCC GCGACTCGGC CGGTGATATC GCCGACCAGC ACTTCGGCGA CGACAACCCG GCGGTCGTGA TGGCCCGCTC CGGCGCGCGT GGGTCGATGC TGAATCTCAC GCAGATGGCC GGCTCCGTCG GCCAGCAGGC GGTCCGCGGC GAGCGGATCA ACCGCGGCTA CGAGGATCGG ACGCTCTCCC ACTACCGGCC GAACGATCTG TCCTCTGAGG CGCACGGCTT CGTGGAGAAC TCCTACCGCA GCGGGCTCAC GCCCGAGGAG TTCTTCTTCC ACGCGATGGG CGGTCGCGAG GGGCTGGTCG ACACGGCGGT CCGGACCTCG AAGTCCGGTT ACCTCCAGCG TCGGCTCATC AACGCACTCT CCGAGCTGGA GGCGCAGTAC GACGGCTCGG TCCGGGACAC CTCGGGCCGG ATCGTCCAGT TCGAGTTCGG CGAGGACGGT ACCTCGCCGG TGAAAGTCTC CTCGGGCGAA GGCGACGGCA TCGACGTCGA CGACATCGTC GACCGCGTCG TCGACTCGGA GTTCGACTCC GACGACGAGA AGGAGCGGTT CCTCGGCGAG CGCACGCCGC CGACGAACCT CTCGGAGCAT TCTGGACCGG GCCTGAACAA GGCCTCGGGG GTGGAGTCGG ATGACTGA
|
Protein sequence | MSMQTPKVLG GIDFGLMDPE TYRDMSATKV ITADTYNDDG YPIDMGLMDP RLGVIDPGLE CRTCGSHSGS CNGHFGHIEL AAPVIHVGFT KLIRRLLRST CRECGKLALD EEQRDEFRDR YERAKELGND EHDVLKAAVR QARKASTCPF CGEPQADIKH EKPTTYYEVQ DVLSGDYSER IAAAMQPDEE EDDPGTSPQE LAEKTDIDLE RINEIMAGEF RPRKEDRRAI EKALDIDLTE EDMNKLMPSD IRDWFEDIPD EDLEVLGIDS EHSRPEWMIL TVLPVPPVTT RPSITLDNGQ RSEDDLTHKL VDIIRINQRF MENREAGAPQ LIIEDLWELL QYHVTTFVDN EISGTPPARH RSGRPLKTLS QRLKGKEGRF RGSLSGKRVN FSARTVISPD PTLSLNEVGV PDRVAMEMTQ TLNVTERNVE EARQYVRNGP EAHPGANYVR RPDGRRLKVT EKNCEELAEK VEADWEVNRH LVDGDIVIFN RQPSLHRMSI MAHEVVVMPY KTFRLNTVVC PPYNADFDGD EMNMHALQNE EARAEARVLM RVQEQILSPR FGGNIIGAIQ DHISGTYLLT HSNPEFTETQ ALDLLRATRV DELPEADGVD DEGREYWTGR TLFSELLPDD LSLHFSSSTG DDVIIEDGQL IEGTIDEDAV GAFGGEVVDT LTKEYGETRS RVFINEIASL AMRAIMHFGF SIGIDDESIP PEAEEQVDDA IESAYDRVQE LIATYEAGEL ESLPGRGVDE TLEMKIMQTL GKARDSAGDI ADQHFGDDNP AVVMARSGAR GSMLNLTQMA GSVGQQAVRG ERINRGYEDR TLSHYRPNDL SSEAHGFVEN SYRSGLTPEE FFFHAMGGRE GLVDTAVRTS KSGYLQRRLI NALSELEAQY DGSVRDTSGR IVQFEFGEDG TSPVKVSSGE GDGIDVDDIV DRVVDSEFDS DDEKERFLGE RTPPTNLSEH SGPGLNKASG VESDD
|
| |
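The sequence fields can be related to the GC content and translation table entries with a short script. This is a minimal sketch, not the database's own method: the helper names are hypothetical, whitespace between the 10-character groups is simply stripped, and the codon table is built from the amino acid assignments of NCBI translation table 11 (which match the standard code). Alternative start codons are not handled; that is not needed here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, in NCBI codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGTCAATGC AAACACCGAA AGTACTCGGC"))
```

Applied to the first thirty bases, `translate` returns `MSMQTPKVLG`, the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the reported GC content of 66%.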