Gene Information |
Locus tag | Hmuk_0513 |
Symbol | |
ID | 8410013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 483070 |
End bp | 485988 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645018837 |
Product | DNA-directed RNA polymerase subunit A' |
Protein accession | YP_003176354 |
Protein GI | 257386581 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02390] DNA-directed RNA polymerase subunit A' |
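The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the reported gene length matches that convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 483070
END_BP = 485988


def gene_length_bp(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 2919
print(protein_length_aa(gene_len))  # 972
```

Both results agree with the "Gene Length" and "Protein Length" rows.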
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTGCAC AACAGACACC CAAATCGATC AGTTCCATTA GCTTCGGGCT GATGGATCCC GAGGAGTACC GCGACATGAG CGCCACGAAG GTCATCACGG CCGACACCTA CGACGACGAC GGGTTCCCGA TCGACATGGG ACTGATGGAC CCGCGTCTGG GCGTGATCGA CCCCGGCCTG GAGTGTAAGA CCTGTGGCAA GCACAGTGGG TCGTGTAACG GCCACTTCGG CCACATCGAA CTCGCAGCGC CCGTCATCCA CGTCGGGTTC AACAAGCTCA TCCGACGCCT GCTGCGCGGG ACCTGTCGGG AGTGCTCGCG GCTCTGTCTC GACGAACGCG AACGCGAGGA GTTCGAGGAC CGCCTCGAAC GATCCGAGGA GCTGGGCAAG GACGTCAACG ACGTGACCAA GGCCGCGATC CGGCAGGCCC GCAAGAAGGA CCGCTGTCCG TTCTGTGGCG AGCCCCAGTA CGACATCAAA CACGAGAAGC CGACGACCTA CTACGAGGTC CAGGACGTGC TCTCCTCGGA GTACTCCCAG CGGATCGTCG CCGCGATGGA GCCCGACGAG GAGGCCGGCC GCGAACGGAC CACGCCCTCG GAACTGGCCG ACGAGACGGG TATCGATCCC TCTCGCGTCC AGGAGATCAT CTCCGGAGAG TTCCGACCCC GACAGGACGA CCGTCGGGCC ATCGAGTCGG CGCTGGACGT GGACCTCACC GAGGAAGACG AGAACAAGCT GATGCCCAGC GACATCCGGG ACTGGTTCGA GGACATCCCG GACGAGGACG TCGAGGTGCT GGGCATGAAG CCCGCCCGTT CTCGCCCCGA GTGGATGATT CTCACCGTCC TGCCCGTGCC GCCGGTCACC GCGCGACCGT CGATCACGCT GGACAACGGC CAGCGCTCGG AGGACGACCT GACCCACAAG CTGGTCGACA TCATCCGGAT CAACCAGCGG TTCATGGAGA ACCGCGAGGC AGGCGCGCCC CAGCTGATCA TCGAGGACCT CTGGGAACTG CTGCAGTACC ACGTCACCAC GTTCATGGAC AACGAGATCA GCGGCACGCC GCCGGCCCGA CACCGCTCTG GCCGGCCGCT GAAGACCCTC TCTCAGCGCC TGAAAGGCAA GGAAGGTCGC TTCCGTGGCT CGCTGTCCGG GAAGCGCGTG AACTTCTCGG CCCGGACGGT CATCTCCCCG GACCCGACCC TGTCGCTCAA CGAGGTCGGC GTCCCCGACC GCGTGGCCAA GGAGATGACC CAGACGATGA ACGTCAACGA GCGCAACCTC GACAACGCTC GTCGCTACGT CGCCAACGGG CCCGAGGCCC ACCCCGGTGC CAACTACGTC AAGCGGCCCG ACGGTCGCCG ACTGAAGGTC ACCGAGAAGA ACTGCGAGGA GTTGGCCGAG AAGGTCGAAC CGGGCTGGGA AGTGTCCCGT CACATGATCG ACGGCGACAT CATCATCTTC AACCGCCAGC CGTCGCTGCA CCGGATGTCC ATCATGGCCC ACGAGGTCGT GGTGATGCCC TACAAGACGT TCCGGCTCAA CACGACGGTC TGTCCGCCGT ACAACGCGGA CTTCGACGGC GACGAGATGA ACATGCACGC CCTCCAGAAC GAGGAGGCCC GTGCCGAGGC TCGCGTGCTG ATGCGCGTCC AGGAGCAGAT CCTCAGCCCG CGCTTCGGTG AGAACATCAT CGGCGCGATT CAGGACCACA TCAGTGGGAC CTATCTGCTG ACTCACACTA ATCCGGAGTT CAACGAGACC CAGGCACTGG ACCTGCTGCG TGCGACCCGG 
ATCGACGAGC TCCCGCCCGA ATCCGGCGTC GACGACGACG GCACGCCCTA CTGGACGGGT CGCGACATCT TCTCGGAGCT GTTGCCCGAC GACCTCGATC TGACGTTCAC CAGCGAGGCC GGCGACGACG TGATCATCGA GGACGGGCAG CTCGTCGAGG GGACCATCGA CGACAGCGCG GTCGGTGGGT TCGGCGGCGA GATCGTCGAC ACTATCACCA AGGAGTACTC CAACACCCGA GCGCGCATCT TCATCAACGA GATCGCCGCG CTGGCGATGC GTTCGATCAT GCACTTCGGG TTCTCGATCG GGATCGACGA CGAGTCCGTT CCCGAGGCCG CGGAGGCCCA GATCACCGAG GCCATCGACA ACGCCTACGA CCGCGTCGAG GAACTCATCG AGACCTACGA TCGGGGCGAA CTGGAGTCGC TGCCCGGTCG GACGGTCGAC GAGACCCTCG AAATGAAGAT CATGCAGACG CTCGGCAAGG CCCGTGACTC CGCCGGTGAC ATCGCCGGAG ACCACTTCGA CGACGACAAC CCGGCTGTCG TGATGGCCGA GTCCGGCGCG CGTGGGTCGA TGCTCAACCT GACCCAGATG GCCGGCTGTG TCGGCCAGCA GGCGGTCCGT GGCGAGCGGA TCAACCGCGG CTACGAGGAC CGCACCCTGT CACACTTCAA GCAGAACGAC CTCTCCTCGG ACGCCCACGG CTTCGTGGAG GCGTCGTACC GCTCCGGACT CAACCCCAAG GAGTTCTTCT TCCACGCGAT GGGTGGCCGC GAGGGGCTGG TCGACACGGC GGTCCGGACC TCCAAGTCCG GTTACCTCCA GCGCCGTCTG ATCAACGCGC TCTCGGAACT GGAGGCCCAG TACGACGGCA CCGTCCGGGA CACCTCGGAC AACATCGTCC AGTTCGAGTT CGGCGAAGAC GGCACCTCGC CGGTACAGGT CTCCTCGAAC GAGGAGACGC CGGTCGACGT CGAGGAGATC GCGGACAGCG TTCTCGACGC CGAGTTCGAG TCCGAGAGCA TCAAAAGCGA GTTCCTCGGT GAACGCACCG AGCCGACCAA TCTGTCGGAA CACACCGACG ACTGGTGGAT GGCGGAGGGT GACGACTGA
Protein sequence | MSAQQTPKSI SSISFGLMDP EEYRDMSATK VITADTYDDD GFPIDMGLMD PRLGVIDPGL ECKTCGKHSG SCNGHFGHIE LAAPVIHVGF NKLIRRLLRG TCRECSRLCL DEREREEFED RLERSEELGK DVNDVTKAAI RQARKKDRCP FCGEPQYDIK HEKPTTYYEV QDVLSSEYSQ RIVAAMEPDE EAGRERTTPS ELADETGIDP SRVQEIISGE FRPRQDDRRA IESALDVDLT EEDENKLMPS DIRDWFEDIP DEDVEVLGMK PARSRPEWMI LTVLPVPPVT ARPSITLDNG QRSEDDLTHK LVDIIRINQR FMENREAGAP QLIIEDLWEL LQYHVTTFMD NEISGTPPAR HRSGRPLKTL SQRLKGKEGR FRGSLSGKRV NFSARTVISP DPTLSLNEVG VPDRVAKEMT QTMNVNERNL DNARRYVANG PEAHPGANYV KRPDGRRLKV TEKNCEELAE KVEPGWEVSR HMIDGDIIIF NRQPSLHRMS IMAHEVVVMP YKTFRLNTTV CPPYNADFDG DEMNMHALQN EEARAEARVL MRVQEQILSP RFGENIIGAI QDHISGTYLL THTNPEFNET QALDLLRATR IDELPPESGV DDDGTPYWTG RDIFSELLPD DLDLTFTSEA GDDVIIEDGQ LVEGTIDDSA VGGFGGEIVD TITKEYSNTR ARIFINEIAA LAMRSIMHFG FSIGIDDESV PEAAEAQITE AIDNAYDRVE ELIETYDRGE LESLPGRTVD ETLEMKIMQT LGKARDSAGD IAGDHFDDDN PAVVMAESGA RGSMLNLTQM AGCVGQQAVR GERINRGYED RTLSHFKQND LSSDAHGFVE ASYRSGLNPK EFFFHAMGGR EGLVDTAVRT SKSGYLQRRL INALSELEAQ YDGTVRDTSD NIVQFEFGED GTSPVQVSSN EETPVDVEEI ADSVLDAEFE SESIKSEFLG ERTEPTNLSE HTDDWWMAEG DD
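The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, which assigns the same amino acids as translation table 11 (table 11 differs only in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The sequences are stored in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are the same
# as in translation table 11 (only the permitted start codons differ).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last codons of the gene sequence above.
print(translate("ATGAGTGCAC AACAGACA"))  # MSAQQT
print(translate("GACGACTGA"))            # DD
```

The two outputs match the first six and last two residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the "GC content" row.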