Gene Hmuk_0513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_0513 
Symbol 
ID8410013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp483070 
End bp485988 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content65% 
IMG OID645018837 
ProductDNA-directed RNA polymerase subunit A' 
Protein accessionYP_003176354 
Protein GI257386581 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02390] DNA-directed RNA polymerase subunit A' 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAC AACAGACACC CAAATCGATC AGTTCCATTA GCTTCGGGCT GATGGATCCC 
GAGGAGTACC GCGACATGAG CGCCACGAAG GTCATCACGG CCGACACCTA CGACGACGAC
GGGTTCCCGA TCGACATGGG ACTGATGGAC CCGCGTCTGG GCGTGATCGA CCCCGGCCTG
GAGTGTAAGA CCTGTGGCAA GCACAGTGGG TCGTGTAACG GCCACTTCGG CCACATCGAA
CTCGCAGCGC CCGTCATCCA CGTCGGGTTC AACAAGCTCA TCCGACGCCT GCTGCGCGGG
ACCTGTCGGG AGTGCTCGCG GCTCTGTCTC GACGAACGCG AACGCGAGGA GTTCGAGGAC
CGCCTCGAAC GATCCGAGGA GCTGGGCAAG GACGTCAACG ACGTGACCAA GGCCGCGATC
CGGCAGGCCC GCAAGAAGGA CCGCTGTCCG TTCTGTGGCG AGCCCCAGTA CGACATCAAA
CACGAGAAGC CGACGACCTA CTACGAGGTC CAGGACGTGC TCTCCTCGGA GTACTCCCAG
CGGATCGTCG CCGCGATGGA GCCCGACGAG GAGGCCGGCC GCGAACGGAC CACGCCCTCG
GAACTGGCCG ACGAGACGGG TATCGATCCC TCTCGCGTCC AGGAGATCAT CTCCGGAGAG
TTCCGACCCC GACAGGACGA CCGTCGGGCC ATCGAGTCGG CGCTGGACGT GGACCTCACC
GAGGAAGACG AGAACAAGCT GATGCCCAGC GACATCCGGG ACTGGTTCGA GGACATCCCG
GACGAGGACG TCGAGGTGCT GGGCATGAAG CCCGCCCGTT CTCGCCCCGA GTGGATGATT
CTCACCGTCC TGCCCGTGCC GCCGGTCACC GCGCGACCGT CGATCACGCT GGACAACGGC
CAGCGCTCGG AGGACGACCT GACCCACAAG CTGGTCGACA TCATCCGGAT CAACCAGCGG
TTCATGGAGA ACCGCGAGGC AGGCGCGCCC CAGCTGATCA TCGAGGACCT CTGGGAACTG
CTGCAGTACC ACGTCACCAC GTTCATGGAC AACGAGATCA GCGGCACGCC GCCGGCCCGA
CACCGCTCTG GCCGGCCGCT GAAGACCCTC TCTCAGCGCC TGAAAGGCAA GGAAGGTCGC
TTCCGTGGCT CGCTGTCCGG GAAGCGCGTG AACTTCTCGG CCCGGACGGT CATCTCCCCG
GACCCGACCC TGTCGCTCAA CGAGGTCGGC GTCCCCGACC GCGTGGCCAA GGAGATGACC
CAGACGATGA ACGTCAACGA GCGCAACCTC GACAACGCTC GTCGCTACGT CGCCAACGGG
CCCGAGGCCC ACCCCGGTGC CAACTACGTC AAGCGGCCCG ACGGTCGCCG ACTGAAGGTC
ACCGAGAAGA ACTGCGAGGA GTTGGCCGAG AAGGTCGAAC CGGGCTGGGA AGTGTCCCGT
CACATGATCG ACGGCGACAT CATCATCTTC AACCGCCAGC CGTCGCTGCA CCGGATGTCC
ATCATGGCCC ACGAGGTCGT GGTGATGCCC TACAAGACGT TCCGGCTCAA CACGACGGTC
TGTCCGCCGT ACAACGCGGA CTTCGACGGC GACGAGATGA ACATGCACGC CCTCCAGAAC
GAGGAGGCCC GTGCCGAGGC TCGCGTGCTG ATGCGCGTCC AGGAGCAGAT CCTCAGCCCG
CGCTTCGGTG AGAACATCAT CGGCGCGATT CAGGACCACA TCAGTGGGAC CTATCTGCTG
ACTCACACTA ATCCGGAGTT CAACGAGACC CAGGCACTGG ACCTGCTGCG TGCGACCCGG
ATCGACGAGC TCCCGCCCGA ATCCGGCGTC GACGACGACG GCACGCCCTA CTGGACGGGT
CGCGACATCT TCTCGGAGCT GTTGCCCGAC GACCTCGATC TGACGTTCAC CAGCGAGGCC
GGCGACGACG TGATCATCGA GGACGGGCAG CTCGTCGAGG GGACCATCGA CGACAGCGCG
GTCGGTGGGT TCGGCGGCGA GATCGTCGAC ACTATCACCA AGGAGTACTC CAACACCCGA
GCGCGCATCT TCATCAACGA GATCGCCGCG CTGGCGATGC GTTCGATCAT GCACTTCGGG
TTCTCGATCG GGATCGACGA CGAGTCCGTT CCCGAGGCCG CGGAGGCCCA GATCACCGAG
GCCATCGACA ACGCCTACGA CCGCGTCGAG GAACTCATCG AGACCTACGA TCGGGGCGAA
CTGGAGTCGC TGCCCGGTCG GACGGTCGAC GAGACCCTCG AAATGAAGAT CATGCAGACG
CTCGGCAAGG CCCGTGACTC CGCCGGTGAC ATCGCCGGAG ACCACTTCGA CGACGACAAC
CCGGCTGTCG TGATGGCCGA GTCCGGCGCG CGTGGGTCGA TGCTCAACCT GACCCAGATG
GCCGGCTGTG TCGGCCAGCA GGCGGTCCGT GGCGAGCGGA TCAACCGCGG CTACGAGGAC
CGCACCCTGT CACACTTCAA GCAGAACGAC CTCTCCTCGG ACGCCCACGG CTTCGTGGAG
GCGTCGTACC GCTCCGGACT CAACCCCAAG GAGTTCTTCT TCCACGCGAT GGGTGGCCGC
GAGGGGCTGG TCGACACGGC GGTCCGGACC TCCAAGTCCG GTTACCTCCA GCGCCGTCTG
ATCAACGCGC TCTCGGAACT GGAGGCCCAG TACGACGGCA CCGTCCGGGA CACCTCGGAC
AACATCGTCC AGTTCGAGTT CGGCGAAGAC GGCACCTCGC CGGTACAGGT CTCCTCGAAC
GAGGAGACGC CGGTCGACGT CGAGGAGATC GCGGACAGCG TTCTCGACGC CGAGTTCGAG
TCCGAGAGCA TCAAAAGCGA GTTCCTCGGT GAACGCACCG AGCCGACCAA TCTGTCGGAA
CACACCGACG ACTGGTGGAT GGCGGAGGGT GACGACTGA
 
Protein sequence
MSAQQTPKSI SSISFGLMDP EEYRDMSATK VITADTYDDD GFPIDMGLMD PRLGVIDPGL 
ECKTCGKHSG SCNGHFGHIE LAAPVIHVGF NKLIRRLLRG TCRECSRLCL DEREREEFED
RLERSEELGK DVNDVTKAAI RQARKKDRCP FCGEPQYDIK HEKPTTYYEV QDVLSSEYSQ
RIVAAMEPDE EAGRERTTPS ELADETGIDP SRVQEIISGE FRPRQDDRRA IESALDVDLT
EEDENKLMPS DIRDWFEDIP DEDVEVLGMK PARSRPEWMI LTVLPVPPVT ARPSITLDNG
QRSEDDLTHK LVDIIRINQR FMENREAGAP QLIIEDLWEL LQYHVTTFMD NEISGTPPAR
HRSGRPLKTL SQRLKGKEGR FRGSLSGKRV NFSARTVISP DPTLSLNEVG VPDRVAKEMT
QTMNVNERNL DNARRYVANG PEAHPGANYV KRPDGRRLKV TEKNCEELAE KVEPGWEVSR
HMIDGDIIIF NRQPSLHRMS IMAHEVVVMP YKTFRLNTTV CPPYNADFDG DEMNMHALQN
EEARAEARVL MRVQEQILSP RFGENIIGAI QDHISGTYLL THTNPEFNET QALDLLRATR
IDELPPESGV DDDGTPYWTG RDIFSELLPD DLDLTFTSEA GDDVIIEDGQ LVEGTIDDSA
VGGFGGEIVD TITKEYSNTR ARIFINEIAA LAMRSIMHFG FSIGIDDESV PEAAEAQITE
AIDNAYDRVE ELIETYDRGE LESLPGRTVD ETLEMKIMQT LGKARDSAGD IAGDHFDDDN
PAVVMAESGA RGSMLNLTQM AGCVGQQAVR GERINRGYED RTLSHFKQND LSSDAHGFVE
ASYRSGLNPK EFFFHAMGGR EGLVDTAVRT SKSGYLQRRL INALSELEAQ YDGTVRDTSD
NIVQFEFGED GTSPVQVSSN EETPVDVEEI ADSVLDAEFE SESIKSEFLG ERTEPTNLSE
HTDDWWMAEG DD