Gene Hoch_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4016 
Symbol 
ID8546412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5514080 
End bp5518291 
Gene Length4212 bp 
Protein Length1403 aa 
Translation table11 
GC content66% 
IMG OID646388688 
ProductDNA-directed RNA polymerase, beta subunit 
Protein accessionYP_003268408 
Protein GI262197199 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.303023 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCGG TAATCCAGAA CAACTTCCGG GTCCGGAAGA GCTTTGCCAA GCTCAAGAAG 
GTCATCGATA TCCCGAACCT GATCGATATC CAGAAGCGGT CCTACGATAA GTTCCTCCAG
ATCGATATCC CTGCCGAGGA GCGGGAGGAT GTCGGCCTCC AGGGGGTGTT CAAGAGCGTA
TTCCCCATCA AGGACTTCTC CGAGACGTCT TCGCTGGAGT TCGTCTCGTA CAACCTCGAG
CGTCCCAAGT ACGACGTCGA CGAGTGTCGG GCCCGCGGGA TGACCTTCGC CGCGCCGGTG
AAGGTCGTGA TCCGCTTGGT GGTGTGGGAC GTCAACGAAG AGACCGGCGT GCAGTCGATC
CGCGACGTCA AAGAGCAGGA GGTCTACTTC GGCGAGATCC CGCTCATGAC CGACAGCGGT
ACCTTCATCA TCAACGGTAC CGAGCGCGTC ATCGTCTCGC AGCTGCACCG CTCGCCCGGT
GTGTTCTTCG ACCACGATAA GGGCAAGACC CACTCGTCGG GCAAGCTGCT GTACAGCGCC
CGGGTCATCC CGTATCGCGG CTCGTGGCTC GACTTTGAGT TCGACCACAA GGACATCCTC
TACGTCCGCA TCGATCGCCG GCGCAAGCTG TACGCCACCG TGCTGCTGCG CGCGCTCGGC
TACTCGACCG AGGATCTGCT CAACTACTTC TACGACACCG AGGTGATCCA CATCGAGGGG
CCGCAGAAGT TCTCGCGGAC CATCAACTAC GACCTGCTGC TCGGCCAGCG CGCGACCCGC
GACATCCGTC ACCCGGACAG CCGCGAGATC CTGGTGCGCA AGAACCGCAA GTTCACGCGC
GCCGCGATCC GCAAGCTGCG CGACTCGGAC ATCGAGAAGC TCACGATCGA CCTCGAGGAG
CTCGTCGGCA AGGTGTCGGC GCGCGACATC ATCGATGAGA GCACCGGCGA GGTGCTGCTG
CAGTGCAACG AGGAGCTGAG CGAGGAGAAG CTCGAGGAGC TGCGCACGCG CGGCGTCGAG
CGCTTCGATG TGCTGTTCAT CGACAACCTC AACGTCGGCC CGTACCTGCG CACGACCCTG
CTCGCCGACA AGCTGCAGGG CCCGGAAGAG GCGATCATGG AGATCTACCG GCGCCTCCGC
CCGGGTGATC CGCCGACCAT CGACACCGCG CAGAACCTGT TCCAGAACCT GTTCTTCAAC
CCCGAGCGCT ACGACCTGTC GCAGGTCGGC CGGCTCAAGC TCAACTACAA GTTCCGCCTC
GACGAGTCGC TCGACAACCC GGTGCTCACC CGGCGCGACA TCCTGGAGAC GGTGCGCTAC
CTCATCGAGC TGCGCAACGG CCGCGGCATC ATCGACGATA TCGACCATCT CGGTAACCGT
CGCGTGCGCG CCGTCGGCGA GCTGATGGAG AACCAGTACC GCATCGGTCT GGTGCGCATG
GAGCGCGCGA TCAAGGAGCG CATGAGCATG TCTCAGGAGA TCGAGACGCT CATGCCGCAC
GACCTGATCA ACGCCAAGCC GGTGTCGGCC GTGGTCAAGG AGTACTTCGG CAGCTCGCAG
CTGTCGCAGT TCATGGACCA GACCAACCCG CTGTCCGAGG TGACGCACAA GCGCCGCCTG
TCGGCGCTCG GCCCCGGCGG TCTCACGCGC GAGCGCGCCG GCTTCGAGGT GCGCGACGTG
CACCCGACGC ACTACGGCCG CATCTGCCCG ATCGAGACGC CGGAAGGTCC CAACATCGGC
CTCATCGCCT CGCTCTCGAC CTATGCGCGG GTCAACCAGT ACGGCTTCAT CGAGACGCCG
TATCGCCGCG TGAACGACGG CAAGGTGACC GAGGAGGTGC AGTTCTACTC GGCCCTCCAG
GAGGAGGGGC AGGTCATCGC CCAGGCCAAC GCGGCCCACG ACCCGAGCGG CGCCTTCAGC
GAGGACTTCG TGTCCTGCCG CCGCGCCGGC GACGTGTCGA TGGTGCGTCC CGAGGACGTC
ACCCTGATGG ACGTCTCGCC CAACCAGCTC GTGTCCGTGG CCGCGTCGCT GATTCCCTTC
CTCGAGCACG ACGACGCCAA CCGCGCGCTC ATGGGATCGA ACATGCAGCG GCAGGCGGTG
CCCCTGGTGC GCACGGCCGC GCCGCTGGTC GGCACCGGCA TCGAGAACAT CGTCGCCCGC
GACTCGGGCG TCACCGTGGT CGCCAAGCGC GACGGCGTGG TCGAGTCGGT GGACGGCGCG
CGCATCGTGA TCAAGCCCTT CGAGACCGAC GGGGAAGATT CGCTCGGCGC CAAGCCCGAC
ATCTACAACC TGGTCAAGTT CCAGCGCAGC AACCAGAACA CCTGCAGCAA CCAGAAGCCC
ATCGTGCGCC GCGGCGATAC GGTGCGCGTC GGCGACGTCA TCGCCGACGG TCCGGCGACC
GAGTGCGGTG AGCTGGCGCT GGGCCAGAAC ACGGTGGTCG CGTTCATGCC GTGGGGTGGC
TACAACTTCG AGGACTCGAT CCTCGTCAAC GAGCGCCTGG TCAAGAACGA CACCTTCACC
TCGGTGCACA TCGAGGAGTT CGAGTGCGTC GCGCGCGACA CCAAGCTCGG CAAGGAAGAG
ATCACGCGCG ACATCCCCAA CGTCGGCGAG GAGGCGCTCA AGGACCTCGA CGACTCGGGC
ATCGTGCGCA TCGGCGCCGA GGTCAAGGCC GGCGACATCC TGGTCGGCAA GATCACGCCC
AAGGGCGAGA CCCAGCTCTC GCCCGAGGAG AAGCTGCTCC GCGCGATCTT CGGCGAGAAG
GCCGGCGACG TCCGCGACAC CTCGCTGCGC GTGCCCCCGG GCGTGAGCGG TGTGGTCATC
AACGCCCGCG TGTTCGCGCG CAAGGGCACC GAGAAGGACG ACCGCGCCAA GGACATCGAG
GACGCCGAGA AGGAGAAGCT GCTGCTCAAC AAGCAGACCG AGATCAAGAT CATCTCGGAC
TCCTACTACG GCAAGATGCG CAAGCTGCTG GTCGGCAAGA CCACGGCCGC GCGTCTGGTC
GACGACAAGG GCAAGGTGCT GCTGCCCAAG GGCCAGAAGA TCGACGCCGC CGCGCTCGAC
CAGGTGCCCG CGCGCTACTG GCACGAGGTC CAGGCCGAGG GTGACACCAA GGTCGAGGAG
TCGCTCGAGA AGCTGGCCGC GCAGCGCGAA GAGGACGTGC GCCTCATCGA GGAGCAGTAC
GACGAGAAGA TCGGCAAGCT GACCAAGGGC GACGAGCTGC CTCCGGGCGT GATCAAGCTG
GTCAAGGTCT ACCTGGCCAT CAAGCGCAAG CTCTCGGTGG GTGACAAGAT GGCCGGTCGC
CACGGCAACA AGGGTGTGGT CTCGCGCCTG TTGCCCGAGG AGGACATGCC GTACCTGTCC
GACGGGACGC CCGTGGACAT CGTGCTCAAC CCGCTCGGTG TGCCCTCGCG TATGAACGTC
GGCCAGATCC TCGAGACCCA CCTGGGCTGG GCGGCGCGCG AGATCGGCCG CCAGATCGAC
ATGTACATGG AGACCTCCTG GTCGGCCGAT GTGCTGCGCG AGAAGCTCAA GAAGGTCTTC
AACACCGCCC AGGCGCACGA GTTTCTCGAC CGCCTGGACA ACGAAGATAT CGGACGCTTC
GCCACCAAGC TGCGCAAGGG CATCCACTTC GCGACGCCGG TCTTCGACGG CGCCGCCGAG
GACGAGATCA AGGCGGCCCT CAACATGGCC GGCATGCGCC CGAGCGGCCA GTCGCAGCTG
TGCGACGGCA AATCCGGCGA GCCCTTCGAC AACCCGGTGA CCGTGGGCGT GATGTACATG
CTCAAGCTGC ACCACCTGGT GGACGACAAG ATCCACGCGC GCAGCATCGG TCCGTACTCG
CTGGTTACGC AGCAGCCGCT GGGCGGCAAG GCCCAGTTCG GCGGTCAGCG TCTCGGCGAG
ATGGAAGTCT GGGCCATGGA GGCCTACGGC GCGGCCTACG CGCTGCAGGA GTTCCTCACC
GTCAAGAGCG ACGACGTGCT CGGCCGTACC CGCATGTACG AGTCGATCGT CAAGGGCGAG
CACGTGCTCG AGGCCGGCTT GCCGGAGTCG TTCAACGTGC TGCTCAAAGA GCTTCAGTCG
CTGTGTCTCG ACGTCGAGCT CATCGAGGAT CCCTCGGCTC CGCGCAAGCA GGAGCACGCG
GGCCCCGGTG TGCCGGCCGG TCTCGCCGCG CTGGCGCGCG AGGTCGCTGA GAAGGTGGGC
GGCGCCCAGT AG
 
Protein sequence
MASVIQNNFR VRKSFAKLKK VIDIPNLIDI QKRSYDKFLQ IDIPAEERED VGLQGVFKSV 
FPIKDFSETS SLEFVSYNLE RPKYDVDECR ARGMTFAAPV KVVIRLVVWD VNEETGVQSI
RDVKEQEVYF GEIPLMTDSG TFIINGTERV IVSQLHRSPG VFFDHDKGKT HSSGKLLYSA
RVIPYRGSWL DFEFDHKDIL YVRIDRRRKL YATVLLRALG YSTEDLLNYF YDTEVIHIEG
PQKFSRTINY DLLLGQRATR DIRHPDSREI LVRKNRKFTR AAIRKLRDSD IEKLTIDLEE
LVGKVSARDI IDESTGEVLL QCNEELSEEK LEELRTRGVE RFDVLFIDNL NVGPYLRTTL
LADKLQGPEE AIMEIYRRLR PGDPPTIDTA QNLFQNLFFN PERYDLSQVG RLKLNYKFRL
DESLDNPVLT RRDILETVRY LIELRNGRGI IDDIDHLGNR RVRAVGELME NQYRIGLVRM
ERAIKERMSM SQEIETLMPH DLINAKPVSA VVKEYFGSSQ LSQFMDQTNP LSEVTHKRRL
SALGPGGLTR ERAGFEVRDV HPTHYGRICP IETPEGPNIG LIASLSTYAR VNQYGFIETP
YRRVNDGKVT EEVQFYSALQ EEGQVIAQAN AAHDPSGAFS EDFVSCRRAG DVSMVRPEDV
TLMDVSPNQL VSVAASLIPF LEHDDANRAL MGSNMQRQAV PLVRTAAPLV GTGIENIVAR
DSGVTVVAKR DGVVESVDGA RIVIKPFETD GEDSLGAKPD IYNLVKFQRS NQNTCSNQKP
IVRRGDTVRV GDVIADGPAT ECGELALGQN TVVAFMPWGG YNFEDSILVN ERLVKNDTFT
SVHIEEFECV ARDTKLGKEE ITRDIPNVGE EALKDLDDSG IVRIGAEVKA GDILVGKITP
KGETQLSPEE KLLRAIFGEK AGDVRDTSLR VPPGVSGVVI NARVFARKGT EKDDRAKDIE
DAEKEKLLLN KQTEIKIISD SYYGKMRKLL VGKTTAARLV DDKGKVLLPK GQKIDAAALD
QVPARYWHEV QAEGDTKVEE SLEKLAAQRE EDVRLIEEQY DEKIGKLTKG DELPPGVIKL
VKVYLAIKRK LSVGDKMAGR HGNKGVVSRL LPEEDMPYLS DGTPVDIVLN PLGVPSRMNV
GQILETHLGW AAREIGRQID MYMETSWSAD VLREKLKKVF NTAQAHEFLD RLDNEDIGRF
ATKLRKGIHF ATPVFDGAAE DEIKAALNMA GMRPSGQSQL CDGKSGEPFD NPVTVGVMYM
LKLHHLVDDK IHARSIGPYS LVTQQPLGGK AQFGGQRLGE MEVWAMEAYG AAYALQEFLT
VKSDDVLGRT RMYESIVKGE HVLEAGLPES FNVLLKELQS LCLDVELIED PSAPRKQEHA
GPGVPAGLAA LAREVAEKVG GAQ