Gene Moth_2468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2468 
SymbolrpoB 
ID3831202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2575143 
End bp2578568 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content56% 
IMG OID637830387 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_431293 
Protein GI83591284 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR01635] phage virion morphogenesis (putative tail completion) protein
[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00716537 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGCTTATC CAGTCAAGGT TGGGGCCAGG GAACGGTGGA CCTTTGCCAA GATCAAAGAG 
ACCCTGGAGT TACCAAACCT TATTGAAATC CAGCGTAACT CCTATCAGTG GTTTATTGAT
ACTGGACTGC GGGAGCTCTT TCACGATATC TCGCCTATTC AGGACTTCAC CGGCAACCTG
ATTCTGCAAT TTGTAGATTA TTCCCTGGGC GAGCCCAAGT ATTCAGAGGA CGAATGCAAA
GAGCGGGATA TGACCTACGC TGCCCCCCTG CGGGTCAAGG TACGCTTGAT TAACAGAGAA
ACCGGCGAGG TTAAGGAACA GGAGGTCTTC ATGGGGGATT TCCCCCTGAT GACCGATAAG
GGCACCTTTA TCATCAATGG CGCCGAGAGG GTTATTGTCA GCCAACTGGT CCGTTCGCCG
GGTGCCTATT ATGCTGAAGG TATTGATGCC AGCGGAACCA AGGTTTACGG AGCCACGGTG
ATCCCCAACC GGGGCGCCTG GCTGGAGTTT GAAACCGATA ATACGGAGAG CATTTATGTC
CGCATTGATC GTACCCGTAA AATACCGGTA ACAATCCTCC TCCGTGCCCT GGGCTACAAC
ACCAATGCCC GAATTCTTGA GCTTTTTGAT TACGACGGCC GTATCCAGAA CACCCTGGAA
AAGGATAACA CGGAATCTGA AGAAGAGGCC CTGGTTGAAA TCTATAAACG CCTGCGGCCA
GGTGAACCGC CGACGGTGGA CAGCGCCAGA AACTTGCTGG AATCCCTCTT CTTTGACCCC
AGACGTTATG ATCTGGGCAA TGTCGGCCGC TATAAGCTCC ACAAGAAATT TAATCACGGC
ATCCTGACCC GGGAGGTTGA CGGTCGGGAA GAGTACATCC GCACCTTAAC CAAAGAGGAC
ATTATCTATA CCATCAAATA TCTCCTTCGC CTCATGGATG GCGAGGTAAA ACCCGATGAT
ATCGACCACC TGGGTAACCG GCGCCTGCGT TCGGTCGGAG AACTCTTACA GAACCAGTTT
CGTATCGGTC TGGCGCGGAT GGAACGAGTG GTACGCGAAC GGATGACCAT CCAGGATGTT
GACGTTATCA CACCCCAGGT TTTAATCAAC ATTCGCCCGG TAGTGGCGGC CATTAAAGAG
TTTTTTGGGT CCAGCCAGCT CTCCCAATTC ATGGATCAGA CTAACCCCCT GGCGGAATTG
ACCCACAAAC GACGGCTGTC GGCCCTGGGA CCGGGGGGTC TTTCCCGGGA ACGGGCCGGT
TTCGAGGTCC GGGACGTCCA TACCTCCCAC TATGGGCGCA TGTGTCCCAT TGAGACCCCT
GAAGGGCCAA ATATCGGGCT AATCGGCTCT CTTAGTACCT ATGCCCGGAT CAACGAATTC
GGTTTTATTG AGACCCCTTA CCGGCGGGTA GACAAGGAAA AGGGGATTGT TACCAATCAG
GTTGATTACC TGACTGCCGA TGAAGAGGAC AAGTATTTCA TAGCTCAGGC CAATGCCCCC
CTGGATGAAG AGGGCCACTT CCTGGAGAAA AGGGTCAATG CCCGCCATGG CGGTGAAATC
CTGGTGGTCC CGGCCAGCCA GGTGGATTAC ATGGACGTCT CGCCCAAGCA GATGGTCAGT
GTAGCTACGG CCCTGATCCC CTTCCTGGAA CATGATGATG CCAACCGGGC CCTGATGGGG
GCTAACATGC AGCGCCAGGC CGTCCCCCTC CTGAGAACAG AAGCACCTAT AGTGGGGACG
GGGATGGAAT GGCGGGCAGC ACGCGATTCC GGCACTGTAG TTTTAGCCCG GAATAACGGC
GTGGTAGAAA GGGTCACTGC CCGCGAGATT GTCATCCGTA CCGACAACGG CCATTTGGAA
ACCTACCGCC TGCAGAAGTT TGTTCGTTCC AACCAGGGCA CCTGCATCAA CCAGAAACCA
ATTGTCCGCA AGGGTGAGCG GGTCAGTGCC GGCCAGACCA TTGCCGACGG TCCTTCCACT
GACCAGGGCG AACTGGCCCT GGGCCGCAAC ATCCTGGTTG CCTTCATGAC GTGGGAGGGC
TATAACTATG AAGACGCCAT CCTGATCAGC GAGAAACTGG TCAAAGACGA TATCTTCACT
TCCATTCATA TTGAAGAGTA CGAATGCGAC GCCCGTGACA CCAAGCTGGG ACCGGAAGAA
ATCACCCGGG ACATTCCCAA TGTGAGCGAG GATGTTTTAA AGGACCTGGA TGAGCGGGGC
ATTATCCGCA TTGGCGCCGA AGTGCGGCCG GGAGACATCC TGGTCGGCAA GGTTACCCCC
AAGGGCGAAA CGGAGCTGAC GGCCGAGGAA CGCCTCTTGC GGGCGATCTT CGGCGAGAAG
GCCAGGGAGG TAAGGGATAC TTCCCTGCGG GTCCCCCACG GAGAATCCGG TATTGTCGTT
GACGTCAAGG TATTCTCCCG GGAAAATGGG GACGAACTCG CACCAGGAGT CAATGAACTG
GTGCGGGTCT ACATCGCCCA GAAACGCAAG ATATCCGTGG GCGACAAGAT GGCCGGCCGT
CACGGCAACA AGGGAGTTAT TTCGCGCATC CTGCCGGAAG AGGATATGCC CTTCCTCCCC
GATGGTACGC CCATCGAGAT TGTCCTGAAC CCCCTGGGTG TGCCCTCACG TATGAACCTG
GGCCAGATCC TGGAGACCCA CCTGGGCCGG GCAGCCCGGG CTCTGGGGAT TACCGTGGCC
ACCCCCGTTT TCGACGGGGC GACGGAAGAG GAGATCAAAG AAGCGTTCCG TAAGGCCGGC
CTGCCCGAAG ACGGCAAAAC CATCCTCTAT GACGGTCGCA CCGGGGAACC ATTTGACCGG
CCCATTACAG TGGGATATAT GTATATGCTC AAGCTCGCCC ACCTGGTTGA TGATAAGATT
CACGCCCGGT CCACCGGTCC TTACTCCCTG GTCACCCAGC AACCCCTGGG CGGCAAGGCC
CAGTTCGGTG GCCAGCGCTT CGGCGAAATG GAAGTCTGGG CTCTGGAGGC CTATGGCGCT
GCCTATACCC TCCAGGAGAT TTTAACTGTG AAGTCCGATG ACGTCGTCGG CCGGGTCAAG
ACCTACGAAG CCATTGTCAA GGGCGAAAAC GTACCCGAGC CGGGAGTCCC GGAATCCTTC
AAGGTCCTGA TCAAGGAACT CCAGAGCCTG GGCCTGGACG TCAAGGTCCT CTCCGAGGAA
AACGAAGAAA TCGAGATTAA AGAGGACGAT GACGATGTGT CCGAATCCGC CCAGGAACTG
GGACTGGATG TCCATGGCCA GCCGAATACG GCGCCTGAGT CCGAAGGCGG GAATGAGGAA
GATAAAGAGG CGGAGGCGGA CGAAGATTTT GATCCGGCCG ACCTGGATCC CTCGGAACTG
GAACTCGATG CCGACTTGAA TGATGACCTG GTAGTACCTG ACGAGGCCTA TGGCGACGAA
GAATAA
 
Protein sequence
MAYPVKVGAR ERWTFAKIKE TLELPNLIEI QRNSYQWFID TGLRELFHDI SPIQDFTGNL 
ILQFVDYSLG EPKYSEDECK ERDMTYAAPL RVKVRLINRE TGEVKEQEVF MGDFPLMTDK
GTFIINGAER VIVSQLVRSP GAYYAEGIDA SGTKVYGATV IPNRGAWLEF ETDNTESIYV
RIDRTRKIPV TILLRALGYN TNARILELFD YDGRIQNTLE KDNTESEEEA LVEIYKRLRP
GEPPTVDSAR NLLESLFFDP RRYDLGNVGR YKLHKKFNHG ILTREVDGRE EYIRTLTKED
IIYTIKYLLR LMDGEVKPDD IDHLGNRRLR SVGELLQNQF RIGLARMERV VRERMTIQDV
DVITPQVLIN IRPVVAAIKE FFGSSQLSQF MDQTNPLAEL THKRRLSALG PGGLSRERAG
FEVRDVHTSH YGRMCPIETP EGPNIGLIGS LSTYARINEF GFIETPYRRV DKEKGIVTNQ
VDYLTADEED KYFIAQANAP LDEEGHFLEK RVNARHGGEI LVVPASQVDY MDVSPKQMVS
VATALIPFLE HDDANRALMG ANMQRQAVPL LRTEAPIVGT GMEWRAARDS GTVVLARNNG
VVERVTAREI VIRTDNGHLE TYRLQKFVRS NQGTCINQKP IVRKGERVSA GQTIADGPST
DQGELALGRN ILVAFMTWEG YNYEDAILIS EKLVKDDIFT SIHIEEYECD ARDTKLGPEE
ITRDIPNVSE DVLKDLDERG IIRIGAEVRP GDILVGKVTP KGETELTAEE RLLRAIFGEK
AREVRDTSLR VPHGESGIVV DVKVFSRENG DELAPGVNEL VRVYIAQKRK ISVGDKMAGR
HGNKGVISRI LPEEDMPFLP DGTPIEIVLN PLGVPSRMNL GQILETHLGR AARALGITVA
TPVFDGATEE EIKEAFRKAG LPEDGKTILY DGRTGEPFDR PITVGYMYML KLAHLVDDKI
HARSTGPYSL VTQQPLGGKA QFGGQRFGEM EVWALEAYGA AYTLQEILTV KSDDVVGRVK
TYEAIVKGEN VPEPGVPESF KVLIKELQSL GLDVKVLSEE NEEIEIKEDD DDVSESAQEL
GLDVHGQPNT APESEGGNEE DKEAEADEDF DPADLDPSEL ELDADLNDDL VVPDEAYGDE
E