Gene Moth_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0003 
Symbol 
ID3831313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1894 
End bp3018 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content50% 
IMG OID637827930 
ProductDNA polymerase III, beta subunit 
Protein accessionYP_428886 
Protein GI83588877 
COG category[L] Replication, recombination and repair 
COG ID[COG0592] DNA polymerase sliding clamp subunit (PCNA homolog) 
TIGRFAM ID[TIGR00663] DNA polymerase III, beta subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00459981 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000116577 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATATCC TTTGTCCTCA ACCCCAACTT GTTAATGCTG TGCAAAAGGT ATACCGGGCG 
GTAGCCACAA CGACAACCTA TCACGCTATT ACCGGGATTC TATTGCAGGC CCATGAAAAT
ACCTTGACCC TCCAGGGTAC CGATCTTGAT CTGGGAATTA TTTATACCTT TCCTGTTGAG
GTTATCGAAG AAGGCGAGCT CTTACTGCCG GCACGTATCT TTACCGAGAT GGTCCGGCGC
CTGCCGCCTA CCTCCCTTTC TTTACAGAGT TTACAGGATA ACACCGTGGA GATCGCTTAC
CAGCAGTCCA AAGTCCAACT TAACAGCATT GACGCCAGCC AGTTTCCGCT CCTGCCGCCG
GTAGAAGGTA ACTTCTCCTT TACAGTGGCC ATTACCGCCC TCAAGGATGC CATCCGTAAG
GTAACAATTG CCGCCGGTAA TGACGACCTG CGCAGCATTT TCAATGGTGT TCTCTGGGAA
TTAGAACCCG GGGAAAACAG GTTTAACCTG GTGGCCACCG ATACCCATCG TCTGGCTGTC
TACCACGGCC AACCAGAAGA TTCCACGAGT AACGAAACGG CTACCGCCCT GGTACCATGC
CGGGCTATGA ATGAACTGGC GCGTTTACTC CCCGGAGAAG ATGGTTTAGT AAAAATAACC
ATCGGTGAAA GTCAGATCTA CGCCCAGCAC GAGGGCTTAA CGTTATACAC CCGATTATTG
AATGGTAAAT TTCCTCATTA CCAGCAAGTT ATCCCAACTG ATCATATAAC TACCATAGAA
ATAGCCACCC GGGATCTCCT GGACACCGTT GAACGGGCTA CCTTACTGGC CCGGGATGAG
AATAAAGCCA GGGCCCATAT TATTATTTTG CAGGTAGGGG AAAAATCTTT AAAAATAACC
AGTGAAGCTG CCGAGATAGG CCACCTGGAA GAGGAGTTAA CGGCAGAAAT AGCAGGACAA
CCCCTGGAAC TAGCTTTGAA CGGGCGCTAC CTGCTGGAAA CCCTGCGGGT AATTGATACC
GAAAACGTAA TTCTGGAACT CCTGGCCCCG TTGAAACCCG TTGTTGTCAG GCCGGCCGGC
CAGGAAAACT ACTTCTGCCT TATCCTACCG GTCAGGATTG GCTAA
 
Protein sequence
MHILCPQPQL VNAVQKVYRA VATTTTYHAI TGILLQAHEN TLTLQGTDLD LGIIYTFPVE 
VIEEGELLLP ARIFTEMVRR LPPTSLSLQS LQDNTVEIAY QQSKVQLNSI DASQFPLLPP
VEGNFSFTVA ITALKDAIRK VTIAAGNDDL RSIFNGVLWE LEPGENRFNL VATDTHRLAV
YHGQPEDSTS NETATALVPC RAMNELARLL PGEDGLVKIT IGESQIYAQH EGLTLYTRLL
NGKFPHYQQV IPTDHITTIE IATRDLLDTV ERATLLARDE NKARAHIIIL QVGEKSLKIT
SEAAEIGHLE EELTAEIAGQ PLELALNGRY LLETLRVIDT ENVILELLAP LKPVVVRPAG
QENYFCLILP VRIG