Gene Moth_0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0134 
Symbol 
ID3830791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp128807 
End bp130141 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content62% 
IMG OID637828068 
Productprimary replicative DNA helicase 
Protein accessionYP_429016 
Protein GI83589007 
COG category[L] Replication, recombination and repair 
COG ID[COG0305] Replicative DNA helicase 
TIGRFAM ID[TIGR00665] replicative DNA helicase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.132489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.322197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCAG AAATAGAGAG GGTACCGCCC CAGAGTATAG AGGCCGAGCA ATCGGTTCTG 
GGGGCTATCA TGCTGGACCG GGAAGCCCTC TACGCTGTCC TGGAAACCCT GAAGGTAGAC
GATTTTTACC GGGAAGCCCA CCGCATGATC TATCGGGCCA TCCTGGACCT GAATGAGCGG
GGCGAGGCCG TCGACCTGCT GACGGTGACG GAAGAACTCC GTCGCCGGGG TGAACTGGAG
GCAGCCGGCG GTGTCGCCTA CCTCACTTCC CTGACCGGGG ATGTCCCCAG CGTCGCCAAT
GCCGGATATT ATGCCCGCCT GGTGGCTGAG AAGGCTGCCC TGCGCTCCCT CGTCCAGGCA
GCTTCCCAGA TCACCGAGAT GGCCTTCAGC GAGAGCGGCA GCGTCGACCA GATTCTCGAC
GAGGCCGAAC GTCTGATCTT TGAAGTAGCC GGGGGGCGGC ACCGGAGCGG TTTCGTTCCC
ATTAAAAACG TCCTTCTCCA GACCTTCGAA CAGCTGGAGC GCCTGAGCAC CCACAAGGGC
GAGGTCACCG GAGTGCCAAC CTTTCACGAT CTGGACCGTC TCCTTTCCGG TCTCCAGCCC
TCCGACCTGA TTATCTGCGC CGCCCGGCCG GGGATGGGCA AGACCTCCTT TTGCCTGAAC
ATTGCCCAGC AGGTGGCTGT CAAGGAAAAA CTACCGGTAG CCATTTTCAG CCTGGAGATG
TCCCGGGAGC AGCTGGTACA GCGGATGCTG GCCGCCGAAG CCATGGTCGA ACAGCAACGC
CTGCGGACTG GCTATTTGAC GGAAGACGAC TGGGCCCGGC TTGTCAACGC CGCCGGCATT
CTGGGTGAAG CGCCCATTTA TATTGACGAT ACGCCGGCCA TTTCCGCCCT GGAGGTTCGG
GCCAAGGCGC GACGACTGCA GTCGGAGACC GGTCTGGGCC TGGTGGTAGT CGACTACCTG
CAGCTGATGC AGGCCCATCG CCGGGTGGAC AGTCGCCAGC AGGAGATCGC CCTCATCTCC
CGGGCCATGA AGGCCCTGGC CCGGGAATTG AACGTCCCGG TCATGGTCCT CTCCCAGTTG
AACCGGGGTG TCGAGCAGCG CCAGGATAAA CGCCCGGTCA TGGCCGACCT CCTGGAAAGC
GGCGCCATCG AGGCCGACGC CGATGTCATT ATCTTCCTTT ACCGGCCCCA ATACTACGAT
CCCGACACCG ATAAAAAGGG CATCGCCGAA GTCATCGTGG CCAAGCACCG CAACGGTCCC
GTGGGAACGG TGGAAATGGC CTTTCTACCC GAGTATACCA AGTTTGTCGA CCTGGCCCCC
GAACCGGCCG GGTAA
 
Protein sequence
MAAEIERVPP QSIEAEQSVL GAIMLDREAL YAVLETLKVD DFYREAHRMI YRAILDLNER 
GEAVDLLTVT EELRRRGELE AAGGVAYLTS LTGDVPSVAN AGYYARLVAE KAALRSLVQA
ASQITEMAFS ESGSVDQILD EAERLIFEVA GGRHRSGFVP IKNVLLQTFE QLERLSTHKG
EVTGVPTFHD LDRLLSGLQP SDLIICAARP GMGKTSFCLN IAQQVAVKEK LPVAIFSLEM
SREQLVQRML AAEAMVEQQR LRTGYLTEDD WARLVNAAGI LGEAPIYIDD TPAISALEVR
AKARRLQSET GLGLVVVDYL QLMQAHRRVD SRQQEIALIS RAMKALAREL NVPVMVLSQL
NRGVEQRQDK RPVMADLLES GAIEADADVI IFLYRPQYYD PDTDKKGIAE VIVAKHRNGP
VGTVEMAFLP EYTKFVDLAP EPAG