Gene Moth_0675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0675 
Symbol 
ID3832499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp708160 
End bp711285 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table11 
GC content56% 
IMG OID637828613 
ProductDNA topoisomerase I 
Protein accessionYP_429543 
Protein GI83589534 
COG category[L] Replication, recombination and repair 
COG ID[COG1112] Superfamily I DNA and RNA helicases and helicase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000063053 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAACTT TTGATTCGAT AACCACTGAA CTGATAGCGG CCCTGGAAGA AGAGATCAAT 
GCAATCAAGG AGTCTGGCGG GGCGGAGCAA ATAAGAGTCC ACGACGGAAG GTACGGAGGC
ACTGCTGCGG GCCGCTTTCT ATATATTTTT TTGCTTGATA CCGAACTCAA TATTCCCTCC
GATACCCCGG CTCAACTCTG TATTGATAAG GATTCACACG AGACCATCAT ATTAAGCGTT
CAGGGGTTTG AGATGACGCT GGGCATACAG GATGACCTGG GCCCCTTTAT CCCTAAGGCG
GTCCTGAGCC TTTCCGCCTA CTACTTATTA GAACAGTTGC AGCGACGCCT GGATGAAATT
CGCACCGGCG TACTTCCTGC CGAACGGGAT ATGTGTATGC GGCTTTTTGC GTTCCAAGCC
AATGTTTCCT TTCCCAATGT TAAACCTGCC ACCAGCCTTG GAGACTTGAA CAGGGAACAA
ATGGAGGCAG TCAACCGGAG CCTGGGACAG CGGGTCACAT TCATCTGGGG CCCGCCGGGA
ACGGGGAAGA CACGGACCAT CGGTGCCCTT GTACGGGAGT TGGTCCAGCG TGGGGACCGT
GTGCTGGTTA CGTCCCACAC GAACGTGGCT GTAGATACGG CCTTGATTCC CGTTATCAAA
GCCCTGGACG ATAACGAAAT CCAGGGTGGT GCCGTGGTAC GGGTAGGCGC ACTGGCCCGG
GAGGACCCGG AATTGCAGCA GGTGACGATG GAGGCCGTGC TTGAGAGGAA AAGCAAGGAT
TTAAGGCAGC AACAGCAAGC GCTGGGGGTC GAGCGCAAGA GGGTGCAGGG AGTGCGGGAC
CAGTTGGCTG CTACCATCCG GGTTATCGAG ACTGTTGAAA AGTCAGAGCA GCTTGTCGCA
ACAGCAAAGA TTGCCCTGGC TGGAGCCGAA CAGAAGGTGC AAGAAGAAGG GAACGCTATC
TCGCAGGCCC GTAAAACCCT GGGTGATTTG CATAGTAAGC TGCAGCAAGC TGAAGCAGCC
GGGTTCATTC GACGGGTCTT TTTCGGTTTT AACCCGGAGG TCATCCGCCG TCAGATCGTC
AATCAAGAAA CTGTAATCAC AAAGCTGGAG GCAGCTTATC AGGTTGCCCA GCAGGAGAGA
AGGGCTGCTA GCGTCGCGTT ATCAGAAGCT GAACACCGGC AGGCCAAAGA CCGGCAGGCT
CTGGAAAGCC TGGGGCCGTT GCCTCCTCTC CCGGAACTTC GGCAGCAGCT GGGTGAGGTC
GAGAAAAACC TCAAGCAATT AGATGAGCAA TTGGCCGCCA TTGAAGCCCG GCTGCGTGAA
ATGGCGGCGA CCGTGATTCG CGACGCCAGA ATCGTTGGAG CGACTTTATC TCGCCTGGTG
CTTTTGGAAG AACTGTACCG GGGCACCTTT GATACAGTAA TCATCGACGA GGCCAGTATG
GTTCCCCTGC CCAATCTATG GTTTGCCGGT AGCCGGGCGC AGAAGCGCGT GGTGGTCACA
GGTGATTTCC GGCAGCTGCC CCCAATTGCA ACCGCCCGGG ACGCTGAAAA GTATCCCCTG
GCTGCCAGAT GGTTACAAAC CGATATTTTT GTTAAGGCCG GTATCGTGGA GGGCCGGGCA
AGGTTGGACG ATCCCCGGTT ATGTGCGCTG AAGGTGCAGT ATCGCATGCA TGAGGCCATT
GGTGAAGTGG CCAATATGCT GGTCTATGAG CATGACGGAA ACCCGCTCGA GCACAGGGCG
GACCCCAAGA AATATGCTCA CGCAACTGCG GCTCTTCCGG AATCCGGGGA GCCCCTGGTC
CTGTGTACCA CATCGGGTGC CAATCCCTGG TGCGGCCGGC TGGACCCCGG GTTCTCCCGC
TATAATATTT ACAGTGCTAT AGTCTGTATC CGGCTTGCCG CCCGGGCCCT GGCCAGCGGC
GCCCAAAATG TTGGATTGGT GGCGCCGTAC CGGGCTCAAA CTCGCTTGTT GCAACACCTG
GTGGAGCAAT ACAGGCTTCC CGGGGAAAGA GTTGAAACGG CAACCGTGCA CAGGTTCCAG
GGCAACGAGA AAGATGTTAT TATCTTCGAC CTGGTGGATA GTCCCCCATT TCAGATTGGG
AAGCTTCTCT CTGGCGGCTG GGGCTCCGAA GCTATGCGCT TGTTCAATGT GGCCTGCACC
CGGGCCAAAG GCAAGTTGGT GATAGTGGCC CACCATGACT ATCTGAGCCA AAAAGCTCCT
GCGGGTGATT CCCTGGCTAC CCTGCTGCAG TATCTGGAGC AGCACGGCAA GATCATGGAC
GCCCGTATGG TGGTCCAGGA TTACGCCGAC CCGGCGGTTA AGATCGCACT GGGGGCAGTT
ATGCCGGCAC GCCGGCAGCT GGGAAATCCC GAAGGTGCTA CGCACTTCAA TGAAGGAAAT
TTTTACCCTG CTTTTCTGGA AGATTTGCGT GACGCCGCCG GAGAAGTAGT TATATTTAGC
CCTTTCATCG CCGAGCGACG CCTGGCGGAC GTAATCACTC CTTTGCGGCG CCTGGTAGAC
CGCGGGGTAC TGGTACTGGT GGTAACCAGG GAGCGGCATG AGTCCAACCA GGTTACGGAG
GAATTAATTC GTCAGTTATC TACAATAGGT ATTAAAGTGT TGCGCCGCAG GGGCCTGCAT
GAGAAGCTCG CCTTCGTGGA TCGGAAGATC GCCTGGTTCG GCAGCTTGAA CATTCTTTCC
CACAGCCGGA GCAGTGAGGT GATGATCCGG TTCTCCCAGC CGGAGCTGGT GGTGAGGCTG
ATGGAACTCT CAGGTACGGT GTACCTGCTT AAGCAAGAGG AACGGCGGTC TGTACAAAAG
CACCGCCTTA CCGAGTTGGC TGATGCTCTA AAGAAGCGGA TGGCGTTTCC TTCCTGTCCA
TTGTGCGGTG GTACCACCGG GCTCAGAACA GGCAAGCATG GGCCCTTCTT TGGCTGTGCT
TCCTTCCGGA ATGGCGGTTG TAAAGGCTTG ATTAACATCC CGCGTCGGGT ACTGGAACTG
GCGGTTCAGG ACCTGGAGCT TACCTGTCCG CATTGTGGGG GGAAGGTTGT CCTCAAATCT
GGCCGCAACG GGGCATTCCT GGGCTGCAGC CGGTACCCGG ACTGCCGCTG GACTGATTCC
TTTTAA
 
Protein sequence
MATFDSITTE LIAALEEEIN AIKESGGAEQ IRVHDGRYGG TAAGRFLYIF LLDTELNIPS 
DTPAQLCIDK DSHETIILSV QGFEMTLGIQ DDLGPFIPKA VLSLSAYYLL EQLQRRLDEI
RTGVLPAERD MCMRLFAFQA NVSFPNVKPA TSLGDLNREQ MEAVNRSLGQ RVTFIWGPPG
TGKTRTIGAL VRELVQRGDR VLVTSHTNVA VDTALIPVIK ALDDNEIQGG AVVRVGALAR
EDPELQQVTM EAVLERKSKD LRQQQQALGV ERKRVQGVRD QLAATIRVIE TVEKSEQLVA
TAKIALAGAE QKVQEEGNAI SQARKTLGDL HSKLQQAEAA GFIRRVFFGF NPEVIRRQIV
NQETVITKLE AAYQVAQQER RAASVALSEA EHRQAKDRQA LESLGPLPPL PELRQQLGEV
EKNLKQLDEQ LAAIEARLRE MAATVIRDAR IVGATLSRLV LLEELYRGTF DTVIIDEASM
VPLPNLWFAG SRAQKRVVVT GDFRQLPPIA TARDAEKYPL AARWLQTDIF VKAGIVEGRA
RLDDPRLCAL KVQYRMHEAI GEVANMLVYE HDGNPLEHRA DPKKYAHATA ALPESGEPLV
LCTTSGANPW CGRLDPGFSR YNIYSAIVCI RLAARALASG AQNVGLVAPY RAQTRLLQHL
VEQYRLPGER VETATVHRFQ GNEKDVIIFD LVDSPPFQIG KLLSGGWGSE AMRLFNVACT
RAKGKLVIVA HHDYLSQKAP AGDSLATLLQ YLEQHGKIMD ARMVVQDYAD PAVKIALGAV
MPARRQLGNP EGATHFNEGN FYPAFLEDLR DAAGEVVIFS PFIAERRLAD VITPLRRLVD
RGVLVLVVTR ERHESNQVTE ELIRQLSTIG IKVLRRRGLH EKLAFVDRKI AWFGSLNILS
HSRSSEVMIR FSQPELVVRL MELSGTVYLL KQEERRSVQK HRLTELADAL KKRMAFPSCP
LCGGTTGLRT GKHGPFFGCA SFRNGGCKGL INIPRRVLEL AVQDLELTCP HCGGKVVLKS
GRNGAFLGCS RYPDCRWTDS F