Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0675 |
Symbol | |
ID | 3832499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 708160 |
End bp | 711285 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828613 |
Product | DNA topoisomerase I |
Protein accession | YP_429543 |
Protein GI | 83589534 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.135571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000063053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAACTT TTGATTCGAT AACCACTGAA CTGATAGCGG CCCTGGAAGA AGAGATCAAT GCAATCAAGG AGTCTGGCGG GGCGGAGCAA ATAAGAGTCC ACGACGGAAG GTACGGAGGC ACTGCTGCGG GCCGCTTTCT ATATATTTTT TTGCTTGATA CCGAACTCAA TATTCCCTCC GATACCCCGG CTCAACTCTG TATTGATAAG GATTCACACG AGACCATCAT ATTAAGCGTT CAGGGGTTTG AGATGACGCT GGGCATACAG GATGACCTGG GCCCCTTTAT CCCTAAGGCG GTCCTGAGCC TTTCCGCCTA CTACTTATTA GAACAGTTGC AGCGACGCCT GGATGAAATT CGCACCGGCG TACTTCCTGC CGAACGGGAT ATGTGTATGC GGCTTTTTGC GTTCCAAGCC AATGTTTCCT TTCCCAATGT TAAACCTGCC ACCAGCCTTG GAGACTTGAA CAGGGAACAA ATGGAGGCAG TCAACCGGAG CCTGGGACAG CGGGTCACAT TCATCTGGGG CCCGCCGGGA ACGGGGAAGA CACGGACCAT CGGTGCCCTT GTACGGGAGT TGGTCCAGCG TGGGGACCGT GTGCTGGTTA CGTCCCACAC GAACGTGGCT GTAGATACGG CCTTGATTCC CGTTATCAAA GCCCTGGACG ATAACGAAAT CCAGGGTGGT GCCGTGGTAC GGGTAGGCGC ACTGGCCCGG GAGGACCCGG AATTGCAGCA GGTGACGATG GAGGCCGTGC TTGAGAGGAA AAGCAAGGAT TTAAGGCAGC AACAGCAAGC GCTGGGGGTC GAGCGCAAGA GGGTGCAGGG AGTGCGGGAC CAGTTGGCTG CTACCATCCG GGTTATCGAG ACTGTTGAAA AGTCAGAGCA GCTTGTCGCA ACAGCAAAGA TTGCCCTGGC TGGAGCCGAA CAGAAGGTGC AAGAAGAAGG GAACGCTATC TCGCAGGCCC GTAAAACCCT GGGTGATTTG CATAGTAAGC TGCAGCAAGC TGAAGCAGCC GGGTTCATTC GACGGGTCTT TTTCGGTTTT AACCCGGAGG TCATCCGCCG TCAGATCGTC AATCAAGAAA CTGTAATCAC AAAGCTGGAG GCAGCTTATC AGGTTGCCCA GCAGGAGAGA AGGGCTGCTA GCGTCGCGTT ATCAGAAGCT GAACACCGGC AGGCCAAAGA CCGGCAGGCT CTGGAAAGCC TGGGGCCGTT GCCTCCTCTC CCGGAACTTC GGCAGCAGCT GGGTGAGGTC GAGAAAAACC TCAAGCAATT AGATGAGCAA TTGGCCGCCA TTGAAGCCCG GCTGCGTGAA ATGGCGGCGA CCGTGATTCG CGACGCCAGA ATCGTTGGAG CGACTTTATC TCGCCTGGTG CTTTTGGAAG AACTGTACCG GGGCACCTTT GATACAGTAA TCATCGACGA GGCCAGTATG GTTCCCCTGC CCAATCTATG GTTTGCCGGT AGCCGGGCGC AGAAGCGCGT GGTGGTCACA GGTGATTTCC GGCAGCTGCC CCCAATTGCA ACCGCCCGGG ACGCTGAAAA GTATCCCCTG GCTGCCAGAT GGTTACAAAC CGATATTTTT GTTAAGGCCG GTATCGTGGA GGGCCGGGCA AGGTTGGACG ATCCCCGGTT ATGTGCGCTG AAGGTGCAGT ATCGCATGCA TGAGGCCATT GGTGAAGTGG CCAATATGCT GGTCTATGAG CATGACGGAA ACCCGCTCGA GCACAGGGCG GACCCCAAGA AATATGCTCA CGCAACTGCG GCTCTTCCGG AATCCGGGGA GCCCCTGGTC CTGTGTACCA CATCGGGTGC CAATCCCTGG TGCGGCCGGC TGGACCCCGG GTTCTCCCGC TATAATATTT ACAGTGCTAT AGTCTGTATC CGGCTTGCCG CCCGGGCCCT GGCCAGCGGC GCCCAAAATG TTGGATTGGT GGCGCCGTAC CGGGCTCAAA CTCGCTTGTT GCAACACCTG GTGGAGCAAT ACAGGCTTCC CGGGGAAAGA GTTGAAACGG CAACCGTGCA CAGGTTCCAG GGCAACGAGA AAGATGTTAT TATCTTCGAC CTGGTGGATA GTCCCCCATT TCAGATTGGG AAGCTTCTCT CTGGCGGCTG GGGCTCCGAA GCTATGCGCT TGTTCAATGT GGCCTGCACC CGGGCCAAAG GCAAGTTGGT GATAGTGGCC CACCATGACT ATCTGAGCCA AAAAGCTCCT GCGGGTGATT CCCTGGCTAC CCTGCTGCAG TATCTGGAGC AGCACGGCAA GATCATGGAC GCCCGTATGG TGGTCCAGGA TTACGCCGAC CCGGCGGTTA AGATCGCACT GGGGGCAGTT ATGCCGGCAC GCCGGCAGCT GGGAAATCCC GAAGGTGCTA CGCACTTCAA TGAAGGAAAT TTTTACCCTG CTTTTCTGGA AGATTTGCGT GACGCCGCCG GAGAAGTAGT TATATTTAGC CCTTTCATCG CCGAGCGACG CCTGGCGGAC GTAATCACTC CTTTGCGGCG CCTGGTAGAC CGCGGGGTAC TGGTACTGGT GGTAACCAGG GAGCGGCATG AGTCCAACCA GGTTACGGAG GAATTAATTC GTCAGTTATC TACAATAGGT ATTAAAGTGT TGCGCCGCAG GGGCCTGCAT GAGAAGCTCG CCTTCGTGGA TCGGAAGATC GCCTGGTTCG GCAGCTTGAA CATTCTTTCC CACAGCCGGA GCAGTGAGGT GATGATCCGG TTCTCCCAGC CGGAGCTGGT GGTGAGGCTG ATGGAACTCT CAGGTACGGT GTACCTGCTT AAGCAAGAGG AACGGCGGTC TGTACAAAAG CACCGCCTTA CCGAGTTGGC TGATGCTCTA AAGAAGCGGA TGGCGTTTCC TTCCTGTCCA TTGTGCGGTG GTACCACCGG GCTCAGAACA GGCAAGCATG GGCCCTTCTT TGGCTGTGCT TCCTTCCGGA ATGGCGGTTG TAAAGGCTTG ATTAACATCC CGCGTCGGGT ACTGGAACTG GCGGTTCAGG ACCTGGAGCT TACCTGTCCG CATTGTGGGG GGAAGGTTGT CCTCAAATCT GGCCGCAACG GGGCATTCCT GGGCTGCAGC CGGTACCCGG ACTGCCGCTG GACTGATTCC TTTTAA
|
Protein sequence | MATFDSITTE LIAALEEEIN AIKESGGAEQ IRVHDGRYGG TAAGRFLYIF LLDTELNIPS DTPAQLCIDK DSHETIILSV QGFEMTLGIQ DDLGPFIPKA VLSLSAYYLL EQLQRRLDEI RTGVLPAERD MCMRLFAFQA NVSFPNVKPA TSLGDLNREQ MEAVNRSLGQ RVTFIWGPPG TGKTRTIGAL VRELVQRGDR VLVTSHTNVA VDTALIPVIK ALDDNEIQGG AVVRVGALAR EDPELQQVTM EAVLERKSKD LRQQQQALGV ERKRVQGVRD QLAATIRVIE TVEKSEQLVA TAKIALAGAE QKVQEEGNAI SQARKTLGDL HSKLQQAEAA GFIRRVFFGF NPEVIRRQIV NQETVITKLE AAYQVAQQER RAASVALSEA EHRQAKDRQA LESLGPLPPL PELRQQLGEV EKNLKQLDEQ LAAIEARLRE MAATVIRDAR IVGATLSRLV LLEELYRGTF DTVIIDEASM VPLPNLWFAG SRAQKRVVVT GDFRQLPPIA TARDAEKYPL AARWLQTDIF VKAGIVEGRA RLDDPRLCAL KVQYRMHEAI GEVANMLVYE HDGNPLEHRA DPKKYAHATA ALPESGEPLV LCTTSGANPW CGRLDPGFSR YNIYSAIVCI RLAARALASG AQNVGLVAPY RAQTRLLQHL VEQYRLPGER VETATVHRFQ GNEKDVIIFD LVDSPPFQIG KLLSGGWGSE AMRLFNVACT RAKGKLVIVA HHDYLSQKAP AGDSLATLLQ YLEQHGKIMD ARMVVQDYAD PAVKIALGAV MPARRQLGNP EGATHFNEGN FYPAFLEDLR DAAGEVVIFS PFIAERRLAD VITPLRRLVD RGVLVLVVTR ERHESNQVTE ELIRQLSTIG IKVLRRRGLH EKLAFVDRKI AWFGSLNILS HSRSSEVMIR FSQPELVVRL MELSGTVYLL KQEERRSVQK HRLTELADAL KKRMAFPSCP LCGGTTGLRT GKHGPFFGCA SFRNGGCKGL INIPRRVLEL AVQDLELTCP HCGGKVVLKS GRNGAFLGCS RYPDCRWTDS F
|
| |