Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1026 |
Symbol | |
ID | 3832646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1054994 |
End bp | 1057087 |
Gene Length | 2094 bp |
Protein Length | 697 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637828954 |
Product | DNA topoisomerase I |
Protein accession | YP_429883 |
Protein GI | 83589874 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00786254 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000108626 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTGGCCA AGACCTTGGT TATCGTGGAG TCCCCGGCCA AGGCAAAAAC CATTGGCAAA TTCCTGGGTA AGAATTATAC CATCAAGCCG TCCATGGGGC ATGTCCGGGA TTTACCCAAA AGCCAGTTCG GCGTTGATGT GGAAAATAAT TTTGAACCCC GTTATATAAC CATCCGGGGC AAGGGTGAAA TTGTCAAGGA ACTGCGGGCT GCCGCCCAGA AGGCCCAGAG GGTTTTACTG GCCCCGGACC CGGATCGTGA AGGAGAGGCC ATTGCCTGGC ACCTTCAGCA CCTCCTTGGC CTTCCTGATG AAGCCATCAG GATCGAATTC AACGAGATTA CCCGCAATGC CATCCAGGAA GCGGTCAAGA AACCCCGGAA GATCGACCAG GACCGGGTAG ATGCCCAACA GGCGCGGCGC ATCCTGGACA GGGTGGTAGG TTACCGTTTA AGCCCTCTCC TCTGGCGCAA AGTCCGTAAA GGTTTAAGTG CCGGCCGGGT GCAGTCAGTA GCCGTGCGCT TGATAGTCGA CCGGGAGCAG GAAATCGAAG CCTTTCAACC GGAGGAATAC TGGAGCCTTA CAGCCTGGCT ATTGCCGGAG AACGAGGGTG AGGCTTTTCC TGCTAGGTTG GTGAAATATG CCGGCGAGGA TTTAAATGTT AAAAACGAAG GGGAAATGGA GTCAATTCTT CATTCCCTGG AAGGAACCAC CTATGTTGTA GCTGAAGTAA AGCAGCGGGA ACGCCGGAAA AATCCGGCTG CTCCCTTTAC TACCAGTACC CTGCAGCAGG AAGCCTACCG GAAATTAAAC TTTACCTCCC GGCGTACCAT GCAGGTGGCC CAGCAACTCT ACGAGGGTAT TGACCTGGGA GGCGGCCAGG GCCCTGTAGG TCTTATCACC TATATCCGTA CGGATTCCAC CCGGGTGGCC AGTGTGGCTC AGCTGGAAGC CCGCGATTTT CTGATGGAAC GCTTCGGCTC GGAATATGTT CCGGAAGGTT TACGTCAGTA TAAGGGGCGC AAAGATATCC AGGACGCCCA TGAGGCCATC CGGCCGACCT CAGTTTGGCG TGAGCCGGCG TCCCTGAAAA ACATCCTGAC CCGCGATCAG TTTCGCCTTT ATAATCTTAT CTGGGAGCGT TTTGTTGCCA GCCAGATGCA GGCGGCTGTG ATGGATACCG TGACGGTGGA TATTACTGCC GGGCCGTGTC TTTTCCGGGC GACGGGTTCG GTCGTCAAGT TTCCGGGTTT TCTAAAGGTT TACCAGGAGG GCAGGGATGG TGAAGATAAG GACCAGGAGC AGCGGTTGCC ACCCCTGGCC ACAGGCCAGA CCCTGAAACT GCAATCCCTG GAACCCAAAC AGCACTTCAC CCAGCCCCCG CCTCGCTATA CTGAAGCTAC CCTGGTCAAG ACTATGGAGG AACTGGGTAT CGGCAGACCA AGTACCTATG CGCCGACCAT TGAAACCATC CTCCAGCGGG GATACGTCAC CAGGGAGCAG AAACAATTTG TTCCCACGGA ACTGGGCCGG GTAGTCGTCG CTTTGTTAAA GGAACATTTC CCGAAAATAA TAGACGTTGA ATTCACGGCC CATATGGAAG AGCAACTGGA TGCCATTGAA GCCGGGAAGA TATCCTGGCG CCAGGTGCTG GCGGAGTTTT ATGGCCCCTT CGAGGAGGTC CTGGAAAAGG CTGAGGCTGA GATAGGTACG GTGGAGGTGC CGGAAGAGGT CAGCGAGGAA AAATGCGAAC TCTGTGGTCG CAACCTGGTG GTCAAGATGG GTCGTTACGG CAAATTCCTG GCCTGTCCCG GTTTCCCGGA GTGTCGTTTT ACCAAGCCCC TGCTGGAGAC CATCGGCGTC AACTGCCCGG AGTGCGGCGG CCAGATCGTT GCCCGGCGGA CGAAAAGGGG GCGGAAGTTT TACGGCTGCC AGAACTATCC CCGTTGCACC TATGTTTCCT GGGATAAACC AACCAATCAA ACCTGCCCGC GCTGTGGTAA GCGTCTGGTA GAGAAGGCCT CACGTCAGGG TAGCCGCCTC GTTTGTCCCC AAAAAGAATG TGGCTATGTA GAAGAGGTCC GGCAGGCAAA ATAG
|
Protein sequence | MLAKTLVIVE SPAKAKTIGK FLGKNYTIKP SMGHVRDLPK SQFGVDVENN FEPRYITIRG KGEIVKELRA AAQKAQRVLL APDPDREGEA IAWHLQHLLG LPDEAIRIEF NEITRNAIQE AVKKPRKIDQ DRVDAQQARR ILDRVVGYRL SPLLWRKVRK GLSAGRVQSV AVRLIVDREQ EIEAFQPEEY WSLTAWLLPE NEGEAFPARL VKYAGEDLNV KNEGEMESIL HSLEGTTYVV AEVKQRERRK NPAAPFTTST LQQEAYRKLN FTSRRTMQVA QQLYEGIDLG GGQGPVGLIT YIRTDSTRVA SVAQLEARDF LMERFGSEYV PEGLRQYKGR KDIQDAHEAI RPTSVWREPA SLKNILTRDQ FRLYNLIWER FVASQMQAAV MDTVTVDITA GPCLFRATGS VVKFPGFLKV YQEGRDGEDK DQEQRLPPLA TGQTLKLQSL EPKQHFTQPP PRYTEATLVK TMEELGIGRP STYAPTIETI LQRGYVTREQ KQFVPTELGR VVVALLKEHF PKIIDVEFTA HMEEQLDAIE AGKISWRQVL AEFYGPFEEV LEKAEAEIGT VEVPEEVSEE KCELCGRNLV VKMGRYGKFL ACPGFPECRF TKPLLETIGV NCPECGGQIV ARRTKRGRKF YGCQNYPRCT YVSWDKPTNQ TCPRCGKRLV EKASRQGSRL VCPQKECGYV EEVRQAK
|
| |