Gene Moth_1026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1026 
Symbol 
ID3832646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1054994 
End bp1057087 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content55% 
IMG OID637828954 
ProductDNA topoisomerase I 
Protein accessionYP_429883 
Protein GI83589874 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00786254 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000108626 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTTGGCCA AGACCTTGGT TATCGTGGAG TCCCCGGCCA AGGCAAAAAC CATTGGCAAA 
TTCCTGGGTA AGAATTATAC CATCAAGCCG TCCATGGGGC ATGTCCGGGA TTTACCCAAA
AGCCAGTTCG GCGTTGATGT GGAAAATAAT TTTGAACCCC GTTATATAAC CATCCGGGGC
AAGGGTGAAA TTGTCAAGGA ACTGCGGGCT GCCGCCCAGA AGGCCCAGAG GGTTTTACTG
GCCCCGGACC CGGATCGTGA AGGAGAGGCC ATTGCCTGGC ACCTTCAGCA CCTCCTTGGC
CTTCCTGATG AAGCCATCAG GATCGAATTC AACGAGATTA CCCGCAATGC CATCCAGGAA
GCGGTCAAGA AACCCCGGAA GATCGACCAG GACCGGGTAG ATGCCCAACA GGCGCGGCGC
ATCCTGGACA GGGTGGTAGG TTACCGTTTA AGCCCTCTCC TCTGGCGCAA AGTCCGTAAA
GGTTTAAGTG CCGGCCGGGT GCAGTCAGTA GCCGTGCGCT TGATAGTCGA CCGGGAGCAG
GAAATCGAAG CCTTTCAACC GGAGGAATAC TGGAGCCTTA CAGCCTGGCT ATTGCCGGAG
AACGAGGGTG AGGCTTTTCC TGCTAGGTTG GTGAAATATG CCGGCGAGGA TTTAAATGTT
AAAAACGAAG GGGAAATGGA GTCAATTCTT CATTCCCTGG AAGGAACCAC CTATGTTGTA
GCTGAAGTAA AGCAGCGGGA ACGCCGGAAA AATCCGGCTG CTCCCTTTAC TACCAGTACC
CTGCAGCAGG AAGCCTACCG GAAATTAAAC TTTACCTCCC GGCGTACCAT GCAGGTGGCC
CAGCAACTCT ACGAGGGTAT TGACCTGGGA GGCGGCCAGG GCCCTGTAGG TCTTATCACC
TATATCCGTA CGGATTCCAC CCGGGTGGCC AGTGTGGCTC AGCTGGAAGC CCGCGATTTT
CTGATGGAAC GCTTCGGCTC GGAATATGTT CCGGAAGGTT TACGTCAGTA TAAGGGGCGC
AAAGATATCC AGGACGCCCA TGAGGCCATC CGGCCGACCT CAGTTTGGCG TGAGCCGGCG
TCCCTGAAAA ACATCCTGAC CCGCGATCAG TTTCGCCTTT ATAATCTTAT CTGGGAGCGT
TTTGTTGCCA GCCAGATGCA GGCGGCTGTG ATGGATACCG TGACGGTGGA TATTACTGCC
GGGCCGTGTC TTTTCCGGGC GACGGGTTCG GTCGTCAAGT TTCCGGGTTT TCTAAAGGTT
TACCAGGAGG GCAGGGATGG TGAAGATAAG GACCAGGAGC AGCGGTTGCC ACCCCTGGCC
ACAGGCCAGA CCCTGAAACT GCAATCCCTG GAACCCAAAC AGCACTTCAC CCAGCCCCCG
CCTCGCTATA CTGAAGCTAC CCTGGTCAAG ACTATGGAGG AACTGGGTAT CGGCAGACCA
AGTACCTATG CGCCGACCAT TGAAACCATC CTCCAGCGGG GATACGTCAC CAGGGAGCAG
AAACAATTTG TTCCCACGGA ACTGGGCCGG GTAGTCGTCG CTTTGTTAAA GGAACATTTC
CCGAAAATAA TAGACGTTGA ATTCACGGCC CATATGGAAG AGCAACTGGA TGCCATTGAA
GCCGGGAAGA TATCCTGGCG CCAGGTGCTG GCGGAGTTTT ATGGCCCCTT CGAGGAGGTC
CTGGAAAAGG CTGAGGCTGA GATAGGTACG GTGGAGGTGC CGGAAGAGGT CAGCGAGGAA
AAATGCGAAC TCTGTGGTCG CAACCTGGTG GTCAAGATGG GTCGTTACGG CAAATTCCTG
GCCTGTCCCG GTTTCCCGGA GTGTCGTTTT ACCAAGCCCC TGCTGGAGAC CATCGGCGTC
AACTGCCCGG AGTGCGGCGG CCAGATCGTT GCCCGGCGGA CGAAAAGGGG GCGGAAGTTT
TACGGCTGCC AGAACTATCC CCGTTGCACC TATGTTTCCT GGGATAAACC AACCAATCAA
ACCTGCCCGC GCTGTGGTAA GCGTCTGGTA GAGAAGGCCT CACGTCAGGG TAGCCGCCTC
GTTTGTCCCC AAAAAGAATG TGGCTATGTA GAAGAGGTCC GGCAGGCAAA ATAG
 
Protein sequence
MLAKTLVIVE SPAKAKTIGK FLGKNYTIKP SMGHVRDLPK SQFGVDVENN FEPRYITIRG 
KGEIVKELRA AAQKAQRVLL APDPDREGEA IAWHLQHLLG LPDEAIRIEF NEITRNAIQE
AVKKPRKIDQ DRVDAQQARR ILDRVVGYRL SPLLWRKVRK GLSAGRVQSV AVRLIVDREQ
EIEAFQPEEY WSLTAWLLPE NEGEAFPARL VKYAGEDLNV KNEGEMESIL HSLEGTTYVV
AEVKQRERRK NPAAPFTTST LQQEAYRKLN FTSRRTMQVA QQLYEGIDLG GGQGPVGLIT
YIRTDSTRVA SVAQLEARDF LMERFGSEYV PEGLRQYKGR KDIQDAHEAI RPTSVWREPA
SLKNILTRDQ FRLYNLIWER FVASQMQAAV MDTVTVDITA GPCLFRATGS VVKFPGFLKV
YQEGRDGEDK DQEQRLPPLA TGQTLKLQSL EPKQHFTQPP PRYTEATLVK TMEELGIGRP
STYAPTIETI LQRGYVTREQ KQFVPTELGR VVVALLKEHF PKIIDVEFTA HMEEQLDAIE
AGKISWRQVL AEFYGPFEEV LEKAEAEIGT VEVPEEVSEE KCELCGRNLV VKMGRYGKFL
ACPGFPECRF TKPLLETIGV NCPECGGQIV ARRTKRGRKF YGCQNYPRCT YVSWDKPTNQ
TCPRCGKRLV EKASRQGSRL VCPQKECGYV EEVRQAK