Gene Information |
Locus tag | Moth_0349 |
Symbol | |
ID | 3832761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 355643 |
End bp | 357712 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828284 |
Product | DNA topoisomerase |
Protein accession | YP_429226 |
Protein GI | 83589217 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
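The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 355643
END_BP = 357712


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2070, matching "Gene Length"
print(protein_length(length_bp))  # 689, matching "Protein Length"
```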
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.26912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAACG CTTGCGACGC CGGCCGGGAA GGGGAACTCA TCTTTCGCCG AATATACGCC TGGTGCAAAG GGCAGAAACC CGTCAAGCGC CTCTGGCTCT CGGAGGCCAC GCCGGCGGCC ATCAAAGAAG CCTTTCGTCA CCTCCGCGTA GGTGAAGAAT TGGACAACCT GGCGGCGGCC GCCGAAGCCC GGGCGCAGGC CGACTGGCTG GTAGGCATTA ATACCACCAG GGCCTTCACC TGCCGCCATA ATAGATTACT CTCGGTTGGC CGGGTCCAGA CCCCCACCCT GGCCCTGGTG GTGGCCAGGG AAAGAGAGAT CCGCGCTTTC AAGCCGGAGC CGTACTTTGA AGTATGGGCT ACCTTCCGGA AAAGCACCGG CGAAACCTAC AGGGGCAAAT GGTTCCGGGA AAAGCAGGAC CGGCTGCAGG ACAAGAAAAA AGCCGGGGAC CTGGCTAAAC AGATAAGCGC CTGCGGGATA GTCGAAAAGA TAGAGCAGAA AGAAGTGCGG GAACAGCCGC CGCAGCTGTT CAACCTCAAC GACCTCCAGA AAGAAGCCAA CAAAAAATAC GGCCTCACCG CCCAGAAAAC CCTTGACGCG GCCCAGGCCC TGTATGAAAA ACACAAGCTC CTGACCTACC CCCGCACCGA CAGCCGCCAT TTAACGACGG CCCTGGTCCG GGATACCCTG GCAGGGCGAC TGGAAGCCCT GACCGGCATA CCGGCATATG CCGCTCTGGT CCCCGAAAAC CTGCCCCAGC TGGGCAAGCG TTACGTAGAC GACAGCAAAG TTAGCGATCA CCACGCCCTC ATCCCCACGG CCGTAAACCC CGATCTAAGC AAATTAAGTC CGGTCGAGCA AAAGGTATAT GACCTGGTGG CGCGCCGTTT CCTGGCCATC TTCTACCCGG ACGCCCGGTA CGCAGTGACC AGGGTGGTCA CCACCGCCGG CGGCGAGAGT TTCTTATCCC AGGGTAGGGT GGAGTTAGAA CGGGGCTGGA AGGCCGTCTA CGGCCGGCAG GAAGAAGACG GGGCGGAAAG CAAAGACGAA GAAAGCCAGA CCCTCCCCCA GCTGGTGGAA GGCGAAGAAG TCGCCGTTCA GGGGGTAGAA GTAAAGGCGA AGCAGACCAG GCCCCCCCAG CGCTATACCG AAGCCACCCT CCTGGCGGCC ATGGAGAACG CCGGCCGCCT GGTGGAGGAC AAAGAGATGG CTGATACTCT GAAGACCGCC GGCGGCATCG GCACTCCCGC CACCAGGGCG GCAATTATTG AGCGCCTTAT CCAGGTCGGC TATCTCCGGC GGGAAAAGAA AAACCTGCTG CCTACCGCCA AGGGCGAAAC CCTTATCGGC CTGGTGCCGG AGGAGGTGAA GTCAGTCGAA TTAACAGCCC GATGGGAGGA AGGGTTAAAA GAGATCGAGG AAGGACAGCG CGACTGTAAA GAATGGCTGG AAGGAATAAA AAACTTCACC ACGGAGGTGG TCCGAATGGC CAGGGAACAG GAAGCCGCCC CGGGGGCCGA TCCTGACCGG GAAGTCTTGG GGCAGTGCCC CATATGCGGA CGGGAGGTGA TGGAATACCC CAAAAGCTAC AGCTGCAGCG GCTACAAAGA AGGCTGCAGA TTCGCCATCT GGAAAGAGAT CGCCGGCAAA AAAGTAACGG CCAGCCAGGC TAAGGAACTG CTGCAGAAAG GGAAAACCGG GGTAATCAAA GGCTTTAAGT CTAAAACCGG CAAAAAGTTC GACGCCGCAC TGACCCTGGG GGAAGGGGGC AAAGTTAACT TTGAATTTGC CGAAGGGAAT 
AGAGAAACCC TGGGGAAATG CCCGCTGTGC GGTAAAGACG TTACCGAGTC CCAGAAAGGA TACGGCTGTT CCGGCTGGAA AGAAGGCTGC AAATTCGTCA TCTGGAAAGA AATCGCCGGC AAAAAGATTA CCGCCGGCCA GGCGAAGGAG TTGTTGCAAA AAGGCAGGAC AGGGGTAATT AAAGGATTTA AGTCACGGGC TGGGAAGGAA TTCGAGGCGA TACTCGTCTT GAAGGAAGAC GGTAAGTTAG AATTTGAGTT TGAGGGGTGA
|
Protein sequence | MVNACDAGRE GELIFRRIYA WCKGQKPVKR LWLSEATPAA IKEAFRHLRV GEELDNLAAA AEARAQADWL VGINTTRAFT CRHNRLLSVG RVQTPTLALV VAREREIRAF KPEPYFEVWA TFRKSTGETY RGKWFREKQD RLQDKKKAGD LAKQISACGI VEKIEQKEVR EQPPQLFNLN DLQKEANKKY GLTAQKTLDA AQALYEKHKL LTYPRTDSRH LTTALVRDTL AGRLEALTGI PAYAALVPEN LPQLGKRYVD DSKVSDHHAL IPTAVNPDLS KLSPVEQKVY DLVARRFLAI FYPDARYAVT RVVTTAGGES FLSQGRVELE RGWKAVYGRQ EEDGAESKDE ESQTLPQLVE GEEVAVQGVE VKAKQTRPPQ RYTEATLLAA MENAGRLVED KEMADTLKTA GGIGTPATRA AIIERLIQVG YLRREKKNLL PTAKGETLIG LVPEEVKSVE LTARWEEGLK EIEEGQRDCK EWLEGIKNFT TEVVRMAREQ EAAPGADPDR EVLGQCPICG REVMEYPKSY SCSGYKEGCR FAIWKEIAGK KVTASQAKEL LQKGKTGVIK GFKSKTGKKF DAALTLGEGG KVNFEFAEGN RETLGKCPLC GKDVTESQKG YGCSGWKEGC KFVIWKEIAG KKITAGQAKE LLQKGRTGVI KGFKSRAGKE FEAILVLKED GKLEFEFEG
|
| |
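The gene sequence begins with GTG while the protein begins with M. Under translation table 11 the codon assignments are the same as in the standard code, but several alternative start codons (GTG among them) are read as methionine when they initiate translation. A minimal sketch of that rule follows, using only the standard library. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the short fragments in the usage lines are copied from the start and end of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in NCBI order (T, C, A, G at each position);
# translation table 11 uses these same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; a recognised start codon in first position becomes M."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("GTGGTGAACG CTTGCGACGC CGGCCGGGAA"))  # MVNACDAGRE
# Last 21 bases end in the TGA stop codon.
print(translate("GAATTTGAGTT TGAGGGGTGA"))            # EFEFEG
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is one way to confirm that the two fields agree; `gc_fraction` can be compared with the reported GC content in the same way.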