Gene Moth_0349 details


Gene Information

Locus tag: Moth_0349
Symbol:
ID: 3832761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 355643
End bp: 357712
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 58%
IMG OID: 637828284
Product: DNA topoisomerase
Protein accession: YP_429226
Protein GI: 83589217
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
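
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions, although the values in this record agree with them. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(355643, 357712))  # (2070, 689)
```

The result matches the Gene Length (2070 bp) and Protein Length (689 aa) fields.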


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.26912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGAACG CTTGCGACGC CGGCCGGGAA GGGGAACTCA TCTTTCGCCG AATATACGCC 
TGGTGCAAAG GGCAGAAACC CGTCAAGCGC CTCTGGCTCT CGGAGGCCAC GCCGGCGGCC
ATCAAAGAAG CCTTTCGTCA CCTCCGCGTA GGTGAAGAAT TGGACAACCT GGCGGCGGCC
GCCGAAGCCC GGGCGCAGGC CGACTGGCTG GTAGGCATTA ATACCACCAG GGCCTTCACC
TGCCGCCATA ATAGATTACT CTCGGTTGGC CGGGTCCAGA CCCCCACCCT GGCCCTGGTG
GTGGCCAGGG AAAGAGAGAT CCGCGCTTTC AAGCCGGAGC CGTACTTTGA AGTATGGGCT
ACCTTCCGGA AAAGCACCGG CGAAACCTAC AGGGGCAAAT GGTTCCGGGA AAAGCAGGAC
CGGCTGCAGG ACAAGAAAAA AGCCGGGGAC CTGGCTAAAC AGATAAGCGC CTGCGGGATA
GTCGAAAAGA TAGAGCAGAA AGAAGTGCGG GAACAGCCGC CGCAGCTGTT CAACCTCAAC
GACCTCCAGA AAGAAGCCAA CAAAAAATAC GGCCTCACCG CCCAGAAAAC CCTTGACGCG
GCCCAGGCCC TGTATGAAAA ACACAAGCTC CTGACCTACC CCCGCACCGA CAGCCGCCAT
TTAACGACGG CCCTGGTCCG GGATACCCTG GCAGGGCGAC TGGAAGCCCT GACCGGCATA
CCGGCATATG CCGCTCTGGT CCCCGAAAAC CTGCCCCAGC TGGGCAAGCG TTACGTAGAC
GACAGCAAAG TTAGCGATCA CCACGCCCTC ATCCCCACGG CCGTAAACCC CGATCTAAGC
AAATTAAGTC CGGTCGAGCA AAAGGTATAT GACCTGGTGG CGCGCCGTTT CCTGGCCATC
TTCTACCCGG ACGCCCGGTA CGCAGTGACC AGGGTGGTCA CCACCGCCGG CGGCGAGAGT
TTCTTATCCC AGGGTAGGGT GGAGTTAGAA CGGGGCTGGA AGGCCGTCTA CGGCCGGCAG
GAAGAAGACG GGGCGGAAAG CAAAGACGAA GAAAGCCAGA CCCTCCCCCA GCTGGTGGAA
GGCGAAGAAG TCGCCGTTCA GGGGGTAGAA GTAAAGGCGA AGCAGACCAG GCCCCCCCAG
CGCTATACCG AAGCCACCCT CCTGGCGGCC ATGGAGAACG CCGGCCGCCT GGTGGAGGAC
AAAGAGATGG CTGATACTCT GAAGACCGCC GGCGGCATCG GCACTCCCGC CACCAGGGCG
GCAATTATTG AGCGCCTTAT CCAGGTCGGC TATCTCCGGC GGGAAAAGAA AAACCTGCTG
CCTACCGCCA AGGGCGAAAC CCTTATCGGC CTGGTGCCGG AGGAGGTGAA GTCAGTCGAA
TTAACAGCCC GATGGGAGGA AGGGTTAAAA GAGATCGAGG AAGGACAGCG CGACTGTAAA
GAATGGCTGG AAGGAATAAA AAACTTCACC ACGGAGGTGG TCCGAATGGC CAGGGAACAG
GAAGCCGCCC CGGGGGCCGA TCCTGACCGG GAAGTCTTGG GGCAGTGCCC CATATGCGGA
CGGGAGGTGA TGGAATACCC CAAAAGCTAC AGCTGCAGCG GCTACAAAGA AGGCTGCAGA
TTCGCCATCT GGAAAGAGAT CGCCGGCAAA AAAGTAACGG CCAGCCAGGC TAAGGAACTG
CTGCAGAAAG GGAAAACCGG GGTAATCAAA GGCTTTAAGT CTAAAACCGG CAAAAAGTTC
GACGCCGCAC TGACCCTGGG GGAAGGGGGC AAAGTTAACT TTGAATTTGC CGAAGGGAAT
AGAGAAACCC TGGGGAAATG CCCGCTGTGC GGTAAAGACG TTACCGAGTC CCAGAAAGGA
TACGGCTGTT CCGGCTGGAA AGAAGGCTGC AAATTCGTCA TCTGGAAAGA AATCGCCGGC
AAAAAGATTA CCGCCGGCCA GGCGAAGGAG TTGTTGCAAA AAGGCAGGAC AGGGGTAATT
AAAGGATTTA AGTCACGGGC TGGGAAGGAA TTCGAGGCGA TACTCGTCTT GAAGGAAGAC
GGTAAGTTAG AATTTGAGTT TGAGGGGTGA
 
Protein sequence
MVNACDAGRE GELIFRRIYA WCKGQKPVKR LWLSEATPAA IKEAFRHLRV GEELDNLAAA 
AEARAQADWL VGINTTRAFT CRHNRLLSVG RVQTPTLALV VAREREIRAF KPEPYFEVWA
TFRKSTGETY RGKWFREKQD RLQDKKKAGD LAKQISACGI VEKIEQKEVR EQPPQLFNLN
DLQKEANKKY GLTAQKTLDA AQALYEKHKL LTYPRTDSRH LTTALVRDTL AGRLEALTGI
PAYAALVPEN LPQLGKRYVD DSKVSDHHAL IPTAVNPDLS KLSPVEQKVY DLVARRFLAI
FYPDARYAVT RVVTTAGGES FLSQGRVELE RGWKAVYGRQ EEDGAESKDE ESQTLPQLVE
GEEVAVQGVE VKAKQTRPPQ RYTEATLLAA MENAGRLVED KEMADTLKTA GGIGTPATRA
AIIERLIQVG YLRREKKNLL PTAKGETLIG LVPEEVKSVE LTARWEEGLK EIEEGQRDCK
EWLEGIKNFT TEVVRMAREQ EAAPGADPDR EVLGQCPICG REVMEYPKSY SCSGYKEGCR
FAIWKEIAGK KVTASQAKEL LQKGKTGVIK GFKSKTGKKF DAALTLGEGG KVNFEFAEGN
RETLGKCPLC GKDVTESQKG YGCSGWKEGC KFVIWKEIAG KKITAGQAKE LLQKGRTGVI
KGFKSRAGKE FEAILVLKED GKLEFEFEG
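
The protein sequence follows from the gene sequence under translation table 11. The sketch below shows one way this could be reproduced in plain Python, under stated assumptions: the codon assignments are those of the standard code (which table 11 shares), the set of alternative start codons is written from the NCBI table 11 definition as recalled here, and an initiating alternative start codon such as GTG is read as methionine. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed start codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(grouped_dna: str) -> str:
    """Translate a CDS given in 10-base groups, stopping at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()   # drop spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":                       # stop codon ends translation
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"                        # initiator codon reads as Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTGAACG CTTGCGACGC CGGCCGGGAA"))  # MVNACDAGRE
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above, including the GTG start codon being read as M.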