Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1681 |
Symbol | |
ID | 3831952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1718863 |
End bp | 1721598 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829606 |
Product | single-stranded-DNA-specific exonuclease RecJ |
Protein accession | YP_430526 |
Protein GI | 83590517 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0608] Single-stranded DNA-specific exonuclease |
TIGRFAM ID | [TIGR00644] single-stranded-DNA-specific exonuclease RecJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.674318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0670138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCAG TAAACCTGCC GGAGAGGCTG GCCCTGGCCC GGGAGTTACA TATCTCACCT ATAACAGCGC AAATTCTCAT TAACCGGGGC ATAACTACAG CGGCAGCGGC CAGGGCTTTT TTCCAGCCGG ACCCGGCCAA TCTTATCCCG CCGGAGAAAA TTCCCGGCCT GCCGGCGGCC CGGGAGCGCC TGGTCCAGGC CATAGAAAAG CGGGAAAAAA TTATGGTCCA TGGTGACTAT GATGTCGACG GTCTGGCGGC CACGGCCATT ATGCTGGAAA CCCTAGCGCG CCTGGGGGTT GAGGCTGAAG TTTACCTGCC CGACCGTCTA ACGGAGGGTT ATGGTTTAAA AAAGAAGAGC CTCATTAAAG CCCTGGAGTT AAATTGTCAG CTGGTCATAA CGGTAGACTG TGGCATTACT TCCCTGGAGG AGGCCATCTT CGCCCGGGAA CAAGGTCTGG ACCTGATTAT TACCGACCAC CACCGACCCG GGACAAGCCT GCCCCAGGCG AGTGCCGTAG TTAATCCCCT GCTGGCACCG GGCCTGCCTC CCCTATGCGG CGCCGGGGTG GCCTTCAAAC TAGCCCAGTC CCTGGCGACC CACTTTGGTC TGGCACCCCA GGGAGGTGTG GCCGCCGGCT GGGCCCTGGA CCTGGTAGCC CTGGCGACTA TCGCTGACGC CGTACCCCTC CTGGGAGAAA ACCGCCTCCT GGTCCAGCTG GGTTTAAAGG CCCTGTCCGG TAGCATGCGG CCTGGCTTGA AAGCCCTGGC AGAGGTGGCA GGTCTACCGG CCAGGGAGTG GACGGCCCGG GAAGTGGCCT TTGGCCTTGT TCCCCGTTTG AATGCCTGCG GCCGCCTGGG AAATGCCATG CCGGCCCTGG AAATTTTGCT AACTTCTTCT CCCCAGCGGG CCCTGGAACT GGCCCAGCGC CTGCAGGAGG AAAACCAGGC CCGGAGACAC CTGGAGGAGA GCATTGCCGC CGAGGCTGAA GCCATGGCCG TGACGGCTTT AGCTGCAGGA GCAAAAGGCC TGGTCCTGGC GGCCGAGGGC TGGCACCCGG GAGTAACTGG CATTATCGCC GCCCGCTTGG TGGAAAAATA TAACTGTCCG GTTGTTTTAA TAGCCGTGGT AGGCGATAGG GGCCGGGGTT CGGGACGTAG CCTCCCGGGT ATCAATCTCC ATGAAATCTT CGCCAGGTGC CGTTCCTACC TCCTGTCCTT TGGCGGTCAT GCCCGGGCAG GGGGACTGGA AATCGCCGCC AAAGAAATAC CGGCTTTCCA GGCCGCCTTT AATGAAATAG TAAAGTCACT CATGGCTGAG GTTCAAACGC CTCCCGAAGT AGCGCCTGAG GCGGAGGTCC TTGTCAGCCA GCTGGACTGG CAGCTCCTGG AGGAATTAGA GCAGCTGGCC CCCTTTGGCG AAGGGAATCC CCGGCCCGTC CTGGTTACCC GCCGCGCCCT GATTAAGGCC GCCCGCCAGG TGGGCAGGGA CGGCGCCCAC CTGAAGCTGA CTGCCGGCGG GGAAGGTAGG GAAATTGGGG CCATTGGCTT TAACCTCTCC CTGCCGCCGG GGTTGTCTCC CGGGCATCAT GTTGACCTGG CCTTTTACCT GGAACGCAAC ACCTACCGGG ACCGGGAGGA ACTGCAGCTA AGGCTCGTGG CCCTGAAAGC GGCGGAAACA GGAGCAACAC TACCGGAACG GGCGGAAGGA ATGGTGGCGG CCACCGGTGA AGACGGCCCG GCACCGACCT GGGGCCGGCA GCTCCAGGAA TTGCTTGCTA CCTATGGCCC ACGGGTACGA GTATGCCTGG CGACGACGGC GGCTGTACGC CAGGCCTATA ACGGCTGCAG GCGTTTTTTC CAGGTGCCGG AAACCCTGCA ACCCCTGGGC CCCTGGCTGG GCCGGGCCGG GGTGGAACGG GTTTTGAGCC GGGCCCGGGG GTTAATTACC TGCAGCCCCT TCTGGCCGGT AACACCCGCC GGGAAAGAGG CGCTCCTGGC CTCGCCCCTG GCGGCCGCTG ATCTGCCGCC GGAGCAATTA CTGCGGCCTG CCGGCGACCC GGCGGCCCTG GAGGTATGGC CGGAGCTATT ACCCCTATTG ACTGCCGGCC TGGAAAGGGG CGAGCGCATC CTCCTCTATG CCGCTACCGG CAGAATTCCC TGGCTGGTAA GCTGGCTTGC CGGCGCCCTG CCCGGCGTAC CCCTGGCGGT GGATACTTAT AGTGACTACC GGCAGTTTGC CCTGGCCCGG GAAGGAGCCC TGGCCGGCCG GCTCCCCCTG CTGGTGGCCC GGCGGGAGGT TCCGGCGTGG TTTTACCCGG CCGACCTGGT GGTCTTTACC TATCTTCCCG ACAGCCTGGA GGAGGTGGAG CTGGCCCTCC CTCCGGGTGA AAAAGTGCCC CGGGTGGCTA TTCTCCTGGC GGCTGGGGAC AGGCCGGTCC CGGATTTGCG TCAGGAGCTG GCGGTATTTT ACCGCCGGTT ACAGAAGCTG CTGGGCAACG GCCGCGGTCT TTACATAGTT AATAATAAGG GATATCATCA ATTATGTTAC CTAGCTATCT TTGAAGAACT CGGGCTGATC CAGGTCACCA GCCGGGATCA GGGTTTATTT ATCCAGCCAT TGGAAGTAGA AACCAAACGC GACCTGATGG CTTCCAGACG CTACCGGCAG CTCCGGGCCG AAAGGGAACT GGCCCGGCAA TTCAGGCGAC AACTGCCGGG AGGGGAGGTG CGGTAA
|
Protein sequence | MPPVNLPERL ALARELHISP ITAQILINRG ITTAAAARAF FQPDPANLIP PEKIPGLPAA RERLVQAIEK REKIMVHGDY DVDGLAATAI MLETLARLGV EAEVYLPDRL TEGYGLKKKS LIKALELNCQ LVITVDCGIT SLEEAIFARE QGLDLIITDH HRPGTSLPQA SAVVNPLLAP GLPPLCGAGV AFKLAQSLAT HFGLAPQGGV AAGWALDLVA LATIADAVPL LGENRLLVQL GLKALSGSMR PGLKALAEVA GLPAREWTAR EVAFGLVPRL NACGRLGNAM PALEILLTSS PQRALELAQR LQEENQARRH LEESIAAEAE AMAVTALAAG AKGLVLAAEG WHPGVTGIIA ARLVEKYNCP VVLIAVVGDR GRGSGRSLPG INLHEIFARC RSYLLSFGGH ARAGGLEIAA KEIPAFQAAF NEIVKSLMAE VQTPPEVAPE AEVLVSQLDW QLLEELEQLA PFGEGNPRPV LVTRRALIKA ARQVGRDGAH LKLTAGGEGR EIGAIGFNLS LPPGLSPGHH VDLAFYLERN TYRDREELQL RLVALKAAET GATLPERAEG MVAATGEDGP APTWGRQLQE LLATYGPRVR VCLATTAAVR QAYNGCRRFF QVPETLQPLG PWLGRAGVER VLSRARGLIT CSPFWPVTPA GKEALLASPL AAADLPPEQL LRPAGDPAAL EVWPELLPLL TAGLERGERI LLYAATGRIP WLVSWLAGAL PGVPLAVDTY SDYRQFALAR EGALAGRLPL LVARREVPAW FYPADLVVFT YLPDSLEEVE LALPPGEKVP RVAILLAAGD RPVPDLRQEL AVFYRRLQKL LGNGRGLYIV NNKGYHQLCY LAIFEELGLI QVTSRDQGLF IQPLEVETKR DLMASRRYRQ LRAERELARQ FRRQLPGGEV R
|
| |