Gene Moth_1681 details

Gene Information

Locus tag: Moth_1681
Symbol:
ID: 3831952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1718863
End bp: 1721598
Gene Length: 2736 bp
Protein Length: 911 aa
Translation table: 11
GC content: 62%
IMG OID: 637829606
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_430526
Protein GI: 83590517
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
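
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1718863
end_bp = 1721598

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2736
print(protein_length)  # 911
```

Under these assumptions the computed values agree with the listed gene length (2736 bp) and protein length (911 aa).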


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.674318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0670138
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCAG TAAACCTGCC GGAGAGGCTG GCCCTGGCCC GGGAGTTACA TATCTCACCT 
ATAACAGCGC AAATTCTCAT TAACCGGGGC ATAACTACAG CGGCAGCGGC CAGGGCTTTT
TTCCAGCCGG ACCCGGCCAA TCTTATCCCG CCGGAGAAAA TTCCCGGCCT GCCGGCGGCC
CGGGAGCGCC TGGTCCAGGC CATAGAAAAG CGGGAAAAAA TTATGGTCCA TGGTGACTAT
GATGTCGACG GTCTGGCGGC CACGGCCATT ATGCTGGAAA CCCTAGCGCG CCTGGGGGTT
GAGGCTGAAG TTTACCTGCC CGACCGTCTA ACGGAGGGTT ATGGTTTAAA AAAGAAGAGC
CTCATTAAAG CCCTGGAGTT AAATTGTCAG CTGGTCATAA CGGTAGACTG TGGCATTACT
TCCCTGGAGG AGGCCATCTT CGCCCGGGAA CAAGGTCTGG ACCTGATTAT TACCGACCAC
CACCGACCCG GGACAAGCCT GCCCCAGGCG AGTGCCGTAG TTAATCCCCT GCTGGCACCG
GGCCTGCCTC CCCTATGCGG CGCCGGGGTG GCCTTCAAAC TAGCCCAGTC CCTGGCGACC
CACTTTGGTC TGGCACCCCA GGGAGGTGTG GCCGCCGGCT GGGCCCTGGA CCTGGTAGCC
CTGGCGACTA TCGCTGACGC CGTACCCCTC CTGGGAGAAA ACCGCCTCCT GGTCCAGCTG
GGTTTAAAGG CCCTGTCCGG TAGCATGCGG CCTGGCTTGA AAGCCCTGGC AGAGGTGGCA
GGTCTACCGG CCAGGGAGTG GACGGCCCGG GAAGTGGCCT TTGGCCTTGT TCCCCGTTTG
AATGCCTGCG GCCGCCTGGG AAATGCCATG CCGGCCCTGG AAATTTTGCT AACTTCTTCT
CCCCAGCGGG CCCTGGAACT GGCCCAGCGC CTGCAGGAGG AAAACCAGGC CCGGAGACAC
CTGGAGGAGA GCATTGCCGC CGAGGCTGAA GCCATGGCCG TGACGGCTTT AGCTGCAGGA
GCAAAAGGCC TGGTCCTGGC GGCCGAGGGC TGGCACCCGG GAGTAACTGG CATTATCGCC
GCCCGCTTGG TGGAAAAATA TAACTGTCCG GTTGTTTTAA TAGCCGTGGT AGGCGATAGG
GGCCGGGGTT CGGGACGTAG CCTCCCGGGT ATCAATCTCC ATGAAATCTT CGCCAGGTGC
CGTTCCTACC TCCTGTCCTT TGGCGGTCAT GCCCGGGCAG GGGGACTGGA AATCGCCGCC
AAAGAAATAC CGGCTTTCCA GGCCGCCTTT AATGAAATAG TAAAGTCACT CATGGCTGAG
GTTCAAACGC CTCCCGAAGT AGCGCCTGAG GCGGAGGTCC TTGTCAGCCA GCTGGACTGG
CAGCTCCTGG AGGAATTAGA GCAGCTGGCC CCCTTTGGCG AAGGGAATCC CCGGCCCGTC
CTGGTTACCC GCCGCGCCCT GATTAAGGCC GCCCGCCAGG TGGGCAGGGA CGGCGCCCAC
CTGAAGCTGA CTGCCGGCGG GGAAGGTAGG GAAATTGGGG CCATTGGCTT TAACCTCTCC
CTGCCGCCGG GGTTGTCTCC CGGGCATCAT GTTGACCTGG CCTTTTACCT GGAACGCAAC
ACCTACCGGG ACCGGGAGGA ACTGCAGCTA AGGCTCGTGG CCCTGAAAGC GGCGGAAACA
GGAGCAACAC TACCGGAACG GGCGGAAGGA ATGGTGGCGG CCACCGGTGA AGACGGCCCG
GCACCGACCT GGGGCCGGCA GCTCCAGGAA TTGCTTGCTA CCTATGGCCC ACGGGTACGA
GTATGCCTGG CGACGACGGC GGCTGTACGC CAGGCCTATA ACGGCTGCAG GCGTTTTTTC
CAGGTGCCGG AAACCCTGCA ACCCCTGGGC CCCTGGCTGG GCCGGGCCGG GGTGGAACGG
GTTTTGAGCC GGGCCCGGGG GTTAATTACC TGCAGCCCCT TCTGGCCGGT AACACCCGCC
GGGAAAGAGG CGCTCCTGGC CTCGCCCCTG GCGGCCGCTG ATCTGCCGCC GGAGCAATTA
CTGCGGCCTG CCGGCGACCC GGCGGCCCTG GAGGTATGGC CGGAGCTATT ACCCCTATTG
ACTGCCGGCC TGGAAAGGGG CGAGCGCATC CTCCTCTATG CCGCTACCGG CAGAATTCCC
TGGCTGGTAA GCTGGCTTGC CGGCGCCCTG CCCGGCGTAC CCCTGGCGGT GGATACTTAT
AGTGACTACC GGCAGTTTGC CCTGGCCCGG GAAGGAGCCC TGGCCGGCCG GCTCCCCCTG
CTGGTGGCCC GGCGGGAGGT TCCGGCGTGG TTTTACCCGG CCGACCTGGT GGTCTTTACC
TATCTTCCCG ACAGCCTGGA GGAGGTGGAG CTGGCCCTCC CTCCGGGTGA AAAAGTGCCC
CGGGTGGCTA TTCTCCTGGC GGCTGGGGAC AGGCCGGTCC CGGATTTGCG TCAGGAGCTG
GCGGTATTTT ACCGCCGGTT ACAGAAGCTG CTGGGCAACG GCCGCGGTCT TTACATAGTT
AATAATAAGG GATATCATCA ATTATGTTAC CTAGCTATCT TTGAAGAACT CGGGCTGATC
CAGGTCACCA GCCGGGATCA GGGTTTATTT ATCCAGCCAT TGGAAGTAGA AACCAAACGC
GACCTGATGG CTTCCAGACG CTACCGGCAG CTCCGGGCCG AAAGGGAACT GGCCCGGCAA
TTCAGGCGAC AACTGCCGGG AGGGGAGGTG CGGTAA
 
Protein sequence
MPPVNLPERL ALARELHISP ITAQILINRG ITTAAAARAF FQPDPANLIP PEKIPGLPAA 
RERLVQAIEK REKIMVHGDY DVDGLAATAI MLETLARLGV EAEVYLPDRL TEGYGLKKKS
LIKALELNCQ LVITVDCGIT SLEEAIFARE QGLDLIITDH HRPGTSLPQA SAVVNPLLAP
GLPPLCGAGV AFKLAQSLAT HFGLAPQGGV AAGWALDLVA LATIADAVPL LGENRLLVQL
GLKALSGSMR PGLKALAEVA GLPAREWTAR EVAFGLVPRL NACGRLGNAM PALEILLTSS
PQRALELAQR LQEENQARRH LEESIAAEAE AMAVTALAAG AKGLVLAAEG WHPGVTGIIA
ARLVEKYNCP VVLIAVVGDR GRGSGRSLPG INLHEIFARC RSYLLSFGGH ARAGGLEIAA
KEIPAFQAAF NEIVKSLMAE VQTPPEVAPE AEVLVSQLDW QLLEELEQLA PFGEGNPRPV
LVTRRALIKA ARQVGRDGAH LKLTAGGEGR EIGAIGFNLS LPPGLSPGHH VDLAFYLERN
TYRDREELQL RLVALKAAET GATLPERAEG MVAATGEDGP APTWGRQLQE LLATYGPRVR
VCLATTAAVR QAYNGCRRFF QVPETLQPLG PWLGRAGVER VLSRARGLIT CSPFWPVTPA
GKEALLASPL AAADLPPEQL LRPAGDPAAL EVWPELLPLL TAGLERGERI LLYAATGRIP
WLVSWLAGAL PGVPLAVDTY SDYRQFALAR EGALAGRLPL LVARREVPAW FYPADLVVFT
YLPDSLEEVE LALPPGEKVP RVAILLAAGD RPVPDLRQEL AVFYRRLQKL LGNGRGLYIV
NNKGYHQLCY LAIFEELGLI QVTSRDQGLF IQPLEVETKR DLMASRRYRQ LRAERELARQ
FRRQLPGGEV R
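
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce the record: it assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs mainly in the start codons it permits, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Amino-acid assignments in TCAG codon order (standard code, as used by
# translation table 11 for internal codons).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in-frame codons until the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence as listed in the record.
first_row = "ATGCCTCCAG TAAACCTGCC GGAGAGGCTG GCCCTGGCCC GGGAGTTACA TATCTCACCT"
last_row = "TTCAGGCGAC AACTGCCGGG AGGGGAGGTG CGGTAA"

print(translate(first_row))  # MPPVNLPERLALARELHISP
print(translate(last_row))   # FRRQLPGGEVR
```

Each full row of the gene sequence holds 60 bases (20 codons), so row boundaries fall on codon boundaries and individual rows can be translated on their own. Applying `gc_fraction` to the complete gene sequence would give the value that the record reports, rounded, as GC content.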