Gene Moth_1113 details


Gene Information

Locus tag: Moth_1113
Symbol:
ID: 3833245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1139729
End bp: 1142320
Gene Length: 2592 bp
Protein Length: 863 aa
Translation table: 11
GC content: 61%
IMG OID: 637829041
Product: DNA mismatch repair protein MutS
Protein accession: YP_429970
Protein GI: 83589961
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
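
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1139729
END_BP = 1142320


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2592 863
```

Under these assumptions the result agrees with the Gene Length (2592 bp) and Protein Length (863 aa) fields.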


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000302948
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGGG AACAGGCCTT AACCCCCATG ATGGCGCAAT ACCGCCAGAT CAAGTCCCAG 
TACCCGGATT GCATCCTCTT CTTCCGCCTG GGAGATTTTT ATGAGATGTT TTATGAAGAC
GCCGAGGTGG CAGCCCGGGA ATTGGACCTG GTTTTGACGA CCCGGGGCGG TAAAGAGGCA
GCCCCCATGT GCGGGGTACC CTTTCACGCC GCCGACAGTT ACCTGGCCCG CCTGGTAGGT
AAGGGCTACA AGGTGGCTAT TTGCGAACAG ATGGAGGACC CCCGCCAGGC CAAGGGCCTG
GTACGCCGGG AGGTTATCCG GGTGGTTACC CCGGGTACGA TAACCGATGA GAAGGCCCTG
ACCCCCGGCG GAAATAACTA CCTGGCGGCG ATAGTGCGCT ATAACGGCTG CTGGGGGTTG
GCCTGGGCCG ATGCCTCCAC GGGGGAATTC TTGTTTACCA CCTGCCCGGA CCAGGAAACC
CTGGTGGACG AGCTGGTACG CCTCATGCCG TCGGAATATT TGCTGCCCGG GGAACTGGCC
CGGGATACGG CCCTAAAAAG GCTCCTGCAG ATTTACACCC GGGGCGTGAT TACCGGCTGG
CAAGTAGCCA GTAATCCGGA GGCAGCCCGG CAGTCCCTGG AGGACCATTT TGGCCACGAG
GCCCTGGCCG GGGTCGAATT ACCGGCAGCT GCGGGCCTGG CAGCGGCCAT GATTCTTTCT
TTCCTGGTTG CTACCCAGCA CAACTCCCTG GCCCACCTGG AGGCCCCGGC TGCCGCAGCC
ACAACCAGCC GGATGTTCCT GGATCAGGCT ACCAGGCGCA ACCTGGAGCT GGTGACTGCC
GGCCGGGAGC AAAAACGGGA GGGTTCCCTC CTCTGGGTCC TGGATAAGAC CCTCACAGCC
ATGGGTGCCC GGACCCTGAG GCGCTGGCTG GACCAGCCCT TGGTAGATGC CGGGGCCATC
AAGGAACGCC AGGAAGCCGT CGCCGAACTG GTTGAGGGCT TCATCCTGCG CCAGGAATTG
CGGGAGAGGC TGCAGGTGGT GCGGGACCTG GAACGCCTGG CCGGCAGGGT GGCCTATGGG
ACGGCAGGCG GGCGGGAACT CCAGGCCCTG CGGGGTTCCC TGGCCGTAAT CCCCGGTATC
CTGGAGCTGA TGAGCGAAGT ACATTCCAGG CTCCTGGCGC AGGTGAGGGC CCAACTGGAC
CCCCTTGACG ACCTGGTCGA TCTCCTGGGC CGGGCCCTGG TGGACGATCC CCCGGCCAGC
ATCACCGAGG GCGGCATTAT CCGTACCGGC TATAATGGAG AGGTCGATAA GCTGCGCGAG
GCGGCCACCC ACGGCCGGGA TTGGATTGCC AGCCTGGAAG CCGAGGAGCG GGAACGCACA
GGGATTAAAT CCCTCAAGGT CGGTTACAAC CGTGTCTTCG GTTACTACAT AGAGGTCACC
CGTCCCAACC TCCCCCAGGT GCCGGCCGAC TACGAGCGCA AACAGACCCT GGCCAATGCC
GAGCGTTTTG TTACCCGGCG CCTGCAGGAA CTGGAACGTC AGGTCCTGGG GGCAGAGGAG
AGACTGGTCC AACTGGAGTA TGAACTCTTT CAGGGCCTGC GGGAAGAGGT GCTGGGCGTC
CTGCCCCGTA TCCAGGCCAC GGCCCGGGCC CTGGGAGTAC TGGACGCCCT TATTTCCCTG
GCTACGGTGG CCGTAGATAA CAACTATACC TGTCCACGGG TCGATGACGG TACGGTGATT
GAAATCGAGC AGGGGCGCCA CCCGGTGGTG GAACTGGTCG GTTCTCCGGG GACCTTTGTC
CCCAACGACA CCTACCTGGA CCAGGAACAG TATATCCAGA TCATTACCGG GCCCAATATG
GCCGGTAAAT CCACCTATAT CCGCCAGGTA GCTCTGATTG TACTGCTGGC CCAGATTGGC
AGCTTTGTAC CGGCCCGGCG AGCCCATATC GGCCTGGTGG ACAGGATTTT TACCCGGGTC
GGTGCTGCCG ATGATATCTT TGCCGGCCAG AGCACCTTTA TGGTTGAGAT GCAGGAAGTG
GCCGGTATCT TGAAACACGC CACCAGGCGG AGCCTGGTTA TCCTTGATGA AGTGGGCCGG
GGTACAAGCA CTGCTGACGG TCTGAGTATT GCCCGGGCAG TAACGGAGTA TATTCACAAT
GTCATTGGTG CCCGCTGCCT CTTTGCCACC CACTACCACG AATTGGTTAG CCTGGCGGAG
GAATTGTCCG GGGTCCGCAA TTACTGCGTT GCCGTCCTGG AAGAGGGAGA GGACATTACC
TTCCTCCGGA CGATAGTGCC AGGTAGCACC GACAAGAGCT ACGGAATCCA TGTGGCCCGC
CTGGCGGGCC TGCCGGAGCA GGTCCTGGAA CGGGCGCGAG AAATTCTGGA ACAGCAACCC
AGGGCCCAGG TGCGGGTTTT AACGAAACCG GTAAAAAAGC CGACCCTTAC TCCCGGCGAG
GTACAGGTAC TGGAGGAACT GGCCGGTTAT AACCTGATGG CGGCCACGCC CCTGGAAGCC
ATGGAGCAGA TCTTCCGCTG GCAGAAGATG CTTCACAAAG AGCTGGATAT TGTTAAAAGG
AGCAGGGGAT GA
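
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first line of the record is shown here.
raw = """
ATGGCTAGGG AACAGGCCTT AACCCCCATG ATGGCGCAAT ACCGCCAGAT CAAGTCCCAG
"""
seq = clean_sequence(raw)
print(len(seq), round(100 * gc_fraction(seq)))
```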
 
Protein sequence
MAREQALTPM MAQYRQIKSQ YPDCILFFRL GDFYEMFYED AEVAARELDL VLTTRGGKEA 
APMCGVPFHA ADSYLARLVG KGYKVAICEQ MEDPRQAKGL VRREVIRVVT PGTITDEKAL
TPGGNNYLAA IVRYNGCWGL AWADASTGEF LFTTCPDQET LVDELVRLMP SEYLLPGELA
RDTALKRLLQ IYTRGVITGW QVASNPEAAR QSLEDHFGHE ALAGVELPAA AGLAAAMILS
FLVATQHNSL AHLEAPAAAA TTSRMFLDQA TRRNLELVTA GREQKREGSL LWVLDKTLTA
MGARTLRRWL DQPLVDAGAI KERQEAVAEL VEGFILRQEL RERLQVVRDL ERLAGRVAYG
TAGGRELQAL RGSLAVIPGI LELMSEVHSR LLAQVRAQLD PLDDLVDLLG RALVDDPPAS
ITEGGIIRTG YNGEVDKLRE AATHGRDWIA SLEAEERERT GIKSLKVGYN RVFGYYIEVT
RPNLPQVPAD YERKQTLANA ERFVTRRLQE LERQVLGAEE RLVQLEYELF QGLREEVLGV
LPRIQATARA LGVLDALISL ATVAVDNNYT CPRVDDGTVI EIEQGRHPVV ELVGSPGTFV
PNDTYLDQEQ YIQIITGPNM AGKSTYIRQV ALIVLLAQIG SFVPARRAHI GLVDRIFTRV
GAADDIFAGQ STFMVEMQEV AGILKHATRR SLVILDEVGR GTSTADGLSI ARAVTEYIHN
VIGARCLFAT HYHELVSLAE ELSGVRNYCV AVLEEGEDIT FLRTIVPGST DKSYGIHVAR
LAGLPEQVLE RAREILEQQP RAQVRVLTKP VKKPTLTPGE VQVLEELAGY NLMAATPLEA
MEQIFRWQKM LHKELDIVKR SRG
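
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. Because this gene starts with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with the amino acids of NCBI table 11
# (identical to the standard code for elongation; '*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace is ignored so block-formatted input can be pasted directly.
    Ambiguous bases such as N are not supported and raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCTAGGG AACAGGCCTT AACCCCCATG"))  # MAREQALTPM
```

The printed residues match the first block of the protein sequence above (MAREQALTPM).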