Gene Mlg_1485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1485 
Symbol 
ID4269958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1696849 
End bp1699485 
Gene Length2637 bp 
Protein Length878 aa 
Translation table11 
GC content68% 
IMG OID638126243 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_742324 
Protein GI114320641 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGT CCAGTCCCAA GGCATCGTCG CGGCATACGC CCATGATGCG CCAGTTCCTG 
CGCATCAAGG CCGAACACCC GGACATCCTG CTCTTCTACC GGATGGGCGA TTTCTACGAA
CTATTCTACG AGGATGCCGA ACGGGCCGCG AAGCTGCTGG ACATCACCCT GACCACCCGC
GGGCAGTCCG CCGGCGAACC CATCCCCATG GCCGGCGTGC CGGTGCACGC GGTGGAGAGC
TATCTGGCGC GGCTGGTGCG CCAGGGCGAA TCGGTGGCCA TCTGCGAACA GATCGGCGAC
CCGGACAACA GCAAGGGCCC GGTGGAGCGG CAGGTGGTGC GTATCGTCAC CCCCGGCACC
CTGACTGACG AGGCCCTGCT CGAGGAACGT CAAAGCAACA TCCTGGCGGC CCTGAGCTGC
CATCAATCGC GCTGGGGGCT GGCCAGCCTG GAGCTCTCCA GCGGCCGGTT CAGCCTCACC
GAACCTGCCG ACGAGCAGGC CCTTGCCGCC GACCTGGAGC GGCTGAACCC CGCCGAGTTG
CTGGTGGACG AGGCGCTCAC CCTGCCCACG GGCCTGGCCG TGGGGCCGGG GCTCACCCGC
CGGCCACCCT GGCACTTCGA GCTCGATACC GCAACCGACC TACTCACAGA GCAGTTCGGC
ACCCGCGATC TGGCCGGCTT CGGCGCCCAG GACCACAGCG CAGGGCTCGC CGCCGCCGGC
GCCCTGCTCC AGTACGTCCG TGAGACCCAG CGCTCCGCCC TGCCCCATAT CCGCAGGCTG
CAGGTGGAAC ACGGCGATCA GGCCATCGTC ATCGACGCCG CCAGCCGCCG GAACCTTGAG
CTGGAACGCA ACCTCTCCGG GGGCACCGAA CACACCCTTG CCTCGGTGCT GGACAGTACC
GTCAACGCAA TGGGCAGCCG CCTGCTGCGC CGCTGGCTCA ACCGCCCGCT GCGCGACCGC
ACCACCCTGC AGGCCCGCCA CCAGGCCGTG GAGATCCTGA TGGCCGAGTC CCTGACCGAG
GCGCTTCGCC GCCAGCTCCG GGGAATCAGC GACGTGGAGC GCATCCTTGC CCGGGTGGCA
TTGGGCAGCG CCCGGCCCCG GGATCTCACT GGCCTGCGGG AGACGCTCGC CCGGCTCCCG
GACATCCAGG CCACCCTCAC CGGTGCCGGC GCTCCTCGGC TGGTGGACCT GGCGGCGCAG
TGCGGGGAAC ACCCGCAGAC CCTGGACCAC TTGCGCCGCG CCCTGGTGGA CCAGCCACCG
GTGGTGATCC GCGACGGCGG GGTCATCGCC GAAGGCTATG ATGCGACCCT GGATGAGTTG
CGGACGCTTT CAGAAAACGC CGATAACTAC TTGTTGGAAC TCGAACAACG CGAGCGGGAA
CGCACGGGGA TCAGCACCCT GAAGGTGGGT TATAACCGGG TACACGGCTA CTACATCGAG
GTCACCCGCG CCCAGGCTGA TGCGGTGCCC GCCGAGTATG TCCGACGCCA GACCCTCAAA
GGGGTGGAGC GCTATATCCT CCCTGAACTC AAGGCCTTCG AGGACAAAGT GCTGTCGGCA
CGTGAAAAGG CCCTGGCCCG CGAGAAGGTC CTCTATGAGC AATTGCTCGC CAGCCTGGCC
AGCGACCTGG CGCCGCTTCA GGACACCGCC GCGGCACTGG CGGAGCTGGA CACCCTGGCC
GCCTTCGCGG AACGCGCGCA GGCGCTGGAC TACTCTCGCC CGGAACTGCG CGATGGGGCG
GGACTGCGCA TTGAGGCCGG CCGGCACCCG GTGGTGGAAT ACAGCCTCGA CGGCCCCTTC
GTCCCCAACG ACCTGACACT GGACGACCGC AGGCGGATGC TGATCATCAC CGGGCCCAAC
ATGGGGGGCA AGTCCACCTA TATGCGCCAG GTGGCGCTGA TCACCCTGAT GGCGCATATC
GGCAGTTTTG TACCGGCGCG CGCCGCCAGC CTGGGCCCGG TGGATCGGAT CTTCACCCGG
ATCGGCGCCT CCGACGACCT GGCCGGTGGC CGCTCCACCT TCATGGTGGA GATGACCGAG
ACGGCGAACA TCCTGCACAA CGCCACCGCG CAAAGCCTGG TGTTAATGGA CGAAATCGGG
CGAGGCACCA GCACCTTCGA CGGCCTGGCC CTGGCCTGGG CGACCGCCGA ACGGCTGGCC
CGCGACCAGC GCGCCTACAC CCTTTTCGCC ACCCACTATT TCGAGATGAC AGCGCTGCCC
GAGCAATGTC CCGGTGCCAG CAACGTCCAC CTGGATGCCG TGGAGCACGG CGAGCGCATC
GTCTTTCTGC ATGCGGTCAA ACCCGGCCCG GCGAGCCAGA GCTACGGCCT CCAGGTGGCC
GCGTTGGCGG GCGTCCCCGG GCCGGTGCTG GAGGCCGCCC GCGAGAAGCT GCGGGCGCTC
GAAGAGGAGA GTTCCCGCCA GAGGGCCGAG CCGGATCAGC TCTCACTCTT CGCGGAACCG
GCGCCGCCCC CACCCCTGCC CAGTGCCGCC GAGCAGGCGC TGTCGGAGGT GGACCCGGAC
GAACTCTCAC CCCGCCAGGC CCTGGACCTG CTCTACCGTC TGAAGGCGTT GACCAGTGGC
GAGGAAGGGG CGGACAAAAA GGCGCGGGGC GATGCAGTCG ATGCCCGGAG TCGCTAG
 
Protein sequence
MAQSSPKASS RHTPMMRQFL RIKAEHPDIL LFYRMGDFYE LFYEDAERAA KLLDITLTTR 
GQSAGEPIPM AGVPVHAVES YLARLVRQGE SVAICEQIGD PDNSKGPVER QVVRIVTPGT
LTDEALLEER QSNILAALSC HQSRWGLASL ELSSGRFSLT EPADEQALAA DLERLNPAEL
LVDEALTLPT GLAVGPGLTR RPPWHFELDT ATDLLTEQFG TRDLAGFGAQ DHSAGLAAAG
ALLQYVRETQ RSALPHIRRL QVEHGDQAIV IDAASRRNLE LERNLSGGTE HTLASVLDST
VNAMGSRLLR RWLNRPLRDR TTLQARHQAV EILMAESLTE ALRRQLRGIS DVERILARVA
LGSARPRDLT GLRETLARLP DIQATLTGAG APRLVDLAAQ CGEHPQTLDH LRRALVDQPP
VVIRDGGVIA EGYDATLDEL RTLSENADNY LLELEQRERE RTGISTLKVG YNRVHGYYIE
VTRAQADAVP AEYVRRQTLK GVERYILPEL KAFEDKVLSA REKALAREKV LYEQLLASLA
SDLAPLQDTA AALAELDTLA AFAERAQALD YSRPELRDGA GLRIEAGRHP VVEYSLDGPF
VPNDLTLDDR RRMLIITGPN MGGKSTYMRQ VALITLMAHI GSFVPARAAS LGPVDRIFTR
IGASDDLAGG RSTFMVEMTE TANILHNATA QSLVLMDEIG RGTSTFDGLA LAWATAERLA
RDQRAYTLFA THYFEMTALP EQCPGASNVH LDAVEHGERI VFLHAVKPGP ASQSYGLQVA
ALAGVPGPVL EAAREKLRAL EEESSRQRAE PDQLSLFAEP APPPPLPSAA EQALSEVDPD
ELSPRQALDL LYRLKALTSG EEGADKKARG DAVDARSR