Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0571 |
Symbol | |
ID | 4270901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 619294 |
End bp | 621204 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638125313 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_741415 |
Protein GI | 114319732 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.707368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00000000963067 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCAGGC CCATACAGCA ACTGCCCCTG CAGCTCGTCA ACCAGATCGC CGCCGGCGAG GTGGTGGAGC GCCCGGCGTC CGTGCTCAAG GAGCTGGTGG AGAACAGCCT GGATGCCGGT GCCCGGAGCC TTGCGGTGGA GTTGGAGCAG GGCGGCAAGC GCCTGATACG GGTGCGCGAT GATGGCAACG GCATCCCCCG CGAGCAGCTC GGCCTGGCGC TGCGGCGCCA CGCCACCAGT AAGATCACGT CGCTGAGTGA GCTGGAGCAG GTGGTCAGTC TGGGGTTTCG CGGTGAGGCG CTGCCGAGCA TCGGTGCGGT CTCCCGGCTC CGGCTGATCT CCCGGCCGCC CGGGGCTGAG CACGCCTGGG CGGTGCGCAC CGATGGCGAC AGCGAGCCCG CCGGGCCCGA GCCGGCGGCG CATCCGCCGG GGACCACCGT CGAGGTGCGT GACCTTTTCT TCAACACCCC CGGTCGACGC AAATTCCTGC GCACCGACCG CACCGAGTTC AGCCATGCCC AGGAGGCCCT GCGGCGCCTG GCGCTGGGGC GCTTCGACGT GGCCTTCCGG CTGCAGCACA ACGGGCGCAC CGTGCTCGAC CTGCCGCCGG CCGGTGACCG GGCCGGCGCC GAGCGGCGGC TGGGTGAGCT CCTCGGTGAG GGCTTCCTGG GTGAGTGTAT CCACCTGGAG TGTGCCGCCG CCGGCCTGAA GCTGAGCGGC TGGCTGGCGC TGCCCACCTT TTCCCGCAGC CAGGGGGATC TGCAGTATTT CTATGTCAAC GGCCGCATGA TCCGCGATCG CATGGCCGGC CACGCCCTGC GTCGGGCCTA CGCCGACGTG CTCTACCGCG ACCGCTTCCC CGCCTACCTG CTCTACCTGG ACCTGGACCC CGACCGGGTG GACGTGAACG TGCACCCCAC CAAGCACGAG GTGCGCTTCC GCGACAGTCG GTTGGTCTAT GATTTTCTGT TCCGACAGGT GCGGGAGGCG CTGGCTCGCG TCAGCCCTGC TACGGCGTCC GGGGTGCAGC CACCGCAGGG GTCGGTGGCG TCGGCCGAGG GGCCGCGCAG CTTGGCCGCT GCCGCGGGGG AACGGTGGGG TGGGGCCGCA CCAGCCGCTG GCGCCGCGCC AACGGCCCGG CGATCGCCAC ACCAGCACGG CCTGGGGTTG CCGCTGGAGG AGGCCCGGTT GCTCTATGGC GAGCGCAGCA AGGCGCACGC GGCCGGCCCG GCCGTCGCGT CTCCCTCCGG CGTCGTCCGG GACGCACCTG CCGGGGAGGC CGTCGCGTGT GAGACCGGAG GTGCGGGCGA TCCGACGGAA CGGGGTGGCC CGCCGCTGGG CCACGCCCTG GCCCAGGTCC ATGGGGTCTA CATCCTGGCC CAGAACGACC AGGGCCTGGT CTTGGTGGAC ATGCATGCCG CCCATGAGCG GGTGGTCTAC GAACGCATGA AGGCGCAACT CTCGGGCAGT GGCATCGCCA GCCAGGCCCT GCTGATGCCG GAGGGGCTGA GCGTGACGCC GGCGGAGGGC GAAGAGGTGG AGCGCGCCGG CGAGCGTTTC CGGCAACTGG GTTTTCAGGT GGACCGGGTG GCCCCGGACC GGGTGCTGGT CCGTGCGGTG CCGGCCCTGC TGGCCAATGC CGAGCCGGTC GCGCTGGTGC GTGACGTGCT GGCCGACCTC CGGACCCAGT CGCGTAGCCG CCAGGTGGAG GAGGCGCTGA ACCACGTCCT GGCCACCATG GCCTGCCACG GTTCCGTGCG CGCCAACCGG CGGCTGACCC TGCCGGAGAT GGACGCGCTG CTGCGCGAGA TGGAGGCCAC CCCGAACAGC GGCCAGTGCA ACCACGGCCG GCCGACGTGG ACCGTGCTGG ATATGGACGC CCTGGACCGG CTGTTCATGC GGGGGCAGTG A
|
Protein sequence | MVRPIQQLPL QLVNQIAAGE VVERPASVLK ELVENSLDAG ARSLAVELEQ GGKRLIRVRD DGNGIPREQL GLALRRHATS KITSLSELEQ VVSLGFRGEA LPSIGAVSRL RLISRPPGAE HAWAVRTDGD SEPAGPEPAA HPPGTTVEVR DLFFNTPGRR KFLRTDRTEF SHAQEALRRL ALGRFDVAFR LQHNGRTVLD LPPAGDRAGA ERRLGELLGE GFLGECIHLE CAAAGLKLSG WLALPTFSRS QGDLQYFYVN GRMIRDRMAG HALRRAYADV LYRDRFPAYL LYLDLDPDRV DVNVHPTKHE VRFRDSRLVY DFLFRQVREA LARVSPATAS GVQPPQGSVA SAEGPRSLAA AAGERWGGAA PAAGAAPTAR RSPHQHGLGL PLEEARLLYG ERSKAHAAGP AVASPSGVVR DAPAGEAVAC ETGGAGDPTE RGGPPLGHAL AQVHGVYILA QNDQGLVLVD MHAAHERVVY ERMKAQLSGS GIASQALLMP EGLSVTPAEG EEVERAGERF RQLGFQVDRV APDRVLVRAV PALLANAEPV ALVRDVLADL RTQSRSRQVE EALNHVLATM ACHGSVRANR RLTLPEMDAL LREMEATPNS GQCNHGRPTW TVLDMDALDR LFMRGQ
|
| |