Gene Information |
Locus tag | GWCH70_1203 |
Symbol | mutL |
ID | 7979303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1256545 |
End bp | 1258404 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644798155 |
Product | DNA mismatch repair protein |
Protein accession | YP_002949328 |
Protein GI | 239826704 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
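The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1256545
end_bp = 1258404

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1860
print(protein_length)  # 619
```

Under these assumptions the results agree with the listed Gene Length (1860 bp) and Protein Length (619 aa).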
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.288753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGAAAAA TTCGTAAGCT CGATGACCAA TTGTCCAATA AAATCGCCGC AGGGGAAGTC GTCGAGCGTC CTGCCTCCGT CGTTAAAGAG CTCGTTGAAA ACGCGGTTGA CGCCAATAGC ACCATTATTG AAATTGAATT AGAAGAAGCA GGACTAACGA AAATTCGCGT CATCGATAAT GGGGACGGAA TGGAAGAAGA TGATTGCCTC GTTGCTTTTG AAAGACATGC GACAAGCAAA ATTAAAGATG AGCACGATTT GTTTCGCATC CGCACGCTTG GCTTTCGCGG CGAAGCGCTG CCGAGCATCG CCTCCGTCTC GGAAGTTGAG ATGAAAACAA GCACGGGAGA CGGTCCAGGG ACGAAAGTGG TTCTAAAAGG CGGAAAACTC GTTGTACACG AACGGACAAC AAGCCGCAAG GGAACCGATA TTACTGTATC TAACTTGTTT TTCAACACCC CGGCCCGTTT AAAATATATG AAAACGATTC ATACCGAGCT TGGTCATGTC ACCGATGTCG TCAACCGCCT TGCCATGGCG CATCCTGATA TTTCATTTCG CCTGCGCCAT CACGGGAAGC AGCTGCTTTA CACGAGCGGC AACGGCGATG TACGCCATGT GCTTGCCGCG ATTTACGGCA TGGATGTCGC GAAAAAGATG ATTCCAATTC AAGCGGAATC GCTCGATTTT ACCGTTCAAG GCTACATTTC GCTTCCAGAA GTGACGCGCG CTTCGCGAAA TTACATTTCC ACGATCGTCA ACGGGCGATA TGTGCGCAAC ATTCCGCTCG CAAAAGCGAT CGAGGCGGGA TATCATACAC TGTTGCCGAT CGGCCGCTAT CCGATTGTAT TTTTATCGAT TGCGATGGAT CCGATTTTAG TCGATGTCAA TGTCCACCCG GCAAAATTAG AAGTACGTTT CAGCAAAGAA GCGGAATTAA ACGAGCTCGT CACCCAAGCG ATTCGCCAGG CGCTTCAAGC GCGGACGCTT ATTCCTGAGA TGATGATCAA GCAAAAAGAA ACTCCAAAAC CAAAAGCAGA ACAAACGGCT TGGACGTTTG AACATGTCGT TAAAGAACCA TTTGTTTCTC CACTCGTTCA TGTAGATGAA CCAAAACAAG TAGATGAGCC AAAGCAATCA AGTCCAGTGC AAGAACCAAA GGAGGAAATC CCTTCGTTTT TGCCGACAGT AGAATCGAAA CAAAATGATG TCGATGATGA ACTAGTCGAA ATGGACGAGC AAACGGAATC ATCAGACGAA CAAGAGCATG TGAATGACCG GCTTCCGCCG CTTTATCCAA TCGGGCAAAT GCACGGAACG TACATTTTGG CGCAAAATGA GAGAGGGCTT TACATCATTG ACCAGCATGC CGCCCAAGAG CGCATTAAGT ATGAATATTT TCGCGAAAAA GTTGGCGAAG TAATAAACGA AGTGCAGGAA TTGCTTGTTC CGCTTACATT TCATTATCCG ACAGACGAAT ATGTGTTAAT TGACGCACAT CGTGAAGAAT TGGCGAAATG CGGCGTCTTT TTAGAACCGT TCGGGCACAA TACGTTCATC GTCCGCTCCC ACCCATCGTG GTTTCCAAAA GGGGAAGAAG CGGAGATTAT TGAAGAAATG ATCCAGCAAG TGATCGATAT GAAAAAAGTC GATATGAAGC AGCTTCGTGA AAAAGCAGCG ATTTTAATGA GCTGCAAACG TTCGATTAAA GCGAACGAGT ATTTGCGCGA TGACGAAATC TTCGCACTCT TGGAATCACT GCGCAAAACG ACTGATCCAT TCACCTGCCC GCACGGCCGC 
CCAATCATCA TCCATTTTTC CACGTATGAG CTGGAGAAAA TGTTTAAGAG AGTGATGTAA
Protein sequence | MGKIRKLDDQ LSNKIAAGEV VERPASVVKE LVENAVDANS TIIEIELEEA GLTKIRVIDN GDGMEEDDCL VAFERHATSK IKDEHDLFRI RTLGFRGEAL PSIASVSEVE MKTSTGDGPG TKVVLKGGKL VVHERTTSRK GTDITVSNLF FNTPARLKYM KTIHTELGHV TDVVNRLAMA HPDISFRLRH HGKQLLYTSG NGDVRHVLAA IYGMDVAKKM IPIQAESLDF TVQGYISLPE VTRASRNYIS TIVNGRYVRN IPLAKAIEAG YHTLLPIGRY PIVFLSIAMD PILVDVNVHP AKLEVRFSKE AELNELVTQA IRQALQARTL IPEMMIKQKE TPKPKAEQTA WTFEHVVKEP FVSPLVHVDE PKQVDEPKQS SPVQEPKEEI PSFLPTVESK QNDVDDELVE MDEQTESSDE QEHVNDRLPP LYPIGQMHGT YILAQNERGL YIIDQHAAQE RIKYEYFREK VGEVINEVQE LLVPLTFHYP TDEYVLIDAH REELAKCGVF LEPFGHNTFI VRSHPSWFPK GEEAEIIEEM IQQVIDMKKV DMKQLREKAA ILMSCKRSIK ANEYLRDDEI FALLESLRKT TDPFTCPHGR PIIIHFSTYE LEKMFKRVM
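The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch ignores), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGGAAAAA TTCGTAAGCT CGATGACCAA"))  # MGKIRKLDDQ
print(translate("AAGAGAGTGATGTAA"))                   # KRVM
```

The two outputs correspond to the first ten and last four residues of the listed protein sequence.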