Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0776 |
Symbol | |
ID | 4810394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 936992 |
End bp | 939259 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106193 |
Product | DNA mismatch repair protein MutL |
Protein accession | YP_001037204 |
Protein GI | 125973294 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.287224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGAA TCATTATACT CGATGAAAAT ACGGCAAATC AGATTGCGGC CGGAGAAGTG GTGGAAAGGC CGGCTTCCGT GGTAAAAGAG CTTGTTGAAA ACTCTATTGA TGCCGGAAGT ACCAATATAT CGGTGGAAAT AAACAATGGC GGAATATCTT TTATAAAGGT GGTTGACAAC GGAAGCGGAA TCGAAGAGGA CGATATTGAA ATTGCCTTTG AAAGACATGC TACCAGCAAG ATAAGAAGGG CAAGTGACCT TGAGGCGATT ACCTCCCTTG GATTTAGAGG TGAGGCTCTT GCCAGTATTG CCTCTGTTTC CACCGTTGAG GTAACTTCAA GACCGGCCCA CAGGGAGTAC GGAAGGTATG TAAAAATCCA GGGCGGAACT GTTTTAGAGT CCGGCCAGGT AGGATGTCCT GCCGGTACAA CATTTATAGT AAGAGATCTT TTTTACAACA CTCCTGCCAG ATTTAAATTT TTGAAAAAGG ATTCCACCGA GGCTGGATAT GTTTCCGATA TAGTAAGCAG AATTGCTCTT GGAAATCCCG ATATTTCCTT CAGACTAATC AACAACAAAA ACACTGTTAT TCATACCCCG GGCAATAACG ATCTTTTGAG TACCATATAC AGTTTGTACG GAAAAGAGAC TGCAAAAGAA TGCATGGAGA TTTCCTATGA GGATGAGACC GTAAAGATAA CCGGATATGC TGGAAGTCCT GAAATAGCAA GAGCAAACAG GAATTATCAG TCCATATATT TAAACAAAAG ATATATAAAA AACAAGGTTA TATCTTCGGC GATTGACGAA GCATATAAAA CATATCTTAT GAAAAACAAA TTTGCTTTTA TTGTGCTGTA TATAGAATTA AATCCGCTGT TGGTGGATGT CAATGTGCAT CCTACAAAAA TGGAGGTAAG GTTTTCGAGG GAACAGGAGA TATTCAGAGC CGTTTACCAC GCCGTAAACA ATGCTCTGCT CAGTAAAACC CATATAAGAA ATGTTTCCCT TAAAGACAGT CCGAAGAATT ATTTCAAATT TGAGCAGTCT TCAAAAAAGG AAGCCGACTA TGTGCAGCAG AGGCTGGATA CGGACAGGAA GTTTTCAGGA TACAATTATG ATGAAGATTT TAAAACTGAA AAGACCAATA CGTCTAAAAG CACGTGGGAA AATTTAATTG TTAAAGAGAG CGCCAATATT AATAAACGAA AAGATGAAAC TATAGATGAA GTTATAAACA AAAATGAAGT TGGAGATAAA AATAAAGTTA TAAGTGAAGA TGAAGTTATA AATAAAGATG AAGTTATAAA TAAAGACAAA GTTATAAATA AAGATAAAGT TATAAATAAA GGTGAAGTTG TAAATAAGGT TAATGTAAAC GAAGTTGAAA ACGAGTCGGT TGATAATTTG ATTAACGGGC AAATCAGTGG CTTGGTTAAT TGGGCAATTA ATGAGCCAAT TAACAAGACT AATGACAAGC TGATTGAAAA ATCTGATGGC ACAGCTTTAA AGGGCACTGG CGAAGAGTGT TATAATTTTG ACAAAAGCGG TTATGATAAT ATTTTAAAAG ATGTCAGCGA TAAGCCTAAA GATAATCGTA ATGATGTCGA TAATAATGCT GATACGAATT TTGAGAAAAT CCGTGACGGT AAAGATACCC GGCCGCAGCA GGATATTGAA CGAAATGTAT TTCTTGATGC CAGAATAATC GGACAGGTTT TCTCAACATA TATTCTGCTT CAGAACGAAG ATGACCTGAT AATTATTGAT CAGCATGCGG CCCATGAAAG AATACGTTTT GAAGAGCTCA AAGAAAAGTA TGCGAGAAAT GAGAGTCTCG CGCAGTACCT TTTGACACCT GTGGTTATAG AGCTTACAAA CCAGGAAATT GTTTTTCTTG AAGAAGAAAA AGAATTATTT AATAAATTAG GTTTTATTTT CGAAAGCTTT GGCAATAATT CTATTATACT TCGTTCGGTG CCGATCCCGG ACGAGGGTGT CGGCGTTAAA GAAGCCTTTT TGGAAGTTGT GGATTTTTTA ATGTCAAAGG GCAGGAAATA TGATAAAATT ATTGAGGAAG ATGCATTATA CCAGATAGCA TGCAAGTCGG CGGTAAAAGC AAACAAGAAA CTTGATGAAA TCGAAATAAA AGCCATTTTG GACAAGCTCA ACATGCTTCA AAATCCATAT ACTTGTCCTC ACGGGCGACC GACTGTTGTT AAGATTACAA AATATGAATT TGAAAAAATG TTTAAAAGAA TAGTTTAA
|
Protein sequence | MGRIIILDEN TANQIAAGEV VERPASVVKE LVENSIDAGS TNISVEINNG GISFIKVVDN GSGIEEDDIE IAFERHATSK IRRASDLEAI TSLGFRGEAL ASIASVSTVE VTSRPAHREY GRYVKIQGGT VLESGQVGCP AGTTFIVRDL FYNTPARFKF LKKDSTEAGY VSDIVSRIAL GNPDISFRLI NNKNTVIHTP GNNDLLSTIY SLYGKETAKE CMEISYEDET VKITGYAGSP EIARANRNYQ SIYLNKRYIK NKVISSAIDE AYKTYLMKNK FAFIVLYIEL NPLLVDVNVH PTKMEVRFSR EQEIFRAVYH AVNNALLSKT HIRNVSLKDS PKNYFKFEQS SKKEADYVQQ RLDTDRKFSG YNYDEDFKTE KTNTSKSTWE NLIVKESANI NKRKDETIDE VINKNEVGDK NKVISEDEVI NKDEVINKDK VINKDKVINK GEVVNKVNVN EVENESVDNL INGQISGLVN WAINEPINKT NDKLIEKSDG TALKGTGEEC YNFDKSGYDN ILKDVSDKPK DNRNDVDNNA DTNFEKIRDG KDTRPQQDIE RNVFLDARII GQVFSTYILL QNEDDLIIID QHAAHERIRF EELKEKYARN ESLAQYLLTP VVIELTNQEI VFLEEEKELF NKLGFIFESF GNNSIILRSV PIPDEGVGVK EAFLEVVDFL MSKGRKYDKI IEEDALYQIA CKSAVKANKK LDEIEIKAIL DKLNMLQNPY TCPHGRPTVV KITKYEFEKM FKRIV
|
| |