Gene Cthe_0776 details

Gene Information

Locus tag: Cthe_0776
Symbol:
ID: 4810394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 936992
End bp: 939259
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 37%
IMG OID: 640106193
Product: DNA mismatch repair protein MutL
Protein accession: YP_001037204
Protein GI: 125973294
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
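
The coordinate and length fields above are related by simple arithmetic. A minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (the variable names are illustrative and not part of the record):

```python
# Values copied from the gene information above.
start_bp = 936992
end_bp = 939259

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2268 755
```

Under these assumptions the computed values agree with the listed gene length (2268 bp) and protein length (755 aa).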


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.287224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAGAA TCATTATACT CGATGAAAAT ACGGCAAATC AGATTGCGGC CGGAGAAGTG 
GTGGAAAGGC CGGCTTCCGT GGTAAAAGAG CTTGTTGAAA ACTCTATTGA TGCCGGAAGT
ACCAATATAT CGGTGGAAAT AAACAATGGC GGAATATCTT TTATAAAGGT GGTTGACAAC
GGAAGCGGAA TCGAAGAGGA CGATATTGAA ATTGCCTTTG AAAGACATGC TACCAGCAAG
ATAAGAAGGG CAAGTGACCT TGAGGCGATT ACCTCCCTTG GATTTAGAGG TGAGGCTCTT
GCCAGTATTG CCTCTGTTTC CACCGTTGAG GTAACTTCAA GACCGGCCCA CAGGGAGTAC
GGAAGGTATG TAAAAATCCA GGGCGGAACT GTTTTAGAGT CCGGCCAGGT AGGATGTCCT
GCCGGTACAA CATTTATAGT AAGAGATCTT TTTTACAACA CTCCTGCCAG ATTTAAATTT
TTGAAAAAGG ATTCCACCGA GGCTGGATAT GTTTCCGATA TAGTAAGCAG AATTGCTCTT
GGAAATCCCG ATATTTCCTT CAGACTAATC AACAACAAAA ACACTGTTAT TCATACCCCG
GGCAATAACG ATCTTTTGAG TACCATATAC AGTTTGTACG GAAAAGAGAC TGCAAAAGAA
TGCATGGAGA TTTCCTATGA GGATGAGACC GTAAAGATAA CCGGATATGC TGGAAGTCCT
GAAATAGCAA GAGCAAACAG GAATTATCAG TCCATATATT TAAACAAAAG ATATATAAAA
AACAAGGTTA TATCTTCGGC GATTGACGAA GCATATAAAA CATATCTTAT GAAAAACAAA
TTTGCTTTTA TTGTGCTGTA TATAGAATTA AATCCGCTGT TGGTGGATGT CAATGTGCAT
CCTACAAAAA TGGAGGTAAG GTTTTCGAGG GAACAGGAGA TATTCAGAGC CGTTTACCAC
GCCGTAAACA ATGCTCTGCT CAGTAAAACC CATATAAGAA ATGTTTCCCT TAAAGACAGT
CCGAAGAATT ATTTCAAATT TGAGCAGTCT TCAAAAAAGG AAGCCGACTA TGTGCAGCAG
AGGCTGGATA CGGACAGGAA GTTTTCAGGA TACAATTATG ATGAAGATTT TAAAACTGAA
AAGACCAATA CGTCTAAAAG CACGTGGGAA AATTTAATTG TTAAAGAGAG CGCCAATATT
AATAAACGAA AAGATGAAAC TATAGATGAA GTTATAAACA AAAATGAAGT TGGAGATAAA
AATAAAGTTA TAAGTGAAGA TGAAGTTATA AATAAAGATG AAGTTATAAA TAAAGACAAA
GTTATAAATA AAGATAAAGT TATAAATAAA GGTGAAGTTG TAAATAAGGT TAATGTAAAC
GAAGTTGAAA ACGAGTCGGT TGATAATTTG ATTAACGGGC AAATCAGTGG CTTGGTTAAT
TGGGCAATTA ATGAGCCAAT TAACAAGACT AATGACAAGC TGATTGAAAA ATCTGATGGC
ACAGCTTTAA AGGGCACTGG CGAAGAGTGT TATAATTTTG ACAAAAGCGG TTATGATAAT
ATTTTAAAAG ATGTCAGCGA TAAGCCTAAA GATAATCGTA ATGATGTCGA TAATAATGCT
GATACGAATT TTGAGAAAAT CCGTGACGGT AAAGATACCC GGCCGCAGCA GGATATTGAA
CGAAATGTAT TTCTTGATGC CAGAATAATC GGACAGGTTT TCTCAACATA TATTCTGCTT
CAGAACGAAG ATGACCTGAT AATTATTGAT CAGCATGCGG CCCATGAAAG AATACGTTTT
GAAGAGCTCA AAGAAAAGTA TGCGAGAAAT GAGAGTCTCG CGCAGTACCT TTTGACACCT
GTGGTTATAG AGCTTACAAA CCAGGAAATT GTTTTTCTTG AAGAAGAAAA AGAATTATTT
AATAAATTAG GTTTTATTTT CGAAAGCTTT GGCAATAATT CTATTATACT TCGTTCGGTG
CCGATCCCGG ACGAGGGTGT CGGCGTTAAA GAAGCCTTTT TGGAAGTTGT GGATTTTTTA
ATGTCAAAGG GCAGGAAATA TGATAAAATT ATTGAGGAAG ATGCATTATA CCAGATAGCA
TGCAAGTCGG CGGTAAAAGC AAACAAGAAA CTTGATGAAA TCGAAATAAA AGCCATTTTG
GACAAGCTCA ACATGCTTCA AAATCCATAT ACTTGTCCTC ACGGGCGACC GACTGTTGTT
AAGATTACAA AATATGAATT TGAAAAAATG TTTAAAAGAA TAGTTTAA
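
The GC content reported in the gene information can in principle be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as text in the blocked layout shown above; `gc_percent` is a hypothetical helper name, and the demo string is only the first line of the sequence, not the whole gene:

```python
def gc_percent(blocked_seq: str) -> float:
    """Return the percentage of G and C bases in a whitespace-blocked sequence."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demo on the first line of the gene sequence only (60 bases).
first_line = "ATGGGGAGAA TCATTATACT CGATGAAAAT ACGGCAAATC AGATTGCGGC CGGAGAAGTG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` would give the value to compare against the GC content field.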
 
Protein sequence
MGRIIILDEN TANQIAAGEV VERPASVVKE LVENSIDAGS TNISVEINNG GISFIKVVDN 
GSGIEEDDIE IAFERHATSK IRRASDLEAI TSLGFRGEAL ASIASVSTVE VTSRPAHREY
GRYVKIQGGT VLESGQVGCP AGTTFIVRDL FYNTPARFKF LKKDSTEAGY VSDIVSRIAL
GNPDISFRLI NNKNTVIHTP GNNDLLSTIY SLYGKETAKE CMEISYEDET VKITGYAGSP
EIARANRNYQ SIYLNKRYIK NKVISSAIDE AYKTYLMKNK FAFIVLYIEL NPLLVDVNVH
PTKMEVRFSR EQEIFRAVYH AVNNALLSKT HIRNVSLKDS PKNYFKFEQS SKKEADYVQQ
RLDTDRKFSG YNYDEDFKTE KTNTSKSTWE NLIVKESANI NKRKDETIDE VINKNEVGDK
NKVISEDEVI NKDEVINKDK VINKDKVINK GEVVNKVNVN EVENESVDNL INGQISGLVN
WAINEPINKT NDKLIEKSDG TALKGTGEEC YNFDKSGYDN ILKDVSDKPK DNRNDVDNNA
DTNFEKIRDG KDTRPQQDIE RNVFLDARII GQVFSTYILL QNEDDLIIID QHAAHERIRF
EELKEKYARN ESLAQYLLTP VVIELTNQEI VFLEEEKELF NKLGFIFESF GNNSIILRSV
PIPDEGVGVK EAFLEVVDFL MSKGRKYDKI IEEDALYQIA CKSAVKANKK LDEIEIKAIL
DKLNMLQNPY TCPHGRPTVV KITKYEFEKM FKRIV
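
The protein sequence corresponds to the gene sequence translated codon by codon. A minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code; the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, and the demo uses only the first and last lines of the gene sequence:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Each full sequence line holds 60 bases (20 codons), so every line starts in frame.
first_line = "ATGGGGAGAA TCATTATACT CGATGAAAAT ACGGCAAATC AGATTGCGGC CGGAGAAGTG"
last_line = "AAGATTACAA AATATGAATT TGAAAAAATG TTTAAAAGAA TAGTTTAA"
print(translate(first_line))  # MGRIIILDENTANQIAAGEV
print(translate(last_line))   # KITKYEFEKMFKRIV
```

The two outputs match the first 20 and last 15 residues of the protein sequence above; the final TAA codon is the stop codon and contributes no residue.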