Gene Ccel_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1696 
Symbol 
ID7312269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2040083 
End bp2042080 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content37% 
IMG OID643608624 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_002506027 
Protein GI220929118 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.930861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACGCA TAATTGTACT GGATGAAAAT ACTTCAAATA AAATAGCTGC TGGAGAGGTT 
GTTGAAAAGC CAGCTTCTGT TGTAAAGGAA TTGGTGGAAA ACTCTATTGA TGCAGGTGCA
ACCAGCATCT CAGTTGATAT AAAGAATGGC GGTATATCTT ATATTAAAAT AGCCGACAAC
GGAATCGGCA TGGATGAGGA CGATGTGGAA ATTGCCTTTG AACGTCATGC CACCAGTAAA
ATTAAAAGGG CGGAGGATCT TGATTCCGTT ATAACAATGG GGTTCAGGGG GGAGGCTCTG
GCAAGTATAG CCTCGGTTGC ATCTGTTGAG CTTATGACAA AGACAGCTGC AAGTGCCTAC
GGAATGTATG TACATGTAAG AGGAGGAGTT TTACAAGACG TAAGGCAGAC AGGATGTCCT
GTCGGCACAA CGTTTATTAT TAAGGATTTA TTTTTCAATA CTCCTGCTCG TTACAAGTTT
TTGAAAAAGG ATTCTACCGA AGCAGGATAT ATTTCTGATA CAATATCGAG AATAGCTTTG
GGTAATCCGA ATATTTCTTT TAAACTGACA AACGGAAAAA CACCATTAAT TCATACCCCG
GGAAATAATG ACTTGAAAAG TGTTATTTAC AGTATTTACG GAAAAGAAAT TATAAAAAAC
CTTGTTCATA TAGAGTACGC TGACGACAAG GTAAAGATAA GCGGATATAT AGGGAAACCG
GAAGCTGCCA GATCAAACAG GAACTATCAA TCTCTCTATA TAAATAAAAG ATATGTGAAA
AGCAAACTGG TATCATATTC AGTTGAACAA GCCTTTTCAA GCATACTTAT GAAAAACAGG
TTTCCTTTTT TTGTATTAAA TATTGACATT AATCCTATAT TGGTAGATGC CAATGTACAC
CCTGCAAAAA TCGAGGTACG GTTTGCTGAC GAAAGCTATT TATCCAGAAC TATATATATG
GCTGTTTCCA ATGCTCTTAC TACAGGGGGA GGCCTGTTTA ATCCTGTATC AGTTCCTGAT
AAAGACAGAG AGCTGTTCAA GTTTGCAGAT AATTCCCAAC CTAAAAAGGA ATATGTCCAG
AATGAAATAC AATTAAATAA TAAGCAGGAG GAAAACAAAA AAGCCGATGA GATACGTTTG
TTTACAAAAG CTCTGGAGCC ATTGGCAAAG GTCGATGTAC ACAAAGTAAG TACAGCAGCG
GAAAAACAAC CGGCGGATAC TTCCTCCTTT ACTTTTACAA GGTCTGAAGA CTATAATGTC
GGACAACCAA AGAATCTAAT CACGAATGTT AAGCAGGAAA ATTCTGATGA GCTTAAAAAT
AATTCTCCCG GAATCAGGGA GGATGATTCC TCTCAGAACT TTGATGAAAC AATAAATAAA
CAAGATCAGG AAGTAAATAA AGAAAGGGTT TATACTGAAC TAGCTGACAT GAAATACATA
GGGCAGGCTT TTTCCACTTA TATTCTTTTA CAAAATAATG ATGAGCTTGT AATGGTAGAT
CAGCACGCAG CACACGAAAG AATAATATAT GAAAAACTCA GAGCAAAATT TGATTCACAG
GAAAACACAA CTCAGCTGTT ATTGGAGCCG GTAGTTATTC AACTCCAGCC TTTTGAAATT
GATACAATAA AAGCAAAGGA AAAGTTGCTG ACTGGTATTG GATTTGTTTA TGAGGATTTT
GGAAATAATA CCATTATTAT CAGAGGAATT CCATATATGG TAGGAGACTA CTCGCCCAGA
GATATTTTTA TTGAATTGAC ACAAAAACTT CAAGAATCAA TAAAACCTGT CAGCACACCT
TTAGCTGATG AAATAATTCA TACCATTGCA TGTAAGGCTG CTATAAAAGC AAATAAAAAA
CTTGATGAAA AAGAGGTTCA TCAGCTTTTG ACTGAGCTTT CCAATACCGG AAGACGATAT
ACCTGTCCTC ATGGACGTCC TACTGTTATA CGTCTGACAA AAAACGAGAT AGAAAAAATG
TTTAAAAGAA TTGTCTAG
 
Protein sequence
MGRIIVLDEN TSNKIAAGEV VEKPASVVKE LVENSIDAGA TSISVDIKNG GISYIKIADN 
GIGMDEDDVE IAFERHATSK IKRAEDLDSV ITMGFRGEAL ASIASVASVE LMTKTAASAY
GMYVHVRGGV LQDVRQTGCP VGTTFIIKDL FFNTPARYKF LKKDSTEAGY ISDTISRIAL
GNPNISFKLT NGKTPLIHTP GNNDLKSVIY SIYGKEIIKN LVHIEYADDK VKISGYIGKP
EAARSNRNYQ SLYINKRYVK SKLVSYSVEQ AFSSILMKNR FPFFVLNIDI NPILVDANVH
PAKIEVRFAD ESYLSRTIYM AVSNALTTGG GLFNPVSVPD KDRELFKFAD NSQPKKEYVQ
NEIQLNNKQE ENKKADEIRL FTKALEPLAK VDVHKVSTAA EKQPADTSSF TFTRSEDYNV
GQPKNLITNV KQENSDELKN NSPGIREDDS SQNFDETINK QDQEVNKERV YTELADMKYI
GQAFSTYILL QNNDELVMVD QHAAHERIIY EKLRAKFDSQ ENTTQLLLEP VVIQLQPFEI
DTIKAKEKLL TGIGFVYEDF GNNTIIIRGI PYMVGDYSPR DIFIELTQKL QESIKPVSTP
LADEIIHTIA CKAAIKANKK LDEKEVHQLL TELSNTGRRY TCPHGRPTVI RLTKNEIEKM
FKRIV