Gene Ccel_1470 details


Gene Information

Locus tag: Ccel_1470
Symbol:
ID: 7310239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 1782650
End bp: 1785028
Gene Length: 2379 bp
Protein Length: 792 aa
Translation table: 11
GC content: 42%
IMG OID: 643608396
Product: MutS2 family protein
Protein accession: YP_002505804
Protein GI: 220928895
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
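
The coordinate and length fields above are consistent with each other, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; both assumptions are inferred from the numbers rather than stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1782650
end_bp = 1785028

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # compare with the Gene Length field
print(protein_length_aa, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the recorded Gene Length (2379 bp) and Protein Length (792 aa).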


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.471316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGAAA AAACTCACAG AGTACTTGAA TTTGATAAAA TTCTCGATAA ATTAAAAGGC 
TTAACCGCAT CTGAATTGGG GAGGGAGCTT GTTTTAGAGC TAACTCCTCA AACTGATTAC
AGAGTGGTTG AAAAAATGCT GTCAGAAACA AATGACGGTG TAAGTTGTAT AATGAGAAGG
GGTTCACCAC CTCTTGGAGG AATTACAGAT ATCAGGATGA GCCTAAAAAG GCTTGATATG
GGAGGCGTAC TGAATCCCGG AGAATTACTG CGGCTGGCAG GAGTTTTAAG AGCCGCCAGA
AGGTTAAAGG GCTATATCAA TGATAAACTG GACGAAAATA ATGCCAGTGT GGTTAAGGAA
CTTATATCCT GTCTTGAATC AAATCAGAGG TTGGAACAGA AAATAGATAA CTGCATACTG
AGCGAAGATG AGATTGCTGA TAATGCAAGT CCTGCACTAA GCAGCATCAG ACGTCAGATT
AAGGAGCAGC AGGCGTCTAT TAAGGATAAG CTGAATTCTA TTATTCGTTC CACAAAATAT
CAGAAATATA TTCAGGAATC AGTTGTAACC ATGAGGGGTG ACAGGTATGT AATCCCGGTA
AAGCAGGAGC ATAAGGGAGA TATACCCGGA TTGGTTCATG ATTCTTCTGC CAGCGGAGCC
ACCTTGTTTA TTGAGCCAAT GGCAGTTGTT GAAGCAAACA ACAGCATAAA ACAGCTCAGA
GTGAAGGAAC AAACAGAGAT AGACAGAATT CTTGCCGAGC TTTCTCAGGA TGCTTCTCTA
GTATTACCCC AATTGAATGC TAACATGAGT ATAATGGCAA GACTTGATTT TATTTTTGCA
AAGTCAAAAC TGGCAATAGA CTATAACTGT ATATGTCCTA AAATTAATGA TACAGGCAAA
ATAATTATTA AAAAGGGGAG GCATCCTCTT CTTGACCCTA AAATTGTAGT TCCTATTGAT
TTCTGGATTG GTGAAAAATT CAGTTCATTG ATTGTTACAG GACCCAATAC AGGGGGTAAG
ACCGTTTCAC TTAAAACAGT AGGACTATTT ACTCTTATGA TGCAGTCAGG GCTTCTGGTT
CCCGCAAATG ACGGAACAGA GATGAGCGTA TTTGAAAAAA TCTATGCAGA CATTGGCGAC
GAGCAGAGTA TTGAACAAAG TCTTAGTACG TTTTCTTCTC ATATGAAGAA TATTGTGGAC
ATTCTAAGTG GTGTCAACAA TAAGTCACTG ATACTTCTGG ATGAATTGGG AGCGGGGACA
GACCCTACCG AAGGGGCGGC TCTTGCTATG TCCATTCTTG AATGTCTGCA CCAGATGGGA
GCTACTACCC TTGCTACAAC CCACTATAGT GAACTCAAGG TTTATGCTAT TTCAACAACG
GGAGTTGAAA ATGCCTCCTG TGAATTTGAT GTTGAAACCC TTAGGCCTAC CTACCGACTG
CTTATAGGCG TACCGGGAAA GAGTAATGCA TTTGCCATTT CTAAAAGACT TGGCCTGACA
GATGACATAA TAGAAAGGTC TAAGGAATTC TTGTCACAAG AGGACATCAG GTTTGAGGAC
ATATTATTAA GTATTGAAAA GAACCGTAGT GAGGCCGAAA AAGAGAAAAT GCGTGCTGAA
AGCTATCGTC AGGAAGCGGA GAGGCTTAAA AAGGACCTGG AAGATCAGAA GCGAAGATTG
GCCGCTCAGA AGGAAAGCGA GCTTCGCAAG GCCCGTGAGG AGGCACGCCG TATTCTTACC
GATTCAAAGC GTCAGGCCGA TGAACTTGTT TCTGAAATGA AAAGACTGGC AAAGGAACAG
GAGGAAGCCG AAGTTCGAAG GCAAACAGAA GAGCTCCGTC AAAAACTTAA TAAGAGTATA
AATAATCTGG ATGATTCGCT GGTTGAATCA ATTATGCCCA GACAAGGACT GGTTAAACCA
CCCAAAAACC TCAAGCCGGG TGATACTGTA TTGATAGTAA ACCTCAACCA GAAGGGAACG
GTTTTAACAC TCCCGGACAA GAATGGTGAA GCACAGGTTC AGGCAGGAAT TATGAAAATA
AATGTGCATA TTTCAAACCT CAAACTGGTA GATGAGCAGA AACAGCAGAT TCAAAGAACC
GGAATGGGAA AAATAGGTGT TTCTAAAGCA CAGAATATGT CAACTGAAAT TGATCTCCGT
GGAATGATGC TCAGCGAAGC TGTTGACGTT GTCGACAAGT ATCTTGACGA TGCAAGTATA
GCAGGTATGG GGGGAGTAAC ACTTATTCAC GGAAAAGGAA CAGGAGCACT GCGGGCTGGG
TTGCATCAGC ATTTGAAGCA TAATCCTCAT ATAAAAAGCT TCAGACTTGG TAAGCTGGGT
GAGGGCGAAA ACGGGGTTAC AGTTGTTGAG CTTAAATAG
 
Protein sequence
MNEKTHRVLE FDKILDKLKG LTASELGREL VLELTPQTDY RVVEKMLSET NDGVSCIMRR 
GSPPLGGITD IRMSLKRLDM GGVLNPGELL RLAGVLRAAR RLKGYINDKL DENNASVVKE
LISCLESNQR LEQKIDNCIL SEDEIADNAS PALSSIRRQI KEQQASIKDK LNSIIRSTKY
QKYIQESVVT MRGDRYVIPV KQEHKGDIPG LVHDSSASGA TLFIEPMAVV EANNSIKQLR
VKEQTEIDRI LAELSQDASL VLPQLNANMS IMARLDFIFA KSKLAIDYNC ICPKINDTGK
IIIKKGRHPL LDPKIVVPID FWIGEKFSSL IVTGPNTGGK TVSLKTVGLF TLMMQSGLLV
PANDGTEMSV FEKIYADIGD EQSIEQSLST FSSHMKNIVD ILSGVNNKSL ILLDELGAGT
DPTEGAALAM SILECLHQMG ATTLATTHYS ELKVYAISTT GVENASCEFD VETLRPTYRL
LIGVPGKSNA FAISKRLGLT DDIIERSKEF LSQEDIRFED ILLSIEKNRS EAEKEKMRAE
SYRQEAERLK KDLEDQKRRL AAQKESELRK AREEARRILT DSKRQADELV SEMKRLAKEQ
EEAEVRRQTE ELRQKLNKSI NNLDDSLVES IMPRQGLVKP PKNLKPGDTV LIVNLNQKGT
VLTLPDKNGE AQVQAGIMKI NVHISNLKLV DEQKQQIQRT GMGKIGVSKA QNMSTEIDLR
GMMLSEAVDV VDKYLDDASI AGMGGVTLIH GKGTGALRAG LHQHLKHNPH IKSFRLGKLG
EGENGVTVVE LK
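
The gene and protein sequences are related by translation under table 11. As a minimal sketch, the code below strips the 10-character spacing from a sequence row and translates it codon by codon. It relies on the fact that table 11 assigns amino acids to codons in the same way as the standard code (the two tables differ in which start codons are permitted), so a standard codon table is used. The helper names `clean` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used so that the example stays short. Each row holds 60 bases, so every row begins on a codon boundary.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAATGAAA AAACTCACAG AGTACTTGAA TTTGATAAAA TTCTCGATAA ATTAAAAGGC"
last_row = "GAGGGCGAAA ACGGGGTTAC AGTTGTTGAG CTTAAATAG"

print(translate(first_row))
print(translate(last_row))
```

The two printed strings can be compared with the first 20 and last 12 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would be expected to reproduce the whole protein, provided the assumptions above hold.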