Gene Ccel_0552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0552 
Symbol 
ID7309424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp637082 
End bp638749 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content34% 
IMG OID643607488 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_002504914 
Protein GI220928005 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTT TTTATATCAT TTTAGCAATA ACCGCCGGCT GCATAATTTT AAGTATAATC 
GGAAAAATAT CCCGTACAAA TAAAAATAGA AAGCAAATAC GTGCCCAATG GGGGAAAGCA
CCTGTTACAA AGTATACTGC TGATATATAC AACTCGGTTA GAAATTATTT TGACAACAAT
AAAGACCAGG GAGAAAGTTT TTTCATAGAT GATATTACTT GGAACGATTT AGATATGAAT
AGAATTTTTT CCAGGTTAAA CATAACCTGT ACGGATGTTG GAGAAGAATA TCTGTATAAT
ATACTGAGGG AATTACTCTA TGACCAGAAT GAGCTGACAG AGCGGGACAG GCTGATAGAA
TATTTCAGAA CAAACCCCTC CCAACGTGAA AAGATTCAGT TGATATTATC CGGTTTGGGG
AAACTCCGGT ATTTAAGTAT ATCAGAATAT ATTAACGGTA AAAGAAGCGG GGGTGGCATA
AAGAGTATAT ACTATAAGAT TTTATCATTA ATCTTCATAG CATCAATTTT TGCCACAATA
TTTTACCCCG GAGCAATAGC CATTTTTCTG ATATCGCTGG CCGTAAATGT AGTAGTTTAT
TTCAAGGCCA GAGATCAGAT AGTAGGCCAC TTGCAATCAC TTGGATATAT TGTGAATATG
TTGGGAATTT CACGAAGAAT ATCAAAGCTC AATATAAAAG AACTTAATAC ATATTTAAAT
GAATTGAAAA AATGTACGGC TAAAGTTAAG GGAATAAGCG TGAATGCTTT TTACTTTTTA
TTTTATACAA GTGAAAACTA TTTATTTGAG CTTATTAAGA TATTTCTTCT TGGGGAACCG
ATTGCATTTC ACAGTATTTT TAAAATTGTA AACAAATACC GCCATGAAAT TGATTCCGTA
TATAGGACAA TTGGACTTCT TGATAGTCTT ATATCAGTGG CATCCTACAG GGAAAGCCTT
GATTATTTTA CAACGCCGGT TCTGACAAAA GATAATATCA ACAACAAAAA AATAGAATTT
ACTGATATGT ATCATCCCTT GATAAAAAAT CCTGTTACAA ATTCCTTTTC AGTAACAGGA
GGGGCATTGA TAACCGGGTC AAATGCCAGC GGAAAATCAA CGTTTCTCAA GTCGGTAGCA
ATAAATGCAA TTTTTGCTCA GACTATTTTC ACATGTCTTG CAAAAGATTA TTATTCCAGC
TATTTTAATA TTTACAGTTC TATGGCTCTG AGTGACAATC TTGAAATGAA TGAGAGTTAC
TACATTGTTG AAATAAAATC ACTTAAACGT ATCTTGAAGG GTCTCAACGA CCATGTTCCA
TGTTTCTGTG TCATAGACGA AGTGTTAAGA GGTACAAATA CTATCGAAAG AATTGCAGCA
TCCTCTGAGA TAATGAACTT TATAACAGAC AATAATTGTA TATGTCTTTG TGCTTCTCAC
GATATAGAGC TTACTCAGAT ATTAGCAGAT AAAGTTGAGA ATTACCACTT TCAGGAGTTC
TTTGAGGATG ACAACATAAA GTTTGACTAT AAAATATACC CGGGTAAATC AACTACACGC
AATGCTATAA AATTATTAAA AATACTCGGT TATGATGAAA GCATAGTTGA CAATGCCGAA
CAAAGAGCCT GTCAATTTAA CCAAAACGGC TATTGGTCTA AGGTATGA
 
Protein sequence
MKFFYIILAI TAGCIILSII GKISRTNKNR KQIRAQWGKA PVTKYTADIY NSVRNYFDNN 
KDQGESFFID DITWNDLDMN RIFSRLNITC TDVGEEYLYN ILRELLYDQN ELTERDRLIE
YFRTNPSQRE KIQLILSGLG KLRYLSISEY INGKRSGGGI KSIYYKILSL IFIASIFATI
FYPGAIAIFL ISLAVNVVVY FKARDQIVGH LQSLGYIVNM LGISRRISKL NIKELNTYLN
ELKKCTAKVK GISVNAFYFL FYTSENYLFE LIKIFLLGEP IAFHSIFKIV NKYRHEIDSV
YRTIGLLDSL ISVASYRESL DYFTTPVLTK DNINNKKIEF TDMYHPLIKN PVTNSFSVTG
GALITGSNAS GKSTFLKSVA INAIFAQTIF TCLAKDYYSS YFNIYSSMAL SDNLEMNESY
YIVEIKSLKR ILKGLNDHVP CFCVIDEVLR GTNTIERIAA SSEIMNFITD NNCICLCASH
DIELTQILAD KVENYHFQEF FEDDNIKFDY KIYPGKSTTR NAIKLLKILG YDESIVDNAE
QRACQFNQNG YWSKV