Gene Ccel_2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_2447 
Symbol 
ID7312364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2951670 
End bp2955083 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content36% 
IMG OID643609377 
Productpeptidase M16 domain protein 
Protein accessionYP_002506756 
Protein GI220929847 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TTGTTTCGTA CATGATGACA TTTGTTTTAA TTTTAAGCCT CTGTCTTAAT 
GCAGCACCGA GTGTAAATGC TGCTCAAACG GAACTCAAGG CATTGCCGGA GGCAGGTCAG
GTAGTTTCCG GTTTCAAGGT TATGGAAATT GGAAATATGG ATATTATTGA CAGTAAAACT
GTTTTATTTG AGCATGAAAA AACAGGTGCA AAGTTTATTT TTATACAGAA TAAGGATACT
AACAGGACAT TTGACATTTC ATTCAAAACA CCTGCTTTTA ATGATACGGG GGTTAATCAT
ATACTAGAGC ATATAACCGT ATCCGGCTCA CAGAAGTATC CAATGAAAAA TGTATTATTT
ACAATTCTGA ATCAGACATA TTCTACTTTT ATAAATGCAT TTACAGCCCA AAACTTTACT
ACATATCCTG TCTCATCACT GAGTGAGGAT CAGCTCTTGA AGCTAGCAGA GGTTTATCTG
GACTGTGTAT ATCATCCGTC GGTATATAAT GACAAAAATA TCTTTAAAAG AGAAGCCTGG
AGATATGAAA TGACGGACAG CAAGGCTGAT CTTAATATCA GCGGTACAGT ATATAATGAA
ATGAAGGGTG CCTTGGGAAA TATAACAACT GCGGCGGCAT ACAATGATTT AAAAACCCTA
TTTCCTAACA GTACTCAATC TACCATTTCG GGAGGAGATC CTGAGAAGGT AAAAGACCTG
AAATATGAGG ACGTAATAAA GACTCATCAA ACATACTACC ATCCATCCAA TTCACTGATG
GTTCTCTATG GAAATGTGGA TTATGAAAAG TTTCTTAAAA TGATTGATAC TGATTATCTT
TCAAAATATG AAAAGAAAGA CATAAAGATT GAAAAACTAA AGCTGGAGCC ATTTAAGAAA
ACAGTAGAAA AAACCTATAA ATATCCTGTT GCTGCCGGTA CAAATACTAA AAATGCATCT
CAGATCGATT ATTGCTTTGC ACTGGAAAGT ATCTCCAACG AAGAATTACT GGGTGTAGCT
ATTTTGAATG AGTTAATTGG AAGTAATACT TCTGCATTGA AACAGGAATT CAGGGATAAA
AAGCTTGGAG GAGATATAGC AGTTACTTTT AATACAGGAT TATCAATACC TGTTTTGACT
TTCTCCGCAC AAAATACTGA TGAAAGTAAA AAAGCTGATT TTAAAGCACT TGTTGATAAA
TATCTGAGTA ATGTTGTAAA ATCTGGCTTT AAGACGGATG ACGTAGATTC AGTAATTGCC
GGAGAATTAA GGGGATTATC GAGCATTACC GAAACGCCTA ACCTTGGAGT AAATTTGTCT
ACACAAATGG GCAGCTTCTG GGCTAATTTA GGCAGTCCTG ATTTTTATAA CGATATGCTT
AAAAATATAA AGTCTATGGC TGCTAAATCA GGCAAAAAAT ATTTTGAAGG CCTTACCGAG
AGATTTCTTA TAAACAATAA AAACACAGCA CTGGTTACTA CTGTTCCTGA GGCAGGACTT
GCTGAAAAGC AGGCAGCAGA ACAGAAAAAG TATCTTTCTG ATTTAAAGGC ATCAATGAGC
CAACAGCAAA TAGATGCGAT TGTCAAAGAA ACAAAAACCT ACAACGAGTG GAACAGCAGA
GAAGATAATA AAGATGTAGT TAAGAGTATT CAGGCAGTGA AGATTTCTGA TTTGCCGGAG
GAAGTAAAAA ATTATAACGT TAAAGAAGTT AAATCTGATG GAGTAAGATT GATATCAGCG
GAAGCTAATG TTGGTGAAAT CGAATCCACA CGTCTCTATT TGGACACTTC AGCTGTTCCT
GCTGATAAGC TTCATTACTT GAAGCTTTAT ACTGATTTAT TGGGAAACCT TGATACCAAG
TCCCATACAA AGGATGAATT AGGAAATCTT AAGACAAGGT ATATCAGCGG AGTCGCATTT
AATTTATCCG CCTTGACTGA TAAAAATTAT AAAAATTATT CTCCTGTCCT AAGTGCTTCA
TGGACTGGAA TAATGGGGGA CTATGATAAG CAGATTGAGG TTGTAAAAGA CATTCTGCTC
AATACTCAAT TTAATAAAAA CACTGATATT TTAAATATAA TCAAGTCCAG AATATCGGAA
CTTAAAATGC AATTTACAAA CAGTCCTATA AGTATACAAG CAATGAGAAG CAGATCCTAT
TTTAGTGAGG TATATAACTA TCTTAATTAT AGTACAGGAT TAGATTATTA CAATTTCCTT
ACAGAATTGG AAAAAGAAAT TTCCAATAAT CCCCAAGGTG TATTAAAAGA GCTGAATAAT
ATCAAAACAT TGGTTACAAA TAAAAAGAAT TTGATAATAA CATTCGCAGG TAATAAAAAG
AGTATCAGCA GGTTTGAGTC TACAATCAAG AATTTAACCG ACGGGATGTC ATCAAAGGAT
ATTGTGAAGC AGGATTATTC AAAACTTCCA AAGCCTGTAA AAAGAGAGGG TATATCAGTA
GACGGTACTA TACAGTATAA TATGCTTTAT TCAACATATG AAAAGATGGG GACTGTATTT
AGCGGAAAAT ATATTCCAAT AGGTTCAGTT ATAAACGAAA ACTATATCAC TCCTAAGATA
AGATTCGGCT ATGGTGCGTA TGATAATATT GTCAATTTTG GAGAAGAAGG CTTTATGCTT
GCATCATTCC GTGACCCTAA TGTAAAAGAA ACCTTTGAGG TTTATAATGG ACTTCCGGAG
TTCGTTAAAA ATGTTGATCT TACTCAGGAA CAGCTGGATA GTTACATTTT GAAGTCGTTC
AGCGATTACA CCGTGTCTGC GGGCGAATTA TCGGGAGCCG GCACAGCACT TTCCTACTAT
TTGATGGATT TTAAATCAGA GGATATTTTA AAGATATTGA AAGAAATTAA ATCCGTTACG
GTGCAGGATG TTAAGGATAC GGCTTCAATG CTTGAAAATA TGCTTAAAAA CGGAGCATAT
TCAACAGCAG GCAGTAAGGA AAAACTTACT GAAAACAAGG AGCTTTATGA TGGTATAGTT
TCAGTAGGTC AGGAAGAGGA TTCTAAGTCA GATAGTTCAA TTACAAGAGG TGAGTTCTTT
AAATTGGTCT TGGCTGGTGC TCCAGAGCCT CTTGAAATAG CCAAACAACA AGGCCTGATA
ACCGCCGATA AAAAGGGAAA CTACCATGAG AACAGAAAGT TGACAAGAGA AGAGCTTGCA
GTTTTTGTAT ACAAAATAGC AACTCTGAGC GGCGTACAGC TTCCGACTGC AAACCCTGAA
ATCGCAGATA TTAATTCCTC AGCAACATGG TCTAGGAATG CTATCAAAGC TTTGGTGGGA
TTTGATGTAA TCAAGCTGGA TGACAAGGGC AATTTCAATC CGAAAGGTGA AGTAACAGAT
GCCTATGTCA CTGATCTTTT TAATAACTTA AATCAGAAAC TTTCAGGAAA ATAA
 
Protein sequence
MKKVVSYMMT FVLILSLCLN AAPSVNAAQT ELKALPEAGQ VVSGFKVMEI GNMDIIDSKT 
VLFEHEKTGA KFIFIQNKDT NRTFDISFKT PAFNDTGVNH ILEHITVSGS QKYPMKNVLF
TILNQTYSTF INAFTAQNFT TYPVSSLSED QLLKLAEVYL DCVYHPSVYN DKNIFKREAW
RYEMTDSKAD LNISGTVYNE MKGALGNITT AAAYNDLKTL FPNSTQSTIS GGDPEKVKDL
KYEDVIKTHQ TYYHPSNSLM VLYGNVDYEK FLKMIDTDYL SKYEKKDIKI EKLKLEPFKK
TVEKTYKYPV AAGTNTKNAS QIDYCFALES ISNEELLGVA ILNELIGSNT SALKQEFRDK
KLGGDIAVTF NTGLSIPVLT FSAQNTDESK KADFKALVDK YLSNVVKSGF KTDDVDSVIA
GELRGLSSIT ETPNLGVNLS TQMGSFWANL GSPDFYNDML KNIKSMAAKS GKKYFEGLTE
RFLINNKNTA LVTTVPEAGL AEKQAAEQKK YLSDLKASMS QQQIDAIVKE TKTYNEWNSR
EDNKDVVKSI QAVKISDLPE EVKNYNVKEV KSDGVRLISA EANVGEIEST RLYLDTSAVP
ADKLHYLKLY TDLLGNLDTK SHTKDELGNL KTRYISGVAF NLSALTDKNY KNYSPVLSAS
WTGIMGDYDK QIEVVKDILL NTQFNKNTDI LNIIKSRISE LKMQFTNSPI SIQAMRSRSY
FSEVYNYLNY STGLDYYNFL TELEKEISNN PQGVLKELNN IKTLVTNKKN LIITFAGNKK
SISRFESTIK NLTDGMSSKD IVKQDYSKLP KPVKREGISV DGTIQYNMLY STYEKMGTVF
SGKYIPIGSV INENYITPKI RFGYGAYDNI VNFGEEGFML ASFRDPNVKE TFEVYNGLPE
FVKNVDLTQE QLDSYILKSF SDYTVSAGEL SGAGTALSYY LMDFKSEDIL KILKEIKSVT
VQDVKDTASM LENMLKNGAY STAGSKEKLT ENKELYDGIV SVGQEEDSKS DSSITRGEFF
KLVLAGAPEP LEIAKQQGLI TADKKGNYHE NRKLTREELA VFVYKIATLS GVQLPTANPE
IADINSSATW SRNAIKALVG FDVIKLDDKG NFNPKGEVTD AYVTDLFNNL NQKLSGK