Gene Ccel_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1040 
Symbol 
ID7309862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1295415 
End bp1296497 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content35% 
IMG OID643607967 
ProductCollagenase and related protease-like protein 
Protein accessionYP_002505382 
Protein GI220928473 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAAA AAACAGGACT GTCAATCCCG TGTCAATGGG ATAAAGACAG CCTCATTGAG 
ATTCTCAACT ACGGGGTAAG TAAAGAAATT GACATTAAGG AAGTGTACGG GACAGCGTCC
TTTGAAAATC TGCCGCATGG AAGAGCTTTT GAGGTTACCA AGCGAATCGA TAAAAATGAT
GCACTGGAAA TTAAGAAAAT AATTTCAGAA AAAGGCATTA CATTTGCCTA TCTTATTAAT
GCACCGCTTG AATTGGATTC ATACGAATTT TTAGAAAATG AACTGGATTG GATAGTAAAC
GATTTTAAGG CAGATTCGAT TACAATAAGC TCTTTAAAGC TTATGAAGTT TGTTCGTGCC
AAATATCCCG ATTTGAAAAT TAATGTATCA ACTATTGCCG GGGTTAAGAC TGTTGAAGAT
ATGAAACAGT ATCTTCCAAT CAATCCCAGC AAGTTTATAA CGCATCATGA TATAAACAGA
AACTATAAGG ATTTGGAAGA AATTATAGAG TTTTTAAGGG AAAAGAATAT AGACTTTGAG
GTTATGCTCA ACGAAAGCTG TCTGAGGAGA TGTGCCAGAC GTGATGAGCA TTATAGCACG
CTTGGGAAAG GATGCGGTGA TAGTGAATTC CATTTATGGT GTAACAGCTT AAAGGTATCG
CATCCATATC AGCTTATCAT GTGTAATTTT ATTCGTCCGG AAGACTTAAA AGTATATGAA
GATAAAGGGA TTAAACTATT TAAGGTAACA GGAAGGTCAA AACCATTGGG CTGGCTCCAG
GAAGTGGTAA GAGCTTATTT AAACAGAGAA TACAATGGAA ATCTGATTCG TCTTTTAGGG
GCTGATCCCA AACTGGAAGC GGAACGGTGG ATATATATAT CCAATAAAGC GTTAGATAAT
TTTCTGGAAA ATTATCCTAA AAATGGAGAC GTCGGAGAGG AAATAAGATA TTGCAAAAAT
ATAATTTTTG ACTTATACAG TAAAAATGAA TTTGCAATAA AAAATGATTT TATAAAGCCA
GAGATAAAGG ACAGGCAATT ATCTTTCAAA ATAGAAAAAG ATATTTATGC ATGGAATTAC
TAA
 
Protein sequence
MPEKTGLSIP CQWDKDSLIE ILNYGVSKEI DIKEVYGTAS FENLPHGRAF EVTKRIDKND 
ALEIKKIISE KGITFAYLIN APLELDSYEF LENELDWIVN DFKADSITIS SLKLMKFVRA
KYPDLKINVS TIAGVKTVED MKQYLPINPS KFITHHDINR NYKDLEEIIE FLREKNIDFE
VMLNESCLRR CARRDEHYST LGKGCGDSEF HLWCNSLKVS HPYQLIMCNF IRPEDLKVYE
DKGIKLFKVT GRSKPLGWLQ EVVRAYLNRE YNGNLIRLLG ADPKLEAERW IYISNKALDN
FLENYPKNGD VGEEIRYCKN IIFDLYSKNE FAIKNDFIKP EIKDRQLSFK IEKDIYAWNY