Gene Ccel_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1761 
Symbol 
ID7310494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2108628 
End bp2109674 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content39% 
IMG OID643608690 
Productpeptidase M42 family protein 
Protein accessionYP_002506092 
Protein GI220929183 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTGA TAAAAGAACT AACAGATTTA AACGGTGTAT CAGGAAATGA AAATGAAGTA 
AGAGAATATA TTAAAAGTAA AATTAATGGA CTGTGTGATT CTATAGAAGT AGATTCAATA
GGAAATATTA TTGCGTATAA AAAAGGCAGC AGCGGCAAGT ATAAAGTCAT GCTTTCAGCC
CATATGGATG AAGTTGGATT CATGGTTTCA GGATATATGG AAAAAGGATT TTTGAAATTC
AAACCTGTCG GGGGTATTGA CAGCAGAATT TTACCCGGTA AAAGGGTTGT AATAGGTAAA
AAAAGGCTCA AAGGCGTAAT AGGTGCAAAA CCGCTACATC AGCAAAGTTC TGAAGAACGG
GAAAGGATAG CGAAAATCAA GGATTTATAT ATAGATATTG GGGCGGAAAC AAAGGAAGAG
GCCGAAAAAA TGGCTCCTTT AGGTGAGTTC ATTGCCTTTG ACAGCGAGTA TGTAGAATTG
GGAAAAGACT GTATAAAGGC AAAAGCACTT GACGACAGAA TTGGCTGTGC GGTACTTATG
GAGGTTTTAA AGTATAATTT CGAGTTTGAT TTGTATGCCT GCTTTACAGT ACAGGAAGAG
GTTGGGCTTA GAGGTGCACA GGTGGCTGCA TTTAAAATTA TGCCGGATAT AGCACTTGTT
CTGGAAGGGA CTACTTGTGC GGACGTTCCA GAGGTAAAAC CCTTTGATTT TTCAACAGTA
CTCGGTAATG GTGCAGCACT TACATTGGTA GACAGAACCT GTTACAGCGA CAGAAAGCTT
GTACAGTTTT TATATGATAC AGCAGTTAAA AACGGCATTA AGGTTCAATA CAAGCAGACC
ACCACAGGTG GAAATGATGC GGGACAAATA CAAAGAACCG GTACGGGAGT TAAAACTGCC
TCAATATCGG TGCCCTGCAG GTATATACAT TCTCCTGTAT CTGTTATGAG CATGAGTGAT
TTTGAATGTG TAGAAAGGCT TACACTTGCA GCATTAAATG AAATGAACAA AGATAAGGAT
TTTATAAAAA ACATCGCAGC AGTATAG
 
Protein sequence
MMLIKELTDL NGVSGNENEV REYIKSKING LCDSIEVDSI GNIIAYKKGS SGKYKVMLSA 
HMDEVGFMVS GYMEKGFLKF KPVGGIDSRI LPGKRVVIGK KRLKGVIGAK PLHQQSSEER
ERIAKIKDLY IDIGAETKEE AEKMAPLGEF IAFDSEYVEL GKDCIKAKAL DDRIGCAVLM
EVLKYNFEFD LYACFTVQEE VGLRGAQVAA FKIMPDIALV LEGTTCADVP EVKPFDFSTV
LGNGAALTLV DRTCYSDRKL VQFLYDTAVK NGIKVQYKQT TTGGNDAGQI QRTGTGVKTA
SISVPCRYIH SPVSVMSMSD FECVERLTLA ALNEMNKDKD FIKNIAAV