Gene Ccel_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1923 
Symbol 
ID7310641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2273419 
End bp2274504 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content37% 
IMG OID643608857 
Productpeptidase M24 
Protein accessionYP_002506251 
Protein GI220929342 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.412191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGAAA AAGAAATATT ATCAAATAGA TTGAATACAT TCAGAGAAGA ATTGAAAAAA 
TTGGGTCAGG ATGGTGCTTT GATTACAAAA AGAGAAAACT ATATGTATCT TTCCGGTTTC
TCAGGTACGT CAGCAAACCT AGTAATTACT AGTAAAAAAG CATATCTTCT GACTGATTTC
AGATATGTTG AACAATCAGC GGTACAGGCT CCTCTGTTTG AAATAGTTGA GCATAAGCCT
GATATAAAAG ATACTATCCT AGAAATATTA GACTCCGAAG GTATTAAAAA TCTGGGATTT
GAAGACAAAA GCCTGACTTA TTCCGAATAC AAAAGCTTTA GCTGCAAATT CCGGGATATT
GAAATGGAAG GAATTGGCTC TGTTGTTGAA AGTCTGAGAA GTATAAAGGA TCAGTATGAA
ATAGAAACAA TAACCAAGGC GGTTGAGATT GCAGACGGTG CATTTACACA TGTGCTTGGC
ATTATAAAAC CTGGTATAAC GGAGTTGGAT GTTGCTGCAG AATTGGAATA TAAAATGAAG
AAATTAGGGG CATCAGGAGC TTCCTTTGAA ACAATTGTTG CATCGGGACT GAGATCATCC
ATGCCTCACG GAGTTGCTTC TGAAAAGAAG TTGGAGATTG GTGACACAAT AACAATGGAT
TTTGGTGCAT TATATAACCA TTACTGCTCC GATATAACAA GAACGGTTTT TCTTGGACAG
CCGGATAAAA AAATGGTAGA TATTTACAAT ATAGTTTTAG AGGCACAGTT ATCTTCAGTG
AGAGGTGCTA TACAAGGCAA AACGGGAAGA GAAGTCGACA AAATAGGTAG GGATATAATT
TATGGCAAGG GATTTGAGGG TAAATTCGGA CATGGACTCG GCCACGGTTT AGGCCTTGAG
ATACATGAAA ATCCACGACT TTCCCCAAGC GGAGATAAAA TATTGAAAAA TAACATGGCA
GTTACCGTAG AACCGGGTAT TTATGTTGAG GGTCTTGGAG GAGTAAGAAT TGAAGATACC
ATAATAATCA GAGATGACAA CCCTCTTGTT TTGACTCGTT CCCAAAAGGA TTTAATTATA
TTATAA
 
Protein sequence
MVEKEILSNR LNTFREELKK LGQDGALITK RENYMYLSGF SGTSANLVIT SKKAYLLTDF 
RYVEQSAVQA PLFEIVEHKP DIKDTILEIL DSEGIKNLGF EDKSLTYSEY KSFSCKFRDI
EMEGIGSVVE SLRSIKDQYE IETITKAVEI ADGAFTHVLG IIKPGITELD VAAELEYKMK
KLGASGASFE TIVASGLRSS MPHGVASEKK LEIGDTITMD FGALYNHYCS DITRTVFLGQ
PDKKMVDIYN IVLEAQLSSV RGAIQGKTGR EVDKIGRDII YGKGFEGKFG HGLGHGLGLE
IHENPRLSPS GDKILKNNMA VTVEPGIYVE GLGGVRIEDT IIIRDDNPLV LTRSQKDLII
L