Gene Ccel_0472 details


Gene Information

Locus tag: Ccel_0472
Symbol:
ID: 7309349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 540905
End bp: 542188
Gene length: 1284 bp
Protein length: 427 aa
Translation table: 11
GC content: 34%
IMG OID: 643607402
Product: peptidase M16 domain protein
Protein accession: YP_002504834
Protein GI: 220927925
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
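
The start, end, gene length, and protein length values in this record are consistent with each other. The sketch below shows the arithmetic, assuming 1-based inclusive coordinates and assuming that the gene length counts the terminal stop codon while the protein length does not. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(540905, 542188)
print(length, protein_length(length))  # prints: 1284 427
```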


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.169068
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTG ATACCATTGA ATATAAAAAG TACAACGAAT TATTTTACCG ATATGAACAT 
TCCAGTGGTT TAAATTGTAT AGTAATTCCT AAGAAGGGCT ACTACAAAAA GTATGCAACA
TTTTCTACTC AGTACGGTTC TGTAGACAAT GAATTTATCA TACCAGGAGA AAATGAACCG
ATAAGAGTTC CTGATGGAAT TGCCCATTTT CTGGAGCACA AGCTGTTTGA ACAAAAAGAC
GGAAGTGTTA TGGATAAGTT TGCCGCTTTA GGCTCGAAAC CAAATGCATT TACAAGCTTT
AACCAAACAG TGTACCTTTT TTCATGTACA GACTTGTTTA GCGAAAACTT CAAGCTTCTA
TTAAACTTTG TTCAAAATCC GTATATCACC GATGAAAGTG TTGAACGTGA AAAGAAGATA
ATAGGACAGG AAATTAATAT GTACCGTGAC GATCCCGGTT GGAGGGTAAA CTTCAACCTA
TTGAAAGCAA TGTATAAGCA CCATCCTGTA AGATACGATA TAGCAGGTAC TACTGACAGT
ATAAGTGAAA TTACAAAGGA AACTTTGTAT CAGTGCTACG AGACCTTCTA CCATCCATCT
AACATGATAA TAACAGTAGT TGGTGATGTG GATCACATTA AGGTTTTTGA ACAGGTTGAA
AATGGCATAC AGACATCGGA TAAGGCTTCT GAAATTAAAA GAATCTTTCC TAAAGAAAGT
GAAGGGGTTA ACAAAAGATA TTTTGAACAA AATATGCCAG TAGCAACGCC GTTATTTTAT
ATGGGGTTTA AAGACAGCAA TTTTGATTTA GAAGGCGGCG AAATCTTGAG ATATGAGATT
GCTGTAAAGC TTCTGCTTTC AATGATTATG GGGAAAAGTT CAAAGCTGTA TGAGAAGTTG
TACGATAAGG GACTTATTAA TGCCAGCTTT GAAATGGATT TTTCCTTAGA AAAGAGTTAT
GCTTATTCAA TGTTTGGAGG AGAATCTGTC AATCCTGAGG AGGTTCAGGA AATGATTACA
AATGAGATTA AGATACTAAA AAAGCAAGGC CTTGACGAAG AGGCTTTTAA CAGACTTCTT
AAAGCCTCTA AAGGTAGGTT TCTGAGACAG CTTAATTCCC TTGAAAATAT ATCCAGATCA
TTTATAAATT TATATTTCAA GGGTGTTACA ATGTTTGATT ATTTAGATGT TTATGATAAA
ATGAAATTTG ATTATATTAC AGATGTGTTT GACAGTCACT TTGACATTAA ACACATGGCA
TTATCTGTTG TTAAGCAGAA ATAA
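
The gene sequence is printed in space-separated groups of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below is one way to do that. The helper names are hypothetical, and the sample input is only the first line of the listing, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = clean_sequence(
    "ATGAAATTTG ATACCATTGA ATATAAAAAG TACAACGAAT TATTTTACCG ATATGAACAT"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Passing the full listing to `clean_sequence` and then to `gc_percent` gives the figure to compare against the GC content field.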
 
Protein sequence
MKFDTIEYKK YNELFYRYEH SSGLNCIVIP KKGYYKKYAT FSTQYGSVDN EFIIPGENEP 
IRVPDGIAHF LEHKLFEQKD GSVMDKFAAL GSKPNAFTSF NQTVYLFSCT DLFSENFKLL
LNFVQNPYIT DESVEREKKI IGQEINMYRD DPGWRVNFNL LKAMYKHHPV RYDIAGTTDS
ISEITKETLY QCYETFYHPS NMIITVVGDV DHIKVFEQVE NGIQTSDKAS EIKRIFPKES
EGVNKRYFEQ NMPVATPLFY MGFKDSNFDL EGGEILRYEI AVKLLLSMIM GKSSKLYEKL
YDKGLINASF EMDFSLEKSY AYSMFGGESV NPEEVQEMIT NEIKILKKQG LDEEAFNRLL
KASKGRFLRQ LNSLENISRS FINLYFKGVT MFDYLDVYDK MKFDYITDVF DSHFDIKHMA
LSVVKQK
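
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino acids of the standard code in that same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence listed above.
print(translate("ATGAAATTTG ATACCATTGA ATATAAAAAG"))  # MKFDTIEYKK
print(translate("TTATCTGTTG TTAAGCAGAA ATAA"))        # LSVVKQK
```

The two outputs match the first ten and the last seven residues of the protein sequence.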