Gene Ccel_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0721 
Symbol 
ID7309577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp829645 
End bp830583 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content37% 
IMG OID643607660 
Productpeptidase M19 renal dipeptidase 
Protein accessionYP_002505080 
Protein GI220928171 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00790748 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTTG TTGATGCACA TTGTGATACC ATTACAACTA TAATGAAAAC AGGCGAAGCC 
TTGAAAAACA ATAAGGGTCA TATTGATTTG GACAGATTAA AAAAGTATGA AAGCTTTGTT
CAGTTCTTTG CGGCATTTAT TGCTCCTGAA CAGGCAAAAA TGGGAGCTTT AAGGCGGACA
CTTGATATCA TAGATAAACT TTACAGAGAA ATTGAAATTA ATAAGAACGA TATTATGTTA
TGTCGTAATT ACAACGATAT AGTAAATGCA ATAAATAGTA GTAAAGTAGC TGCAGTTTTA
ACCATTGAGG GCGGGGAAGC ACTTGAGGGA AGTTTATCTG TATTGCGTAT TCTCTATCAA
CTGGGTGTAA GGGCAATAAC TCTTACTTGG AACTTCAGAA ACCAGATTGC TGACGGTGTA
GCTGACTCTG TTACAAATGG AGGTCTTACA CCCTTCGGCA GGGAAGTAGT TGCTGAGATG
AACAGACTGG GAATGATGGT AGATGTATCC CACATATCGG AAGCGGGATT TTGGGATGTA
ATAAATCTTT CGTCGGCACC GATAATAGCT TCGCATTCCA ATGCAAAGAA GATTTGTGCT
CACAAGAGAA ACTTAACCGA CGAACAGCTT CTTGCATTGA AAAAAAACGG TGGCGTAACA
GGCTTAAACC TTTATTCTGA TTTTATAGAA AATGAGGGTA AGGCTGAAAT GAAGCATGTC
ATAGCTCACA TTGAACATAT TATAGGACTT ACTGGAGAGG ATACTCTGGG ACTAGGAGCT
GATTTTGACG GTATAGATAA AACGCCTTCA GGACTTGAAG GAGTACAGTG CTTAACCAAT
TTAATAAATG AACTGCTTAG ACTTAATTAC AGCGAAACAC TAATAAACAA AATAGCAGGA
GAAAATTTTC TTCGAGTTAT AAAAACAGTA GCTAAGTAA
 
Protein sequence
MIFVDAHCDT ITTIMKTGEA LKNNKGHIDL DRLKKYESFV QFFAAFIAPE QAKMGALRRT 
LDIIDKLYRE IEINKNDIML CRNYNDIVNA INSSKVAAVL TIEGGEALEG SLSVLRILYQ
LGVRAITLTW NFRNQIADGV ADSVTNGGLT PFGREVVAEM NRLGMMVDVS HISEAGFWDV
INLSSAPIIA SHSNAKKICA HKRNLTDEQL LALKKNGGVT GLNLYSDFIE NEGKAEMKHV
IAHIEHIIGL TGEDTLGLGA DFDGIDKTPS GLEGVQCLTN LINELLRLNY SETLINKIAG
ENFLRVIKTV AK