Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0721 |
Symbol | |
ID | 7309577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 829645 |
End bp | 830583 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643607660 |
Product | peptidase M19 renal dipeptidase |
Protein accession | YP_002505080 |
Protein GI | 220928171 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00790748 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTTG TTGATGCACA TTGTGATACC ATTACAACTA TAATGAAAAC AGGCGAAGCC TTGAAAAACA ATAAGGGTCA TATTGATTTG GACAGATTAA AAAAGTATGA AAGCTTTGTT CAGTTCTTTG CGGCATTTAT TGCTCCTGAA CAGGCAAAAA TGGGAGCTTT AAGGCGGACA CTTGATATCA TAGATAAACT TTACAGAGAA ATTGAAATTA ATAAGAACGA TATTATGTTA TGTCGTAATT ACAACGATAT AGTAAATGCA ATAAATAGTA GTAAAGTAGC TGCAGTTTTA ACCATTGAGG GCGGGGAAGC ACTTGAGGGA AGTTTATCTG TATTGCGTAT TCTCTATCAA CTGGGTGTAA GGGCAATAAC TCTTACTTGG AACTTCAGAA ACCAGATTGC TGACGGTGTA GCTGACTCTG TTACAAATGG AGGTCTTACA CCCTTCGGCA GGGAAGTAGT TGCTGAGATG AACAGACTGG GAATGATGGT AGATGTATCC CACATATCGG AAGCGGGATT TTGGGATGTA ATAAATCTTT CGTCGGCACC GATAATAGCT TCGCATTCCA ATGCAAAGAA GATTTGTGCT CACAAGAGAA ACTTAACCGA CGAACAGCTT CTTGCATTGA AAAAAAACGG TGGCGTAACA GGCTTAAACC TTTATTCTGA TTTTATAGAA AATGAGGGTA AGGCTGAAAT GAAGCATGTC ATAGCTCACA TTGAACATAT TATAGGACTT ACTGGAGAGG ATACTCTGGG ACTAGGAGCT GATTTTGACG GTATAGATAA AACGCCTTCA GGACTTGAAG GAGTACAGTG CTTAACCAAT TTAATAAATG AACTGCTTAG ACTTAATTAC AGCGAAACAC TAATAAACAA AATAGCAGGA GAAAATTTTC TTCGAGTTAT AAAAACAGTA GCTAAGTAA
|
Protein sequence | MIFVDAHCDT ITTIMKTGEA LKNNKGHIDL DRLKKYESFV QFFAAFIAPE QAKMGALRRT LDIIDKLYRE IEINKNDIML CRNYNDIVNA INSSKVAAVL TIEGGEALEG SLSVLRILYQ LGVRAITLTW NFRNQIADGV ADSVTNGGLT PFGREVVAEM NRLGMMVDVS HISEAGFWDV INLSSAPIIA SHSNAKKICA HKRNLTDEQL LALKKNGGVT GLNLYSDFIE NEGKAEMKHV IAHIEHIIGL TGEDTLGLGA DFDGIDKTPS GLEGVQCLTN LINELLRLNY SETLINKIAG ENFLRVIKTV AK
|
| |