Gene Information |
Locus tag | Ccel_1719 |
Symbol | |
ID | 7310459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2067082 |
End bp | 2068206 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643608647 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_002506050 |
Protein GI | 220929141 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
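The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length_bp = gene_length(2067082, 2068206)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the sketch gives 1125 bp and 374 aa, which agrees with the Gene Length and Protein Length fields.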
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000748788 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GGTATTTAAT ATTTTTAGCA GCAATAATGT TACTTTTAAG CTTTAATTTT ATTGTTTCTG GTGCTACTCA GTTAAAGGTA TCTATAAACG GTTTCCAAAC CGACATAGGA ACGGCAACGG TAAACAATAA GGTTTACGTA GATGCAGAAG CATTTGCAAA TCAATTAGGT TTAAGTGCAA GTAAAACAAG CAATTCTATC AACATATCAA CTAAAAATGA TGATATCATT CCAAACATAA TTAAATCAAT AAGTCCATCA GTAGTTGGTA TTATTGGTAA TATTGACTCT GGGAATTCCA CTGCCGATGG AGTTGTGCTG GGAACAGGCG TAATAATCAA ATCTGGCGGA GATATCCTTA CAAACGCTCA TGTCGTTGAG AATATGAGCA GAATTATCGT CGTGTTAAAC GATGGCACAG GATATGAGGC CAGAATTAAG TACATTGATA AGCCAAGCGA CCTTGCTGTA ATAAAGATTG ACAGGATTGG ACTTACAGCG GCCACACTGG GCAAAATGCA GGATATAGTA ATAGGAAAGA CTGCTATAGC TATTGGAACT CCTATGTCGT TCCAAAACAG GAATTCTGCT TCAGTTGGTG TAATCAGCGG ACTTAACAGA AGTGTTGACG GCTTTTATCA ATACAAGCTT ATTCAAACTG ATGCTGCCAT AAACCCCGGT AACAGCGGCG GGCCACTTCT TACCACAAAG GGAGAGGTAA TTGGTATTAA CTCAATGACA ACCGTTAATG CACAAGGGTT AAGTTATGCC ATACCCATAG ATACTGTTCA GTATGTATTA AACCACTTTT ATAAATATGG AAAAGTAAAA AGAGTTACAT TGGGGGCTGA TTTTGAAGAG GACTATGTCG CCCTTTACGG CTTACCGAGT AAAAATGGTT TAAAAATAAC AAGCATTAAA AAAGGCTCTT GTTCCGAAAA ATATGGATTA AAAAAAGATG ATTTTATTTA CAGTATAAAC GGTGTATATG TAAATACACT TGTGGATCTT AATGAAGCAT ATAAATCCGT ATTACCCGGA AATAAAGTCA AAGTTGGAGT AAGAAGAAAT GGTAAGACAC AGAGCATCAA TGTGGTTATG GACGAATTAA AATAG
|
Protein sequence | MKKRYLIFLA AIMLLLSFNF IVSGATQLKV SINGFQTDIG TATVNNKVYV DAEAFANQLG LSASKTSNSI NISTKNDDII PNIIKSISPS VVGIIGNIDS GNSTADGVVL GTGVIIKSGG DILTNAHVVE NMSRIIVVLN DGTGYEARIK YIDKPSDLAV IKIDRIGLTA ATLGKMQDIV IGKTAIAIGT PMSFQNRNSA SVGVISGLNR SVDGFYQYKL IQTDAAINPG NSGGPLLTTK GEVIGINSMT TVNAQGLSYA IPIDTVQYVL NHFYKYGKVK RVTLGADFEE DYVALYGLPS KNGLKITSIK KGSCSEKYGL KKDDFIYSIN GVYVNTLVDL NEAYKSVLPG NKVKVGVRRN GKTQSINVVM DELK
|
| |
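The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces every 10 characters are display formatting only. The codon assignments used are the standard ones, which translation table 11 shares; table 11 differs mainly in the alternative start codons it permits, and those are not modelled here. The helper name `translate` is hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-residue assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-character display spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 15 nt of the gene sequence in this record.
print(translate("ATGAAAAAAA GGTATTTAAT ATTTTTAGCA"))
print(translate("GACGAATTAA AATAG"))
```

The two fragments translate to MKKRYLIFLA and DELK, matching the first ten and last four residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions stated in the lead-in hold.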