Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1111 |
Symbol | |
ID | 7309924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1368575 |
End bp | 1369855 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643608034 |
Product | glycoside hydrolase family 18 |
Protein accession | YP_002505449 |
Protein GI | 220928540 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAATTC ATGTTGTAAA GTCAGGTGAA AGCATATATT CAATTGCACA GCAATACAGG GTTTCACCGC AGAAAATAAT ATCCGATAAT GAACTGAACA ATCCGGATCA GCTGGTAGTA GGACAAACTT TAGTAATACT GGAGGGAACC CGACGACATG TTGTAGCCCC CGGTGAATCA GTATATTCCA TAGCAAGGTC ATATAGAATA AGTGTAAATG AACTTTTGGC AGCCAACCCA CAGATATCTG ATCCCTCCAG AATACAGCCG GGAATGGTTA TTACCATACC ACCTGTAACC TATAATTACG GACCTATGGA AGTAAATGGA TATGCATTTC CCAATATTGA CATGGAAGTG TTACGGAAAA CGTTGCCTAA TCTGACTTAC CTCAGTATTT TCAGCTATCA GGTAAGTCCT GACGGCAACC TACAGTCAAT ACCTGATGAG CCACTTATTC AAGCTGCAAG AGCCGCAAGG GTAGCTCCGC TTATGGTTAT AACCAATATA AAAGAAGGTG GGGGCTTTGA CAGCGATATA GCACATTCAA TACTAACAAA CGAGACAGCA CAGACAAACC TTCTTAATAA TGTTACCAGA ATTTTAAGAC AAAAAAATTA TTTTGGACTG GACATTGACT TTGAGTATAT ATACCCATAT GACAGGGAAA GCTATAATAA CTTCCTGAGA AGAGTTGTAA GGACTCTCAG GCCACTGGGT TACACCATTA CTACAGCACT TGCTCCGAAA ACATCGGCTA CTCAGAAAGG AAAACTTTAT GAAGCACATG ATTATCCTGT TCATGGAGCA TTGGTTGATC ACGTTATACT TATGACTTAT GAGTGGGGAT TCACATACAG TGCTCCTATG GCAGTATCTC CTATAACCGG TGTAAGAAGT GTTCTTGACT ATGCTGTAAC AGCAATCCCA AGACGTAAAA TATTCATGGG AATATCGAAT TATGGTTATG ACTGGACTCT TCCGTATACT CCCGGAACTG CTGCCAGAAC GGTTACCAAT ACAGGTGCAG TGGACCTTGC CAGAAGAAGA GGGGCTGAAA TCCAATATGA TGTAATATCT CAGGCACCTT TCTTCTATTA CTATGCTGAT GACAGAAAAC AGCATGTAGT CTGGTTTGAG GATGCCAGAA GTATTTTTGC CAGACTGACT CTGGCACATG AATACAGGCT CGGCGGAGTA AGTTACTGGA CAATTAACAG TTACTTCCCA CAGAATTGGT TGGTTTTAAG CTCTATATTC AATATAAGAA AGGTGCTTTA G
|
Protein sequence | MRIHVVKSGE SIYSIAQQYR VSPQKIISDN ELNNPDQLVV GQTLVILEGT RRHVVAPGES VYSIARSYRI SVNELLAANP QISDPSRIQP GMVITIPPVT YNYGPMEVNG YAFPNIDMEV LRKTLPNLTY LSIFSYQVSP DGNLQSIPDE PLIQAARAAR VAPLMVITNI KEGGGFDSDI AHSILTNETA QTNLLNNVTR ILRQKNYFGL DIDFEYIYPY DRESYNNFLR RVVRTLRPLG YTITTALAPK TSATQKGKLY EAHDYPVHGA LVDHVILMTY EWGFTYSAPM AVSPITGVRS VLDYAVTAIP RRKIFMGISN YGYDWTLPYT PGTAARTVTN TGAVDLARRR GAEIQYDVIS QAPFFYYYAD DRKQHVVWFE DARSIFARLT LAHEYRLGGV SYWTINSYFP QNWLVLSSIF NIRKVL
|
| |