Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0739 |
Symbol | |
ID | 7312141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 860434 |
End bp | 862464 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643607677 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002505097 |
Protein GI | 220928188 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000588173 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAAAGA AAATCTTCAT GTTACTGCTT GCTCTTTTGC AAGTAGCACT TTTGACTATA CCTCAACAAC TTGTTACGGC AGCATCGGCA CGCCAGATGG AGTATCTTGA CCGGGGAATC GTTGCGGTTA AAGTAAGCAA TGGAGTATTC GTAAGCTGGA GAATGCTAGG AACAGATTCA ACCAGTATCG CCTTCAATCT CTACCGTGAC GGGAAAAAGG TTAATACCAC TCCAATTACC TCTGGTACAA ATTATGTCGA TACAACAGGA ACAGCCAGCT CAACATACTA TGTTTGTCCT GTAAAGGATG GTGTTGAGCT GGCAAAATCC GAATCGACAG CAGTATGGGG GCAAAATTAT TTATCCCTGC CTCTTCAAAT ACCCGCAGGA GGAACAACAA AAAGCGGTGA ATATTACACA TACTCTGCAA ATGATTGTTC TGTGGGTGAT CTCGATGGGG ATGGCCAGTA TGAGATAATC CTTAAATGGG ACCCGTCTAA TGCAAAAGAC AACTCACAGA GCGGGTATAC AGGGAATGTA TACCTTGATG CATACAAACT AAATGGTAAG TTTTTGTGGA GAATTGATCT TGGAAGAAAT ATCCGTGCAG GTGCTCACTA TACGCAATTT ATGGTTTATG ATCTGGATGG CGATGGAAAA GCGGAGGTGG CCTGCAAGAC CGCTGATGGG ACAAAAGACG GGATAGGAAA AGTAATCGGG AACGCAAATG CAGACTATAC AAATTCTAAT GGCTACATTT TATCCGGCCC CGAGTACCTG ACCATTTTTA ATGGAGAGAC AGGAGCGGCA CTTACTACTA CCGACTATGA TCCTCCAAGA GGCACTGTAA GTTCCTGGGG AGACAGTTAC GGGAACCGTG TAGACCGTTT CTTGGCTTGT ATTGCCTACC TTGACGGTGT CCATCCAAGT CTTGTCATGT GCAGAGGCTA TTATACAAGA GCAGTTCTTG CAGCCTATAA CTGGAGAGAT GGCAAGCTTA CAAAAGTATG GACCTTTGAC AGTAAAGATA GTGGAAATTC TGCATATGAG GGACAGGGGA ACCACAATCT TAGTGTAGGG GATGTTGATA GCGACGGAAT GGATGAGATT GTTTACGGAG CTTGTGCAAT AGACCATAAC GGGAGAGGCC TGTATACTAC AAAGCTCAAT CACGGAGATG CAATGCACTT GTCTGATATG GATCCGGGCA GACCAGGACT TGAAGTCTGG CAGTGTCATG AAACTGGCTC CAATGCATCT TCCGGTGAGT ACCGTGATGC AAGAACAGGA CAACTAATAT GGGGACTTGC AGGTACATCA GACACCGGAA GGGGACTTGC ACTTGATATA GATCCCCGAT ACAAGGGTTT TGAAATGTGG TCATCAAGCA GTGACGGAGT ATACAATGTA AACGGTACAA AGGTGTCAAC CACAAAGCCC TCTATGAATT TCGGAGTTTG GTGGGATGGG GATTTAAACC GTGAACTTTT GGATGGAACT AAGCTCGACA AATGGGATTA TGTCAATAGT ACTCCCATGA GACTGCTTAC TCCTGCTGAT TGTGCTTCAA ACAACTCAAC AAAAGCAAAT CCTTGCCTTA GTGCGGACAT TTTAGGAGAT TGGCGAGAGG AAGTTATTTG GAGGACTACG GATAATAAGT ACCTGCGTAT TTATACAACA ACTGCAGTTA CGAGCAACAG AATTTATACC TTAATGCACG ATCCCCTGTA CAGACTTAGT ATTGCTTGGC AGAATGTGGC GTACAATCAG CCTCCGCATA CAGGATTTTA TCTTGGTGAT GGTATGAGTC AGGCACCAAC ACCTAATATT TATATTGCAA AGCCTGCTTC CTCGTTTTTA CCCGGTGACG TAAACAATGA CGGATCTGTT GATGCCTTGG ATTTTGCCAT TGTTAAAAAA TACCTATTAG GTCAATCAAC AGAGTTAGAT ACCGGTAAAG CAGACATGAA TTCAGACGGG GACATCAATG CACTAGACTT GGCATTACTA AAAGCAAGTT TGCTTTCGTA G
|
Protein sequence | MRKKIFMLLL ALLQVALLTI PQQLVTAASA RQMEYLDRGI VAVKVSNGVF VSWRMLGTDS TSIAFNLYRD GKKVNTTPIT SGTNYVDTTG TASSTYYVCP VKDGVELAKS ESTAVWGQNY LSLPLQIPAG GTTKSGEYYT YSANDCSVGD LDGDGQYEII LKWDPSNAKD NSQSGYTGNV YLDAYKLNGK FLWRIDLGRN IRAGAHYTQF MVYDLDGDGK AEVACKTADG TKDGIGKVIG NANADYTNSN GYILSGPEYL TIFNGETGAA LTTTDYDPPR GTVSSWGDSY GNRVDRFLAC IAYLDGVHPS LVMCRGYYTR AVLAAYNWRD GKLTKVWTFD SKDSGNSAYE GQGNHNLSVG DVDSDGMDEI VYGACAIDHN GRGLYTTKLN HGDAMHLSDM DPGRPGLEVW QCHETGSNAS SGEYRDARTG QLIWGLAGTS DTGRGLALDI DPRYKGFEMW SSSSDGVYNV NGTKVSTTKP SMNFGVWWDG DLNRELLDGT KLDKWDYVNS TPMRLLTPAD CASNNSTKAN PCLSADILGD WREEVIWRTT DNKYLRIYTT TAVTSNRIYT LMHDPLYRLS IAWQNVAYNQ PPHTGFYLGD GMSQAPTPNI YIAKPASSFL PGDVNNDGSV DALDFAIVKK YLLGQSTELD TGKADMNSDG DINALDLALL KASLLS
|
| |