Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0736 |
Symbol | |
ID | 7309590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 857323 |
End bp | 858597 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643607675 |
Product | cellulosome protein dockerin type I |
Protein accession | YP_002505095 |
Protein GI | 220928186 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00031925 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCTAGTTT GGTACTCACA ACAGCAATGG TATTCCTTGC AGCATTGCCG CTGCCAGCTT CGGCTGCGAC TACATACAAA CTTGGTGATG TTGACAATGA TACTCTAATT TCTGCCATTG ATTTAGCAGC TGTACAGCAG CACATACTTG GAAAAAAAAC CTTGACGGGT GAGGCCTTTA AAGCAGCTGA TGTAAATGCT AACGGAGAAA TTGAAGCATT GGATTTAGCC GAACTAAAAC AGTTTCTTCT CGGTAGGATT ACTAAGTTCT CCGGAGAAGG ACAACAACAA CCATCTGGAG TCGGAATAAC TTGGATGGAC GGTAATACAC TGTACCCGGT TGGAGTTAAC TATGCATGGT ACAACTGGTC GTATGAGTTT TCAGATAACA ACTGGAATTC CAACTTTACG AGAATCAAGA GTGATTTGGA CACAATGTCC TCAAAGGGAA TTAATTCTCT GAGATGGTGG GTATTCCCGG ACCTTGCCTA TGGTCCGCTA TGGTCAGGCC CAAATGAAGG AAGCCTTTGT ACAGGACTTC CTGAAAAATG GGTTGACCAT ATGAAGGAAA CTTGTGATTA TGCGTATTCA AAGGGTATAA AAATCTACTG GACTATAACA AGCTTTGACT GTGCAAGAGC AGATGATTCT GTTGACCATG ATGATATCAT TGATAATCCT ATAGTACTTC AAAGCTTCCT TGACAATGCT ATGAAGCCAA TACTGCAAAC ATTGGGCGAA CATCCGGGAG TATTGGGATG GGATATTATT AATGAACCTG AATGGATCAT AAAAAAAGAA GACAACGGTG AACCAAACAA TAAGGGAGAA ATCTTCCCAC TTGCTGCAAT GAGAAACTAC ATAAAAACTA CATGTGATTT TATACACCAA TATGCAAAGC AGCCTGTAAG CTTCGGGAGT GCAAATATGA AGTGGCTTGG TGCACAGTAT GATTTATGGG ACGGATTGGG ACTTGATTTC TACGATTTCC ACTGGTATGA CTGGGCTACT CCGTATTTTA ACCCTGTTAC AACTCCAGCT TCAAGTCTGA AGTTGGACAA ACCTGTAATA ATCGGTGAAA TGATGCCTGA TACCCAAAGT TCTTCACTAA AAATGACACA CAAGCAGGTA CTGGATGCCA TATATAAAAA CGGTTATGCC GGATATATGC TCTGGTCATG GAACGATGGA GCTTTTGACT GCAAACCTTA CGTTGGAAAC AACTTTATTG ATTTTGCCGC AGAGCATCCT GACGTAGTCA GATAA
|
Protein sequence | MKKIASLVLT TAMVFLAALP LPASAATTYK LGDVDNDTLI SAIDLAAVQQ HILGKKTLTG EAFKAADVNA NGEIEALDLA ELKQFLLGRI TKFSGEGQQQ PSGVGITWMD GNTLYPVGVN YAWYNWSYEF SDNNWNSNFT RIKSDLDTMS SKGINSLRWW VFPDLAYGPL WSGPNEGSLC TGLPEKWVDH MKETCDYAYS KGIKIYWTIT SFDCARADDS VDHDDIIDNP IVLQSFLDNA MKPILQTLGE HPGVLGWDII NEPEWIIKKE DNGEPNNKGE IFPLAAMRNY IKTTCDFIHQ YAKQPVSFGS ANMKWLGAQY DLWDGLGLDF YDFHWYDWAT PYFNPVTTPA SSLKLDKPVI IGEMMPDTQS SSLKMTHKQV LDAIYKNGYA GYMLWSWNDG AFDCKPYVGN NFIDFAAEHP DVVR
|
| |