Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1011 |
Symbol | |
ID | 7312173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 1256230 |
End bp | 1257645 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643607938 |
Product | glycoside hydrolase family 43 |
Protein accession | YP_002505353 |
Protein GI | 220928444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC AGGGATTCAA TCCATATCTT CCGTCATGGG AATATATTCC TGATGGTGAA CCGTATGTTT TTAATAACAG AGTATATGTC TATGGCTCAC ATGACCGTTT TAACGGGTAT GTATACTGTT TGAACGATTA CGTTTGCTGG TCCGCACCAG TCGATGATCT TGGCAACTGG AGGTATGAGG GAGTTATTTA CAAAAAGACT GATGACCCTC TCAATCCCGA TGGCAGCATG TGCCTTTATG CACCTGATGT TACTGTAGGC CCCGACGGAA GGTATTATTT GTACTATGTA CTTGACAAGC TTCCTATAGT GTCGGTTGCA GTCTGTGATA CCCCTGCTGG AAAATATGAA TTTTATGGCT ATGTTCACTA TTCCGACGGA ACCCGTCTTG GCGAAAGAGC CGATGATCAG CCCCAGTTTG ACCCCGGAGT ACTAACTGAA GGTGATAGAA CTTATTTGTA CACAGGTTTT TGTGCTGTAG GAGATAAGTC AAGAAAAGGG GCCATGGCAA CAGTGTTGGG ACCGGATATG CTGACAATTG TTGAAGAACC TGTATTCATA GCTCCTAGTC AGCCTTATAG TAAGGGCAGC GGCTATGAAG GACATGAGTT TTTTGAAGCC CCTTCCATCA GGAAAAATGG GGATACCTAT TACTTTATAT ATTCTTCGGT TGTATTTCAT GAACTTTGTT ATGCTACAAG CAAACACCCC ACAAAGGGTT TTGAATATAA GGGAGTTATT GTAAGTAATA GCGATCTTCA TATCGATACC TACAAGCCTG CCCAAAAACC TATGTTTTAT GGCGCCAATA ACCATGGAAG TATAGTTGAA ATTAATGGTA AATGGTATAT TTTTTACCAC AGGCATACCA ATGGAACAAA TTTCAGCAGG CAAGCCTGTT GCGAACAAAT TGAAATTCTG GAGGACGGCA CTATTCCACA GGTAGAAATG ACCTCCTGCG GATGCAACGG AGGGCCGTTG GAAGGACGCG GTGAATACGC CGCCTATCTG GCATGCAACT TGTTCTGCGA AGAAGAGTCG TTATATACTG ATTGGACAGC CTCATGGATG AACAATCAGT TCCCAAAAAT AACTCAAGAC GGAAGAGATG GAGACGAAGA AATCGGTTAC ATTGCCAACA TGAAGGCTTC TGCCACAGCT GGCTTCAAAT ATTTTGACTG TAAGGGAATT AAAAAGGTTA AAATCAAGGT ACGCGGATAC TGTCAAGGTG ACTTTGAAGT AAAAACCGCA TGGGATGGTC CGGCCCTTGG AAAGATATCG GTAGGTTTTA CCAATGTATG GAAGGAGTAT TCTGCCGACA TAGTCATTCC CGACGGAGTA CAAGCACTGT ATTTCACATA TACGGGCCAG GGTAGTGCAA GCCTTGCTTC TTTTACACTG GAATAA
|
Protein sequence | MKKQGFNPYL PSWEYIPDGE PYVFNNRVYV YGSHDRFNGY VYCLNDYVCW SAPVDDLGNW RYEGVIYKKT DDPLNPDGSM CLYAPDVTVG PDGRYYLYYV LDKLPIVSVA VCDTPAGKYE FYGYVHYSDG TRLGERADDQ PQFDPGVLTE GDRTYLYTGF CAVGDKSRKG AMATVLGPDM LTIVEEPVFI APSQPYSKGS GYEGHEFFEA PSIRKNGDTY YFIYSSVVFH ELCYATSKHP TKGFEYKGVI VSNSDLHIDT YKPAQKPMFY GANNHGSIVE INGKWYIFYH RHTNGTNFSR QACCEQIEIL EDGTIPQVEM TSCGCNGGPL EGRGEYAAYL ACNLFCEEES LYTDWTASWM NNQFPKITQD GRDGDEEIGY IANMKASATA GFKYFDCKGI KKVKIKVRGY CQGDFEVKTA WDGPALGKIS VGFTNVWKEY SADIVIPDGV QALYFTYTGQ GSASLASFTL E
|
| |