Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0203 |
Symbol | |
ID | 7309107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 228204 |
End bp | 230342 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643607132 |
Product | glycoside hydrolase family 3 domain protein |
Protein accession | YP_002504570 |
Protein GI | 220927661 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGC CGAAATATTT GGACAAAAGC CTTTCGTTCA AGGAAAGAGC TGTCGATCTT GTATCAAGAA TGACATTAGA AGAAAAAGCT TCACAATTGA GGTACGATGC ACAGCCTGTC GAAAGACTGG GAATTCCCAG ATATAACTGG TGGAACGAAG CTCTGCACGG AGTTGCAAGA GCGGGAGTTG CAACGGTATT TCCACAAGCA ATAGGGCTGG CTGCTATATT TGATGATGAA TTTCTTGAGA AAATTGCAGA TGTAATTGCA ACAGAAGGAC GAGCTAAATA CAATGAGAGT TCAAAAAAAG GTGACAGGGA TATATACAAA GGTATAACTT TCTGGTCTCC CAATGTTAAT ATTTTTAGAG ATCCAAGATG GGGTCGTGGA CATGAAACAT ACGGAGAAGA TCCGTACTTG ACGTCAAGGC TTGGAGTTGC ATTTGTAAAG GGTTTGCAGG GCGATGGTAA ATATCTGAAA TCTGCTGCCT GTGCAAAACA CTTTGCGGTT CACAGCGGCC CTGAGGACGA TAGACACCAC TTTAATGCTG TAGCTTCGCA GAAGGACATG TATGAAACAT ACCTGCCTGC TTTTGAAGCA CTTGTCAAGG AAGCAAAGGT AGAATCCGTT ATGGGAGCTT ATAACAGAAC TAACGGAGAA CCATGTAATG GTAGCAAAAC CCTTTTGAAG GATATTTTAA GAGACGATTG GGGCTTCGAC GGACATGTAG TTTCAGATTG CTGGGCAATA AAGGATTTTC ATGAAGGACA CGGAGTTACC AAAACACCTA CCGAATCTGT AGCTCTTGCA TTGAAAAATG GATGTGACCT TAACTGCGGG AATATGTACC TTCTTATTCT CTTGGCATTA AAAGAAGGAA AAATAACTGA AGAGGATATT GACCGTGCAG CGATCAGACT AATGACAACC AGAATGAAGC TGGGTATGTT TGATGACGAC TGTGAATTTG ATAAGATTCC TTATGAAGTA AATGATTCTA TAGAACATAA TAAGCTTTCA TTGGAAGCTG CAAGAAAATC CATGGTATTA CTTAAAAATA ACGGGTTATT GCCATTGGAC AGCAAAAAAA TTAAAAATAT AGCTGTTATC GGACCAAACG CGGACAGCAG TTTAGCACTC CGGGCCAATT ACAGCGGAAC GCCATCTCAC AATATTACTA TCCTTGACGG TGTACGTAGC AGGGTTTCAG AGGATACAAG GGTGTGGTAT TCACTGGGAA GTCACCTATT CATGAATAGA GAAGAGGATC TCGCACAGCC TGATGACAGA CTGAAAGAGG CTGTATCTAT GGCAGAGAGA AGTGATGTAG TCGTCCTATG TCTCGGGCTT GACGCATCAG TAGAAGGGGA ACAGAACGAT CAGGGCACTG TTATACTGGA TGCAGGAGGC GACAAGGCCG ATCTCAATCT GCCGGAATCC CAGAGAAATC TGCTTAATGC AGTACTTGCA ACAGGTAAGC CTACAATCGT AGCCTTGCTT TCAGGAAGTG CATTATCAAT TGGAGATGCA GCAGATAAGG CGGCAGCCAT AGTTCAGTGC TGGTATCCTG GTTCAAAGGG TGGACTTGCA TTTGCAGAAA TGATATTCGG AGATTATTCT CCAGCAGGAA GACTTCCTGT TACCTTCTAC AAATCCACTG AAGAGCTTCC TCCGTTTGAA GACTACTCAA TGGAAAACAG AACCTATAAG TTCATGAAGG GCGAAGCCCT GTATCCTTTC GGATTCGGCT TATCCTATAC TAACTTTGAA TATTCCAATA TTGTGTGCCC GCAGGCTGTA AATAATGGAG AGAGCCTGTC TGTATCAGTG GACGTACAGA ATGCAGGAAG TGTTGATTCA GACGAGGTTG TACAGGTATA TATAAAGGAT ATGGAAGCCT CGGTAAGAGT TCCTAATCAT AGTCTTTGCG GCTTTAAGCG TATATTCTTG AAGAGCGGTG AAAAGAAGAC CGTGACTTTT GAAATAGATT CAAGGGCAAT GACTATAGTT GATGAAGAAG GAAAACGTTA TATTGAAAAT GGTGATTTTA CATTGTATGT AGGCGGTGCA CAACCCGATA ATGTCAGCGA AAGATTATTA GGTAAAAAGC CGTTGGTAGC ATCATTTAAC GTTAAGTAA
|
Protein sequence | MNKPKYLDKS LSFKERAVDL VSRMTLEEKA SQLRYDAQPV ERLGIPRYNW WNEALHGVAR AGVATVFPQA IGLAAIFDDE FLEKIADVIA TEGRAKYNES SKKGDRDIYK GITFWSPNVN IFRDPRWGRG HETYGEDPYL TSRLGVAFVK GLQGDGKYLK SAACAKHFAV HSGPEDDRHH FNAVASQKDM YETYLPAFEA LVKEAKVESV MGAYNRTNGE PCNGSKTLLK DILRDDWGFD GHVVSDCWAI KDFHEGHGVT KTPTESVALA LKNGCDLNCG NMYLLILLAL KEGKITEEDI DRAAIRLMTT RMKLGMFDDD CEFDKIPYEV NDSIEHNKLS LEAARKSMVL LKNNGLLPLD SKKIKNIAVI GPNADSSLAL RANYSGTPSH NITILDGVRS RVSEDTRVWY SLGSHLFMNR EEDLAQPDDR LKEAVSMAER SDVVVLCLGL DASVEGEQND QGTVILDAGG DKADLNLPES QRNLLNAVLA TGKPTIVALL SGSALSIGDA ADKAAAIVQC WYPGSKGGLA FAEMIFGDYS PAGRLPVTFY KSTEELPPFE DYSMENRTYK FMKGEALYPF GFGLSYTNFE YSNIVCPQAV NNGESLSVSV DVQNAGSVDS DEVVQVYIKD MEASVRVPNH SLCGFKRIFL KSGEKKTVTF EIDSRAMTIV DEEGKRYIEN GDFTLYVGGA QPDNVSERLL GKKPLVASFN VK
|
| |