Gene Information |
Locus tag | Ccel_3001 |
Symbol | |
ID | 7311610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3552745 |
End bp | 3553755 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643609905 |
Product | glycosidase PH1107-related |
Protein accession | YP_002507275 |
Protein GI | 220930366 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
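
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(3552745, 3553755)  # Start bp and End bp from the record
    print(length, protein_length(length))
```

Under these assumptions the record's coordinates give 1011 bp and 336 aa, matching the Gene Length and Protein Length fields.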
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.345013 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATTA ACGGTAAGAG TTTAAAGAAT ATTCCGTGGC AAGATAAACC TCTCGGCTGT AATAGCGTTA TATGGAGACA CAAAGGGAAT CCCATTATCG TCTGGAATCC TACACCTAAA ACAGCAAGGA TATATAATAG TTCTGTAGTT CCCTGGAACT CAGGCTTTGC AGGTATTTTC AGAGCAGACC ACAAAGACGG TAAAGCTCAA ATCCATGTTG GATTCAGCAG TGATGGGGTT AATTGGAACA TTGAGGATGA GCCTATAGTA TGGCATGATG AGGATGGTAA TCTGTATCAG CCTAACTATT CGTATGACCC CCGGATTGTG GAATTGGAAG GTATTTATTA TATTGTCTGG TGCACAGACT TCGGTGGGGC CTCTCTGGGC CTAGGTGTCA CAAAAAACTT TAAACAGTTT ACACGTCTTG AAAATCCTTT TATACCTTTT AATCGTAATG GTGTTTTGTT TCCACGCAAG GTAAATGGTA AATACTTACT TTTAAGCAGA CCCAGCGATA CAGGTCATAC ACCTTTCGGA GATATTTTCA TAAGCGAGAG TCCCGATCTT GTTCACTGGG GACGTCACAG ACGTGTAATG CAAAAAGGAG GTTCAGGGTG GTGGCAAAGT GTAAAAATAG GAGCAGGTGC GGTTCCTATC GAAACAACGG AAGGCTGGCT TCTCTTTTAC CATGGTGTTT CAGGAACCTG TAATGGCTTC GTATACAGTT TTGGTGCGGC AATTCTGGAC ATTGAAATCC CTTCTAAAGT TCTTTACCGC ACAAGAGATT ATCTTCTCAC CCCCGAAATG TCATATGAAA CATCAGGTTT TGTACCTAAT GTGGTGTTCC CTTGTGCTGC ACTGCACGAT TCTGAGACTG GCAGAATCGC TATTTATTAC GGTGCCGCCG ACACATATTC CGCTCTTGCA TATGCAAAGG AAGATGAATT AATAAACTTT ATTAAATCAA ATTCCGAGTT GCTGCCAGGC GATGCGGAGG AATATAGATA G
|
Protein sequence | MSINGKSLKN IPWQDKPLGC NSVIWRHKGN PIIVWNPTPK TARIYNSSVV PWNSGFAGIF RADHKDGKAQ IHVGFSSDGV NWNIEDEPIV WHDEDGNLYQ PNYSYDPRIV ELEGIYYIVW CTDFGGASLG LGVTKNFKQF TRLENPFIPF NRNGVLFPRK VNGKYLLLSR PSDTGHTPFG DIFISESPDL VHWGRHRRVM QKGGSGWWQS VKIGAGAVPI ETTEGWLLFY HGVSGTCNGF VYSFGAAILD IEIPSKVLYR TRDYLLTPEM SYETSGFVPN VVFPCAALHD SETGRIAIYY GAADTYSALA YAKEDELINF IKSNSELLPG DAEEYR
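
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene for comparison with the listed protein. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons; the helper names are hypothetical. As a spot check, the first six codons (ATG TCA ATT AAC GGT AAG) translate to MSINGK, the start of the protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two blocks of the gene sequence from the record.
    dna = clean_sequence("ATGTCAATTA ACGGTAAGAG")
    print(translate(dna))
    print(round(100 * gc_fraction(dna)), "% GC in this fragment")
```

Passing the full gene sequence through `clean_sequence` and `translate` allows a direct comparison with the Protein sequence field after it has been cleaned the same way.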