Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0755 |
Symbol | |
ID | 7309607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 884578 |
End bp | 886518 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643607693 |
Product | glycoside hydrolase family 9 |
Protein accession | YP_002505113 |
Protein GI | 220928204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.549048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTAT TAACTTCTGA AAGAAAGGGA AAAACAAAAG CAAGAATTGC ATGCTTTTTT ATAAGTGGTA TTCTTGCAAC ATCCGGGTGT ATTATCCCCG GAAATTTTAA TCAAAATGTA TATGCGGCAA GTACATCTCC CGGGGATTAT CAGCAGGATT CCCGCATACG TCTCAACTCT ATCGGTTACC TTCCCGAAGC CGAGAAAAAA GCAACAATCG CAGCCTCAAG CAGTGAATTT ATTGTAGTAA ATTCAAGCGG AACAGCTGTT CTCACAAGTA GAACTACATC TGCTTATAAT ACTGATACTA GTGAGCAGGT TAACATTGCT GATTTTTCAT CGGTAAAGAC GGAGGGAAGT TACACCTTGC TGGTGCCGGG AATAGGTAAG AGTGTAACAT TCAAAATTGA TAAAAACATT TACGCGAATC CCTTTAAAAC AGCTATGCTA GGAATGTACC TCTGGCGCTG CGGGACGTCA GTTTCCGCAA CTCATAACGG TAATGTATTC TCTCATGAGA CTTGTCATAC CAAAGATGCA TATACAGACT ATATCAATGG TCAGCATAGC ATAAAGGATG GTGGAAAGGG CTGGCACGAT GCCGGGGATT ACAATAAATA TGTTGTAAAT GCCGGTATTA CAGTAGGTTC CATGTTTTTT GCATGGGAGC AGTTTAAAGA CCAAATAAAA GAAATTTCCT TGACAATGCC GGAGAGTAAC AATTCAATGC CGGACTATTT GGATGAACTG AAATATGAAA CCGATTGGCT TTTAACCATG CAATATCCCG ACGGAAGCGG GAAGGTAAGC CATAAGCTGT CTACCAAAGA CTTTGGTGGA TTTGTGCTAC CTGAAAAAGA AACTACGGAC AGGTTTTTCA CTCCATGGGG AAGTGCTGCA ACAGCAGACT TTGTTGCTAT GATGGCAATG GCTTCAAGAG CATTCCGGCC GTATGATGCA GCATATGCCG ACAAATGTAT TGCTGCCGCA AAAGTCAGTT ATGCATTTTT GAAAGCAAAT CCTTGGAATA CAAAACCTGA TCAAAGCGGT TTTACTACCG GTGCTTATGA TACCACAGAT ACTGACGACA GACTTTGGGC AGCTGCAGAG ATGTGGGAAA CTCTTGGAGA CAGCAGTTAC CTGGCAGATT TTGAAGCGAG TGCAAATACC TTTACCAAGA AAATTGACGT TGATTTCGAT TGGGGAAATG TAAATAATCT TGGAATGTTT ACATACCTGT TGTCTGAGAG GAGCGGAAAA AATCCTGCAC TTTATAATAC AATTAAGAGT GCATTGATTT CTGCAGCTGA CAGTATCGTT GCTATTGCTG ACGGACACGG TTATGGAAGA CCACTAGGAG CTACCTATTA CTGGGGCTGC AACGGTACCG TAGCACGACA GACCATGATT CTTAATATTG CGAATAAGCT ATCACCAAAG TCCGAGTATG TTAATACATC TCTGGATGCA TTAAACTTCC TGTTTGGTAG AAACTATTAT AATCGTTCCT TTGTTACAGG CCTTGGCCTA AACCCTCCAA TGAATCCACA TGACAGACGT TCAGGTGGTG ATTCCTTAAA AGATCCTTGG CCCGGGTATT TAGTAGGTGG TGGTTGGCCG GGAGCCAAAG ACTGGACGGA TAATCAGGAT AGTTATGAGA CCAATGAGAT TGCAATCAAT TGGAATGGTG CATTGATTTA TGCTTTGGCT GCATCTCTTG ACACCATGTC TGATACTCCT GATGTACTAT CCGGAGATAT AAATATGGAT GGAAAAGTGG ATGCTATTGA TTATGCATTG CTGAAACAGA GCATATTAGG TTTGGCCAAT TTAACAGGGG ATGCATTGAG ACTTGCTGAT GTAAACCATG ATAATACAGT CGATGCTTTG GATCTATCAG TTTTAAAACA GTATTTATTA GGTAAGATTA CAAACTTATA A
|
Protein sequence | MGLLTSERKG KTKARIACFF ISGILATSGC IIPGNFNQNV YAASTSPGDY QQDSRIRLNS IGYLPEAEKK ATIAASSSEF IVVNSSGTAV LTSRTTSAYN TDTSEQVNIA DFSSVKTEGS YTLLVPGIGK SVTFKIDKNI YANPFKTAML GMYLWRCGTS VSATHNGNVF SHETCHTKDA YTDYINGQHS IKDGGKGWHD AGDYNKYVVN AGITVGSMFF AWEQFKDQIK EISLTMPESN NSMPDYLDEL KYETDWLLTM QYPDGSGKVS HKLSTKDFGG FVLPEKETTD RFFTPWGSAA TADFVAMMAM ASRAFRPYDA AYADKCIAAA KVSYAFLKAN PWNTKPDQSG FTTGAYDTTD TDDRLWAAAE MWETLGDSSY LADFEASANT FTKKIDVDFD WGNVNNLGMF TYLLSERSGK NPALYNTIKS ALISAADSIV AIADGHGYGR PLGATYYWGC NGTVARQTMI LNIANKLSPK SEYVNTSLDA LNFLFGRNYY NRSFVTGLGL NPPMNPHDRR SGGDSLKDPW PGYLVGGGWP GAKDWTDNQD SYETNEIAIN WNGALIYALA ASLDTMSDTP DVLSGDINMD GKVDAIDYAL LKQSILGLAN LTGDALRLAD VNHDNTVDAL DLSVLKQYLL GKITNL
|
| |