Gene Information
Locus tag | Ccel_1439 |
Symbol | |
ID | 7310212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 1747872 |
End bp | 1750226 |
Gene Length | 2355 bp |
Protein Length | 784 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643608365 |
Product | glycosyltransferase 36 |
Protein accession | YP_002505773 |
Protein GI | 220928864 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
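
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1747872
end_bp = 1750226

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not code for a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported gene length (2355 bp) and protein length (784 aa).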
Plasmid Coverage information
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence
Gene sequence | ATGAGATTCG GGTATTTTGA CCGAAAGAAC AGGGAATATG TAGTAACAAG GCCTGATACA CCGACGCCAT GGATTAACTA CATAGGCAGT GGAAATTATG GTGGTATAGT TTCCAATACA GGGGGAGGTT ACAGTTTTCA TAAGGACCCT CAAAATCGCA GAGTTACACG CTACAGGTAT AATAATATAC CTATGGACAG ACCCGGAAGG TATGTATACA TAAGAAACAA GGACACAGGG GAGTACTGGA ATCCAGGCTA TCAGCCCGTA CAGAAGAAAC TGGACGGATA CAGCTGCCGT CACGGACTTG GCTACAGTGT TTTGACCGGG GAATATAAAG GAGTTATAGG CGAGGTTACA TATTTTGTAC CTGATGATAA GAACTTTGAA CTATGGTTTG TCAAAGTATC TAATACTCGC AGCATACAGC AGAATCTTCA GATTTTTGCA TACTCGGAGT TTTGCTTCTG GGATGCAATA ATGGATCAGC AGAATGTTGA CTGGGTACAG CAGATTAATC AGGGCAGATT TGATGATGGA ATTATTACCT ACCATCCTCA TCACGTTAGT GACAATGCCG CTTTTTTTGC AACGGGTGAA AAGGTAAGCA GCTTTGATAC CAATCTTGAA ACCTTTATTG GAAGATATAG ATCGGAAGGC AATCCTATTG CCGTAGAACA GGGAGCTTGC AGTAATTCCA TATCCTACAG GACAAACGGT GTCGGTGCCT TTTGTATAGA CTGTGACCTT GGCCCCAATG AAGAACGTGA GTTGGTTTTT GTGCTTGGGT TTGCAGAGGA AAAATCAGAG ATAAGAAAAG ACATAAAGGA ATATCTTTTG CCCGAAAATG CTAAAGCGGC GTTTAGCAGA CTACAGGCTT CATGGCTTGA CTTTACATCC AAGCTCAGTG TTGAAACACC TGATGAGGAT ATGAATCTGT TTGTAAACAT ATGGAATCAG TATCAGTGCA AAACTACCCT CAACTGGTCA AGATTTGTTT CACTGTATCA GCTGGGTCTT GGGAGAGGTA TGGGTATCAG AGACAGTGCA CAGGATACAC TTGGCGTAAT GCATACGATA CCTGCCGAGG CAAAGGAGGT TATTATAAAG CTTCTTAAAT GCCAGTATAC AGACGGAAGA GCATATCATC TGTTTTTTCC GCTTACAGGA GAAGGAGGAC AGGGGGATGC TCCCGTCAAG AAATTTGACT GGTATTCCGA CGACCATTTG TGGCTGATAC TTGCTGTAAA TGCTTATATA AAGGAAACTG GAGATTTTGA GTTTTTGAAC ATGGAAGTTC CGTATAACGA TAAAATTACC TCACAGACCG TAATGCGGCA CCTTGATATG GCATTGGAAT TTACAAACAA TAACCGAGGC CCTCATAATA TTGCGTTGGC GGGACGTGCT GACTGGAATG ACACACTTAA CCTTGATACA GGTAAGGGTG TTGCAGAAAG TGTATTTACG TCTATGCTAT ATTGCAGGGC ATTAATAGAA ATGATTGAAA TACTGGATTA CCTTAAAAAT ACAGATATGA TAAAAAAGTA TTCCGACATG TATGAGGATA TGAAGAACGC TATAAATGAT ACCTGTTGGG ATGGAGAATG GTACAAGAGG GCTTTTGATG ATAACAGTCA GCCTCTTGGT TCAAAGGAAA ATAAGTTCGG TAAAATATTC ATAAATTCCC AGTCATGGGC AGTTTTAAGC AAGGTAGCGG AAAACGGAAG AGCAAATGAG TCAATGGAAT CCGTTGAGAA GTATCTCAAT TCGAAATATG GAGTTGTAAC TATGTATCCT 
GCTTACACAG AGTATGACAC CACAAAAGGA GGAGTAACTA CATTTCCACC GGGAACAAAG GAGAATGGAG GGATCTTCCT TCACACGAAT CCTTGGGTAA TGATTTCAGA GGTAATGCTC GGTCACGGGG ACAAGGCCTT CATGTACTAT AATCAGATTT TGCCGGGCAA AAGGAATGAT GATGCGGAGT TGTATGAGGT AGAGCCGTAC GTATACTGCC AGAACATTCT CGGCAAGGAG CATCCTCAGT TTGGTATAGG CAGAAATTCC TGGCTTTCGG GAACAGCTGC ATGGAATATG GTAGCATCAA GCCAGTATAT ACTGGGAATA AGGGCAAACT ATGATTCACT GACGGTAGAT CCGTGTATCC CTTCAAGCTG GAAGGGTTTT AAAGCTACAA GAGTATTCAG AGGTGCCACC TATTATATAG AAGTACAAAA TCCAAACAGA GTTTGTGCAG GGGTCGAAAA AATAATTGTT GATGGCGTTG AGACGGAAAA GATACCTGTT TTTGAGGCAG GAACAGAGCA CAATGTAGTC GTTGTAATGA AATAA
Protein sequence | MRFGYFDRKN REYVVTRPDT PTPWINYIGS GNYGGIVSNT GGGYSFHKDP QNRRVTRYRY NNIPMDRPGR YVYIRNKDTG EYWNPGYQPV QKKLDGYSCR HGLGYSVLTG EYKGVIGEVT YFVPDDKNFE LWFVKVSNTR SIQQNLQIFA YSEFCFWDAI MDQQNVDWVQ QINQGRFDDG IITYHPHHVS DNAAFFATGE KVSSFDTNLE TFIGRYRSEG NPIAVEQGAC SNSISYRTNG VGAFCIDCDL GPNEERELVF VLGFAEEKSE IRKDIKEYLL PENAKAAFSR LQASWLDFTS KLSVETPDED MNLFVNIWNQ YQCKTTLNWS RFVSLYQLGL GRGMGIRDSA QDTLGVMHTI PAEAKEVIIK LLKCQYTDGR AYHLFFPLTG EGGQGDAPVK KFDWYSDDHL WLILAVNAYI KETGDFEFLN MEVPYNDKIT SQTVMRHLDM ALEFTNNNRG PHNIALAGRA DWNDTLNLDT GKGVAESVFT SMLYCRALIE MIEILDYLKN TDMIKKYSDM YEDMKNAIND TCWDGEWYKR AFDDNSQPLG SKENKFGKIF INSQSWAVLS KVAENGRANE SMESVEKYLN SKYGVVTMYP AYTEYDTTKG GVTTFPPGTK ENGGIFLHTN PWVMISEVML GHGDKAFMYY NQILPGKRND DAELYEVEPY VYCQNILGKE HPQFGIGRNS WLSGTAAWNM VASSQYILGI RANYDSLTVD PCIPSSWKGF KATRVFRGAT YYIEVQNPNR VCAGVEKIIV DGVETEKIPV FEAGTEHNVV VVMK
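
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of that clean-up, followed by a translation and GC calculation on the first three blocks of the gene sequence only. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced here for illustration.

```python
from itertools import product

# First three 10-base blocks of the gene sequence, as printed in the record.
raw_dna = "ATGAGATTCG GGTATTTTGA CCGAAAGAAC"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used only for display."""
    return "".join(seq.split()).upper()


# Amino acids listed in NCBI codon order (bases T, C, A, G at each position).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


dna = clean(raw_dna)
protein_prefix = translate(dna)
print(protein_prefix)
print(f"GC fraction of this prefix: {gc_fraction(dna):.2f}")
```

The translated prefix should match the first block of the protein sequence, MRFGYFDRKN. Applying the same functions to the full gene sequence would allow a comparison with the reported GC content of 42%; the figure printed by this sketch covers only the 30-base prefix.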