Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3412 |
Symbol | |
ID | 7311973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3964636 |
End bp | 3967110 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643610317 |
Product | glycosyltransferase 36 |
Protein accession | YP_002507680 |
Protein GI | 220930771 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTG GTCATTTTAA TCCAGTTAAC AAGGAGTATG TTATTACTCG CCCGGATACT CCTGCCCCTT GGTGTAATTA CCTTGGGTCT GTAGACTACG GTGCAATTAT ATCCAACAAT GCTACAGGCT ATAGTTTTGT AAAATCCGGT GCAGCCGGGA GAATAATTCG TTTCAGATTA AATTCCATGT CCAACGATCA ACCCGGCAGA TATATTTATA TACGGGATAA TGCAGACGGG GACTACTGGT CAGGCTCATG GCAGCCGGTT TGCAAATCAA TTGACAGCTA TAAGAGTGAG TGCAGACATG GAACTGCCTA TACTATTATT TCTTCCTCCT ACAAGGATAT AGAAACCCGT ACTCTTTATT ACGTTCCCCT TGATAAAAAT TATGAAGTAT GGAATATCAG AATAAAAAAC AGTAGTAATA ATAAAAGACA CCTATCCATA TATGGTACGG CGGAATTCAC AAACCACGAC CACTATGAAA ATGACACTGT CAATCTGCAG TATTCGCAAT TTATAAGCAA AACATATTTC AAGGATAACC ACATACTTCA GGTAATAAAT GAAAACGGCA GCGAAGCATC ATCAGATGTT GAAGGAACCT CCAATAAAAA GGGTGACCCC ATATACAGAT TTTTTGGTCT GGCAGGCCAG TCCGTTTCTG CTTTCGACGG CGAAAGAGAT ATGTTTATAG GTAACTACAG AAATTATGGA AATCCTGTTG CCGTAGAATT AGGAAAGTGT TCAAATACTG TTGCCTACAA CGGAAATTCC TTTGGAGCTC TCCAGACAGA TATAGAATTG AATCCTGGTG AGGAAACTGA AATGACTTTC CTTCTTGGAG CAGGAAATGA AGATTTCGCA AGAAATATTA TTTCTAAATA TGATTCGGTT GAAAAAGCTA ATATATGTTC TTGTGAAGGT TTTTGCAACT TCAGCGAGGT TGTATCACAT GAGCTTACGC AGCTGAAAAA CTTCTGGCAT TCACGATTGG ATAACCTTCA GGTGGAAACC CCGGATGATA ATTTTAATAA CATGCTGAAT GTCTGGAATG CCTACCAGTG CTTTATAACA TTTTTCTGGT CAAGGGCCGC CTCTTTCCAA TACTGCGGCC TGAGAAACGG GTTAGGGTAC CGTGACACTG TACAGGATAT ACAGGGTATA ATCCACTTGG ATTATAAAGC AGCTAAAGAA CGGCTTTGGC TTATGCTTTC AGGACAAGTT TTAAATGGCG GCGGTCTGCC TCTTGTGAAG TTCGACCATA AGCCGGGACA GGAAGCTACT CCTGATGAAT CACAATATGC AAAAGAAACG GGACAATCCT TCTACCGGGC GGATGATGCA CTGTGGCTCT TCCCGACTGT AATTACATAT ATCAAGGAAA GCGGTGACTG GAACTTTATA GACGAAAAAG TTCCTTATGC AGACAAGGGT GAAGCAACCG TTTATGCCCA TCTTAAACAG GCGATTCAGT TTAATCTGGA CAGACAAGGC AGTCATGGCC TGCCTGTAGG GTTATTTGCA GACTGGAACG ATTGTTTGAG GCTTGGCTCT AAAGGTGAAT CACTTTTTGT TACCTTCCAG CTGTATTATG CACTTAAAAT TTTTAAAGAA TTTGCTACAA AAAAGGATGC TTTGGCTGAC ATTGAATGGG CACAAAACTG TCTTAATGAA TTAAGTGGGA ATATTCAAAA GTTTGCATGG GAAGGGGATC AATTTGTTCG TGGGTTTACA GAGGATGGAT ATACTATAGG ATCAAAAAGT AATTCCGAAG CAAGCCTGTG GCTTAACCCT CAAGTCTGGT CCGTTATCAG CGGTGCTGCT GACGAAAAAA CAGCTAAAAC CGTTCTTGAC AAGGTATATG ACAATCTCAA TACTAAATAT GGTGCAATGT TGTTTTACCC GGCTTTCAGA GAATACGGAC TTCCTGTTGC AAGAATGTCC CTTTTTAATG CAGGAACCAA AGAAAATGCC GGAATTTTCT CTCAGCCCCA AGGTTGGGTA ATACTAGCTG AAACAATCAT AGGAAATGGC AACAGAGCCT ACGAATATTT TACTGAAATT AATCCTGCCG CCATGAATGA CCATGCTGAA ATAAGAAAAC TGGAGCCGTA CATACATGGT CAGGCCACTG AAGGGATTGA TACCCTAAAC CACGGACGTT CACATGTTCA TTGGCTGACA GGTACTGCCT CAACTGTTAT GGTTTCCATG GTATACGGTA TTCTGGGATT ACAACCTGAA TATAACGGTA TAAAAATAAA TCCATGCATC CCTTCAGGCT GGAAGAATTT CAAAATGAAC AAAGTATTCA GAAATACTGT TCTCAACATA ACCGTCGATA ACAGTCAGGG CGTTGAAAAG GGAGTACATT ATATTACCGT AAATGGAAAA CGTATTGATG GCTGTTATAT CAGTGCGGAT GAACTTAAAG ATACTAATGA AATATTAGTA GTAATGGGTA AGTAA
|
Protein sequence | MNFGHFNPVN KEYVITRPDT PAPWCNYLGS VDYGAIISNN ATGYSFVKSG AAGRIIRFRL NSMSNDQPGR YIYIRDNADG DYWSGSWQPV CKSIDSYKSE CRHGTAYTII SSSYKDIETR TLYYVPLDKN YEVWNIRIKN SSNNKRHLSI YGTAEFTNHD HYENDTVNLQ YSQFISKTYF KDNHILQVIN ENGSEASSDV EGTSNKKGDP IYRFFGLAGQ SVSAFDGERD MFIGNYRNYG NPVAVELGKC SNTVAYNGNS FGALQTDIEL NPGEETEMTF LLGAGNEDFA RNIISKYDSV EKANICSCEG FCNFSEVVSH ELTQLKNFWH SRLDNLQVET PDDNFNNMLN VWNAYQCFIT FFWSRAASFQ YCGLRNGLGY RDTVQDIQGI IHLDYKAAKE RLWLMLSGQV LNGGGLPLVK FDHKPGQEAT PDESQYAKET GQSFYRADDA LWLFPTVITY IKESGDWNFI DEKVPYADKG EATVYAHLKQ AIQFNLDRQG SHGLPVGLFA DWNDCLRLGS KGESLFVTFQ LYYALKIFKE FATKKDALAD IEWAQNCLNE LSGNIQKFAW EGDQFVRGFT EDGYTIGSKS NSEASLWLNP QVWSVISGAA DEKTAKTVLD KVYDNLNTKY GAMLFYPAFR EYGLPVARMS LFNAGTKENA GIFSQPQGWV ILAETIIGNG NRAYEYFTEI NPAAMNDHAE IRKLEPYIHG QATEGIDTLN HGRSHVHWLT GTASTVMVSM VYGILGLQPE YNGIKINPCI PSGWKNFKMN KVFRNTVLNI TVDNSQGVEK GVHYITVNGK RIDGCYISAD ELKDTNEILV VMGK
|
| |