Gene Information |
Locus tag | Ccel_2354 |
Symbol | |
ID | 7311026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2770902 |
End bp | 2773292 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643609280 |
Product | glycosyltransferase 36 |
Protein accession | YP_002506668 |
Protein GI | 220929759 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
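The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Ccel_2354.
length_bp = gene_length(2770902, 2773292)
print(length_bp)                  # 2391, matching "Gene Length"
print(protein_length(length_bp))  # 796, matching "Protein Length"
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length, which is a quick sanity check when copying records like this one between tools.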
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACG GTTATTTTGA TGATTGTAAT AAGGAATATG TTATTACAAG ACCCGATACA CCTACTCCAT GGTCAAATTA TTTAGGATCA ATGGAATATG GGGCTTTAAT AACTAATAAC GCAGCCGGCT ATAGCTTCGT GAAATCCGGC TCTGAGGGGC GAATTTTACG TTTTCGGTTC AACTCAGTTA CACAGGATAT GCCAGGTAGG TTTATTTACA TAAAGGATCA GAATTCGGGA GATTACTGGT CAGCCTCCTG GCAGCCGACA GGTAAGGAGT TGAGTGACTA TAAATCGGTT TGTCGGCATG GTACAGCTTA TACTGTAATT TCTTCAGGCT ATGACAAAAT TTCTTCCGAG ACGCTTTACT ACGTTCCGTT GGGTCAAAAC CACGAGGTCT GGCATTTTAA AATAAGGAAC AATGATACAA AAAGACGGAA AATATCTGTT TTCAGCTATG CGGAATTTAC AAGTGATAAT AGTTCCATGC TGGATATGGA GAATATCCAG TACACCCGGT TTCTAAGCCG TACGTATTTT AAGGACAACT ACATACTACA GTCACTAAAA GAACTGACCG AGGAAAGGGT TTTTCGGTTC TTTGCCGCAA GCGGCGGAGT GAAGGTTTCA GGATATGATG GTTCAAGAGA AAAATTTGTA GGACCCTACG GGTCATACAG TAACCCTTTG GCACTGAAGA ACGGATATTG CAGTAATTCG CTGAATTATA CAGGAAACTC CTGTGGCTCT CTTCAAATAG ATTTGGAATT ATTGCCCAGT ACCGAGAAGG AAATCGTTTT TATTCTTGGG GAGGGTAATG AAGAGACGGC TGAAATGAAG GTAAAGCATT ACAAGGGCGG GGGAGTGGTT GAAGAGGAGC TGGCACAGTT AAAGGCATAC TGGCATGGTA AGCTAGAAGT ATTTCAGGTA AAAACGCCTG ATTCTGCCTT TAACAGCATG ATGAATGTAT GGCACAGCTA TGAATGCTTT GTGAATACCT TCTGGTCCAG AACAGCTTCC CTTATTTATT CAAGCCTGAG AAATGGCTTT GGTTACAGAG ATACAATGGC AGATATTCAG AGTATTATGC ATCTTGACAG TAAGCTTGCA GGTGAAAGAT TGGTTACAAT GTTATCGGGG CAGGTATCAA ACGGGGGAGC TCTTCCTCTT GTAAGGTTTG ATCATAAGCC CGGGGCCGAA CCTGTTCCAG GTTCTTCAGA GTATCAAGAA AAGACAGGCT ATAAGGAATA TCGCTGTGAT GATGCACTTT GGCTGTTTCA GGCTGTTCCC CAATACATAA GGGAAAGCGG TGAGCTTGAT TTTCTCAACA AGATTATTCC CTACTCCGAC AAAGGCGAAG ATACTGTTTA CCTGCATCTA AAGAAGGCTC TAAATTTTAG CCTTGAAAGA TTGGGGCGGC ATAACCTGGT ACTGGGCATT GATACAGACT GGAACGATTG TCTGAGACTT GGAGAGAACG GAGAATCTGT TTTTGCCTCC TTTCAGCTTT ATCTGGCAAT ATGTGAATTC AAAAAAATTG CACTGAGTAA TGGGAACTGT GAGGATGTAG ATTGGGCGGA AACGAACAGA AAAAAACTAT ATGACAGTCT ACAGAAATAT TGCTGGGAGG ACGGCCAGTT TATAAGAGGC TTTACAGGGG ACAATCAGGT AATTGGTTCG CCTAAAAGCA AGGAAGCTGC TTTGTGGCTG AATCCACAAA CATGGTCAGT TATTAGTGGC GTTGCAACAC ATGACCAGGC CAAAAAAGCA TTAGACAAAG TACACGATAT CCTCAAAACC 
AAATACGGTG CAATGCTTTT CTATCCATCT ACAAAGACGA TTGGACCGCC TATATTCCTT ATGAGCTTGT ATCCACCCGG AATAAAGGAA AATGCCAGTA TATTTTTAAT GGCGGAAGCT TGGATTATCC AGGCAGAAGC TATGATGGGT CACGGAAACC GTGCATGGGA TTACTATAAC AGCACTAATC CTGCAGCTCA AAATGACTCG GCTGATTTAC GCCATACAGA GCCGTATGTT TACAGTCAGT TTATTGATGG ACTGGAAAGC CCGAACCACG GCAGATCTCA CGGGCACTGG TTGACAGGTT CCGCATCATC TATAATGACT GCCGTAGTTG AAGAAATTTT GGGACTTAAA GCCGACTACG ACGGCTTGAT TATTGATCCG TGCATTCCAT CGGAGTGGAA AGAATTTAGT ATGGTCAGGC ATTTCAGAGG AAGGAAACTG AATATTATTG TACAGAATTC CGGTGGTGTA GAAAAAGGTG TAAAAAAAAT CAGCATAAAC GATAAAACCA TATTGAACAG CTGCCTTATC CCGCTAGATT GTATGGAGGC AGTAAATACT GTTCAGGTAA TTATGGGTTG A
|
Protein sequence | MNYGYFDDCN KEYVITRPDT PTPWSNYLGS MEYGALITNN AAGYSFVKSG SEGRILRFRF NSVTQDMPGR FIYIKDQNSG DYWSASWQPT GKELSDYKSV CRHGTAYTVI SSGYDKISSE TLYYVPLGQN HEVWHFKIRN NDTKRRKISV FSYAEFTSDN SSMLDMENIQ YTRFLSRTYF KDNYILQSLK ELTEERVFRF FAASGGVKVS GYDGSREKFV GPYGSYSNPL ALKNGYCSNS LNYTGNSCGS LQIDLELLPS TEKEIVFILG EGNEETAEMK VKHYKGGGVV EEELAQLKAY WHGKLEVFQV KTPDSAFNSM MNVWHSYECF VNTFWSRTAS LIYSSLRNGF GYRDTMADIQ SIMHLDSKLA GERLVTMLSG QVSNGGALPL VRFDHKPGAE PVPGSSEYQE KTGYKEYRCD DALWLFQAVP QYIRESGELD FLNKIIPYSD KGEDTVYLHL KKALNFSLER LGRHNLVLGI DTDWNDCLRL GENGESVFAS FQLYLAICEF KKIALSNGNC EDVDWAETNR KKLYDSLQKY CWEDGQFIRG FTGDNQVIGS PKSKEAALWL NPQTWSVISG VATHDQAKKA LDKVHDILKT KYGAMLFYPS TKTIGPPIFL MSLYPPGIKE NASIFLMAEA WIIQAEAMMG HGNRAWDYYN STNPAAQNDS ADLRHTEPYV YSQFIDGLES PNHGRSHGHW LTGSASSIMT AVVEEILGLK ADYDGLIIDP CIPSEWKEFS MVRHFRGRKL NIIVQNSGGV EKGVKKISIN DKTILNSCLI PLDCMEAVNT VQVIMG
|
| |