Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2811 |
Symbol | |
ID | 4809648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3316953 |
End bp | 3318728 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108231 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039203 |
Protein GI | 125975293 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAACA AAACTTCGAT AAAAGTTTTC CTGGCATTGA TGCTTGCGGT TCTTATGCCG GTGGTTTCAT GCATCAATGT GTCAAATGCA GTGCTTTCTG ACGGGGATAA GTATGAGTTT GAAGACGGTA TCCATAAAGG TGCCCAGATT TATACCGACT ATGTCGGCCA AAATGAATAC GGCGAGGTAT TCGACCTTAC CGGAAGCACA TGCAGCTTTA TTGCACAAAA GGGTACAAGC ACTTCGGTGA ATGTTGAGGT TGACAAGGAA GGACTTTATG AAATATTCAT CTGCTATGTG CAGCCTTACG ACAAAAACAA AAAAGTGCAG TATCTTAATG TAAACGGTGT CAATCAGGGA GAAATAAGCT TTCCGTTTAC TCTTAAATGG AGAGAGATTT CAGCAGGAAT TGTAAAGCTT AATGCAGGTA TCAACAATAT TGAACTTGAA AGCTACTGGG GTTATACCTA TTTTGATTAT CTGATTGTCA AACCTGCGGA TGAAAGCATT GTAGAACTTA AGGTTCCGAA AAAATTGGTA AATCCCAATG CCACAAAAGA AGCTAAAGCC CTTATGAGTT ATCTGGTTGA TATTTACGGT AAACACATCC TTTCGGGTCA GCAGGAGATC TGCGGTTCCC ACAACTATCC GGGATCTGAG GCGGAGTTTA CATACATACA GGAAAAGACC GGGAAACTGC CTGCGATAAG AGGTTTTGAC TTTATGAACT ACAGAGGGAA CGGTTTGATG TGGGACGACC AGTGTGCCGA GCGTGTTATC GAATGGTACA AGGAAAAAGG CGGTATACCG ACGGTTTGCT GGCACTGGTT CTCGCCCGGT GATATTGGAA AAAAAGCGGA CAACAGTTTC TATACAGAAA GTACGACTTT CAGCATATCA AGGGCTTTGA CTCCCGGAAC GGAGGAAAAT ATTGCACTGC TTAACGATAT CGACACCATG GCCAGAAAGC TCAAGCAGGT TCAGGATGCC GGAGTTCCGG TACTGTTCAG ACCGCTCCAT GAGGCGGAAG GTGGATGGTT CTGGTGGGGA GCCGAAGGTC CGGAGCCTTG TGTCAGACTG TACAGGCTGC TCTATGACAA ATTTACCAAT GAATATGGTT TGAACAATCT TATCTGGGTT TGGACTTCAT ATGATTATGA AACCTCGGCT GCATGGTATC CGGGCGATGA TGTGGTGGAT ATCATCGGTT ACGACAAATA TAATGCAAAA GACGGAAAAC CCAATGGAAG TGCGATTTCA TCCACATTCT ACAATCTGGT GAAACTTACT AATGGCAAAA AGTTGGTTGC GATGACTGAA AATGATACAA TTCCGAGAGT TTCAAACCTT GTAAATGAAA AAGCAGGATG GCTTTATTTC TGTCCGTGGT ACGGCTGGTG GCTGACAAGC GAACAGAACA ATCCTGTGGA TTGGCTTGTT GAAATGTATC AGAGCGATTA CTGCATAACT TTGGACGAGC TTCCTGACTT AAAGAATTAT CCTATATCGG ATTATGAGGA TTCCAATCCG GATCCGTCTC CGACACCTAC ACAGCCGCCG AAAATTACCT ATGGAGATTT GAACGGAGAC GGCAAAGTCA ATTCAACGGA CCTGACAATT ATGAAAAGAT ATATTCTCAA AAACTTTGAT AAGCTAGCCG TCCCTGAAGA AGCGGCTGAC CTGAACGGGG ACGGAAGGAT AAACTCAACG GACCTTTCGA TACTACACAG GTATCTGCTT CGCATAATAA CGAGTTTTCC CGTTGAACAA CAGTAG
|
Protein sequence | MENKTSIKVF LALMLAVLMP VVSCINVSNA VLSDGDKYEF EDGIHKGAQI YTDYVGQNEY GEVFDLTGST CSFIAQKGTS TSVNVEVDKE GLYEIFICYV QPYDKNKKVQ YLNVNGVNQG EISFPFTLKW REISAGIVKL NAGINNIELE SYWGYTYFDY LIVKPADESI VELKVPKKLV NPNATKEAKA LMSYLVDIYG KHILSGQQEI CGSHNYPGSE AEFTYIQEKT GKLPAIRGFD FMNYRGNGLM WDDQCAERVI EWYKEKGGIP TVCWHWFSPG DIGKKADNSF YTESTTFSIS RALTPGTEEN IALLNDIDTM ARKLKQVQDA GVPVLFRPLH EAEGGWFWWG AEGPEPCVRL YRLLYDKFTN EYGLNNLIWV WTSYDYETSA AWYPGDDVVD IIGYDKYNAK DGKPNGSAIS STFYNLVKLT NGKKLVAMTE NDTIPRVSNL VNEKAGWLYF CPWYGWWLTS EQNNPVDWLV EMYQSDYCIT LDELPDLKNY PISDYEDSNP DPSPTPTQPP KITYGDLNGD GKVNSTDLTI MKRYILKNFD KLAVPEEAAD LNGDGRINST DLSILHRYLL RIITSFPVEQ Q
|
| |