Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0032 |
Symbol | |
ID | 4808797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 38313 |
End bp | 40085 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105441 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036466 |
Protein GI | 125972556 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4124] Beta-mannanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00146021 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAA TTTTTGGCAG GACACTGAGT CTGCTGGTAA CATTTGCAAT GGTGTTTTCC GTTCTTTTAG TCATGCCCGT TTCAACTTAT GCTGCATATT CCCTTCCTGT GGACGTTGAA GCAGAAGATT GCACTCTTGG CAACGGTGCC GTTGTTACCA CCAATGTATA CGGAACTCAA TATCCCGGAT ATTCCGGCGA CGGATTCGTA TGGGTGGCCA ACTCGGGAAC GATAACATTG GAAGTCACCA TTCCTGAAAA CGGTATGTAT GAGCTTTCCA CAAGATGCTG GATGTATCTT GGCAAAGAAG ATGAGACCAG AATGCAGGTT ATAAGCATCA ACGGAAAATC ACACAGCAAC TATTTTATTC CAAACAAAGG CCAATGGATT GATTACAGTT TCGGATTCTT CTATCTTGAG GCCGGTAAAG CAACTATTGA GATAGGTTCC TCCGGAAGCT GGGGCTTTAT ACTGTACGAC AAAATATACT TTGACCATGC TGACATGCCC GATCATATAA TTGACCCGAC TCCGTGTGAT CCAAATGCAA CTCCTGAAAC AAGAGCTCTT ATGAAATACC TTACCAGCGT GTACGGAAAA TATGTTATTT CCGGCCAGCA GGAGATTTAC GGAAACGGAA ACGACGGCAA TTATGAACTT GAATTCGATT ATATTTATGA GAAGACAGGC AAATATCCTG CAATCAGAGG CTTTGATTTC ATGAACTACA ATCCTCTGTA CGGATGGGAA GACGGTACAA CGGCACGTAT AATCGACTGG GTAAAAAATC GCGGCGGTAT TGCAACAGCA TGCTGGCATA TAAATATTCC CAGGGATTTT GCAAGTTATA AACTCGGTGA GCCGGTGGAT TGGACAAACT GTACATACAA ACCGACAAGC AGCTTTAATA CCGCAAACTG CCTTGATGAA ACAACAAAAG AACATGCTTA CCTGATGATG GCAATTGAAG ACCTTGCAGA GCAGCTTTTA ATTCTTCAGG AGCAAAACAT TCCTATACTT TTCCGTCCGT TCCATGAAGC TGAAGGCTAC AACAACACCG ACGGCTCCGG CGCATGGTTC TGGTGGGGTT CTGCAGGTGC TGAAGTTTAC AAGGAACTCT GGAAACTTCT TTATAAAACT CTTACCGAAA AATACGGCAT TCATAATTTG ATATGGGAAG TAAACCTTTA TACATATGCC AATTCTTATG AATGGTATCC CGGCGATGAG TATGTGGACA TTATCGGATA CGACAAATAT GAAGGTTCAC CCAATACCTG GGGCACAAGC GCCGCATCAT CATTATTCCT TACACTTGTA AATTACACAA ACGACACAAA GATGGTTGCA TTGACTGAAA ATGACGTTAT TCCCGATATT CAAAATATAG TTAATGAGGA AGCCTGGTGG CTGTATTTCT GCCCATGGTA CGGTGATTTC CTTATGAGTC CCAGATACAA CGACCCCGTA CTTTTGAACA CTATCTACAA CAGTGAATAT GTAATCACCT TGGATGAACT TCCGGAAAAC CTTTATGAAT ATGATGGTGA AATACCGGAT ATCAACTACG GCGATTTGAA CAATGACGGA AATATAAACT CAACCGATTA TATGATACTG AAGAAATATA TTTTAAAAGT TCTTGAAAGA ATGAATGTCC CTGAAAAAGC AGCAGATTTA AACGGTGACG GTTCAATCAA TTCAACCGAT TTGACAATAT TAAAAAGATT TATAATGAAA GCAATTACAA AATTTCCCGT TACACAAAAG TAA
|
Protein sequence | MGKIFGRTLS LLVTFAMVFS VLLVMPVSTY AAYSLPVDVE AEDCTLGNGA VVTTNVYGTQ YPGYSGDGFV WVANSGTITL EVTIPENGMY ELSTRCWMYL GKEDETRMQV ISINGKSHSN YFIPNKGQWI DYSFGFFYLE AGKATIEIGS SGSWGFILYD KIYFDHADMP DHIIDPTPCD PNATPETRAL MKYLTSVYGK YVISGQQEIY GNGNDGNYEL EFDYIYEKTG KYPAIRGFDF MNYNPLYGWE DGTTARIIDW VKNRGGIATA CWHINIPRDF ASYKLGEPVD WTNCTYKPTS SFNTANCLDE TTKEHAYLMM AIEDLAEQLL ILQEQNIPIL FRPFHEAEGY NNTDGSGAWF WWGSAGAEVY KELWKLLYKT LTEKYGIHNL IWEVNLYTYA NSYEWYPGDE YVDIIGYDKY EGSPNTWGTS AASSLFLTLV NYTNDTKMVA LTENDVIPDI QNIVNEEAWW LYFCPWYGDF LMSPRYNDPV LLNTIYNSEY VITLDELPEN LYEYDGEIPD INYGDLNNDG NINSTDYMIL KKYILKVLER MNVPEKAADL NGDGSINSTD LTILKRFIMK AITKFPVTQK
|
| |