Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2895 |
Symbol | |
ID | 4809102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3421057 |
End bp | 3422808 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108314 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039286 |
Protein GI | 125975376 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.22791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAATTTTAGG TTGTTTTGTG ATTGTTGTTT TACTGGCAGT ACTGGCGGCA ATTCCGCTGG CAGGGTATTA CCTTTATTTT AAGCCCAATG ACGAAGTAGT TCCTGCTTTT GCCGAAGGGG AATTGGTTTT GGTTGTCGAA GGGGAAAAGG TACTTTCAAA GAACAGCCCA AAGATTTTGG AAGGTGAGGT GCTTCTTCCT TTCGACATTG TAAAAGAATA TTTTGACCCT CATATTTTCT GGGACGAAGC TTTGGGCAAA GTCACCATAA CCACGAAAGA CAGAGTAATA AGAATGAAGA CGGACAAGCT TGACGCAATT GTCAACAACC AGCCTATAAC TCTTAATATC CCTGTAACGG TGGAAAATGA CGTGGTGTAT ATTCCTATTG AGTTTTTGGC CGAATTTTAC GGTATTGAAG TGTCATACGT TGAAAAGAGC AATGTCGTTA TTATTGACTA TAAAAACAGC ATAAAACAGA TAGCTGAGCC TATTGATCCG GAGGCGGTTG TAAGAAAAGG CCGTTCGATA CGCGAGCCTA TTATAAGAAA GTTTGATTTA TCCTCCGGTG ATACGGCGGA AAACACACTC AGGATATTTG AAGAGTATGA CAAATGGTAC AAAGTCAGAA CTTGGGACGG TGCCATCGGA TATATTGAAA AACGTTTTGT GGTTGTGAAA AAATTGATGG TGGAGAAGAT TTCTGATGAC AAAACTCCAA AACCGGCGTG GACACCTCCA AAGGGTAAGA TAAATCTTGC GTGGGACATG ATATATACAA GAAGAGAAGA CCATTCATCT TTGGGAGAAA TGAAGGGGCT TGACGTCATT TCTCCCACGT GGTTTCAGGT GAAAAATGCA AAAGGAGAGT TAATCAACAG GGCATATTCA AAATATGTTG ATTGGGCCCA TAGCAGAGGT TATCAGGTAT GGGCCCTTTT AAGTAATGAC TTTACCGACA GCGAAATGAC CAGCAAATTT TTAAACAATA CCGACGCAAG GGACAACCTT ATCAGGGAAA TACTTGCTTA TGCGGCACTG TATAATCTTG ACGGCATTAA TATTGATTTT GAAAACATGT ATATTTCGGA CAGAGATGTT TTTACACAAT TCGTAAGAGA AATTGCCCCT TTGCTGAGGG AACAGGGACT TGTGGTATCG GTGGATGTCA ACGATATACA ATGTTATGAC AAAAAGGCAT TGAGCGAGGC TGTTGATTAT ATAATGTATA TGTCCTACGA CCAGCACTGG AGTACAAGTC CTGTGGCAGG CTCAGTGGCA CAGGTGAGCT GGCAGGAGAA AATAGTAAAA AGAGTCCTTG AACAGGAAGG AGTACCAAGG GAAAAACTGC TTTTGGGAAT TCCTTTTTAT ACCCGGCTGT GGAAAGAGAC GGTTGATGAA TCCGGTAAAA AGAAGCTGAC CAGCAGCGCT CTTACAATGA AGCAGGCAAA AAACCTGATT ATAGAAAACA ATGCAAAGGT AGAATGGGAT GAGGAAAGCG GACAGTTCTA CGCAGAATAC ACCAAGGACA ATACAAATTA CAGACTTTGG CTTGAGGATG CCAACTCAAT CAATTTAAGG ACTTCTCTGG TGCATAAATA CAGACTTGCG GGTACATGTG CATGGAGCAT TTATTTTGTG TCCGAGGATA TATGGGATGT GTTGAACAAA AATCTCAAGG AAATAGAAAG CTATCAGGAA TGGCTTGAAC AAAACCGGAA CAATCAGTAT AAATTTCCCT GA
|
Protein sequence | MKKKILGCFV IVVLLAVLAA IPLAGYYLYF KPNDEVVPAF AEGELVLVVE GEKVLSKNSP KILEGEVLLP FDIVKEYFDP HIFWDEALGK VTITTKDRVI RMKTDKLDAI VNNQPITLNI PVTVENDVVY IPIEFLAEFY GIEVSYVEKS NVVIIDYKNS IKQIAEPIDP EAVVRKGRSI REPIIRKFDL SSGDTAENTL RIFEEYDKWY KVRTWDGAIG YIEKRFVVVK KLMVEKISDD KTPKPAWTPP KGKINLAWDM IYTRREDHSS LGEMKGLDVI SPTWFQVKNA KGELINRAYS KYVDWAHSRG YQVWALLSND FTDSEMTSKF LNNTDARDNL IREILAYAAL YNLDGINIDF ENMYISDRDV FTQFVREIAP LLREQGLVVS VDVNDIQCYD KKALSEAVDY IMYMSYDQHW STSPVAGSVA QVSWQEKIVK RVLEQEGVPR EKLLLGIPFY TRLWKETVDE SGKKKLTSSA LTMKQAKNLI IENNAKVEWD EESGQFYAEY TKDNTNYRLW LEDANSINLR TSLVHKYRLA GTCAWSIYFV SEDIWDVLNK NLKEIESYQE WLEQNRNNQY KFP
|
| |