Gene Information |
Locus tag | Cthe_0536 |
Symbol | |
ID | 4808285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 654353 |
End bp | 656044 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105950 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036965 |
Protein GI | 125973055 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
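The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 654353
end_bp = 656044
reported_gene_length = 1692      # bp
reported_protein_length = 563    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1692 0 563
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions, the reported gene length (1692 bp) and protein length (563 aa) agree with the coordinates.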
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000215938 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTCTTGTTTT ATTGATTGCA TTAATAATGA TTGCAACCTT ATTAGTGGTT CCCGGTGTGC AAACATCGGC AGAAGGGTCA TATGCTGATT TGGCAGAACC GGATGACGAC TGGCTGCATG TGGAAGGTAC GAATATTGTT GACAAGTACG GCAATAAAGT TTGGATAACA GGAGCCAACT GGTTTGGATT CAATTGTAGA GAGAGAATGC TTTTGGATTC ATACCATAGC GATATTATAG CAGATATTGA ATTGGTTGCG GATAAAGGTA TAAATGTGGT TAGAATGCCG ATTGCGACAG ATTTGCTCTA TGCATGGAGC CAGGGGATAT ATCCGCCTTC AACCGATACA AGCTACAACA ATCCGGCTTT GGCTGGATTA AACAGCTATG AGTTGTTTAA TTTCATGCTG GAAAATTTCA AAAGAGTCGG TATCAAAGTT ATACTTGATG TGCACAGCCC GGAAACTGAC AACCAGGGGC ATAACTATCC TCTTTGGTAC AATACCACTA TAACGGAGGA GATATTCAAA AAGGCCTGGG TATGGGTGGC TGAGCGCTAT AAAAATGATG ACACAATAAT CGGATTTGAC CTAAAAAATG AGCCCCATAC CAATACCGGC ACCATGAAAA TAAAAGCTCA AAGTGCCATA TGGGATGACT CCAACCATCC GAACAACTGG AAAAGAGTGG CTGAGGAAAC TGCCTTGGCA ATATTGGAAG TACATCCAAA TGTATTGATA TTTGTTGAAG GTGTGGAGAT GTATCCCAAA GACGGCATAT GGGATGACGA AACTTTTGAC ACAAGCCCGT GGACAGGAAA CAATGACTAT TACGGAAACT GGTGGGGCGG TAACTTAAGA GGCGTGAAGG ATTATCCGAT TAATCTTGGA AAATATCAGT CGCAGCTTGT TTATTCACCT CATGATTATG GCCCGATAGT TTATGAGCAG GATTGGTTTA AAGGCGATTT TATCACTGCC AATGATGAAC AGGCAAAAAG GATTCTGTAT GAGCAATGCT GGAGAGACAA TTGGGCATAT ATCATGGAAG AAGGAATATC ACCGTTGCTC CTTGGCGAAT GGGGAGGTAT GACCGAAGGC GGCCACCCGC TTCTTGACCT GAACTTGAAG TATTTAAGAT GCATGAGAGA TTTTATATTG GAAAACAAAT ATAAATTGCA TCATACTTTC TGGTGCATAA ACATTGACTC GGCAGATACC GGCGGATTGT TTACCCGTGA TGAGGGAACA CCGTTCCCGG GGGGAAGAGA TCTTAAGTGG AATGACAACA AGTACGACAA TTACTTGTAT CCTGTTCTTT GGAAAACCGA GGACGGAAAG TTTATAGGTC TTGACCACAA GATTCCTCTC GGCAGAAACG GTATATCAAT AAGTCAGCTT TCAAACTATA CACCGTCGGT TACTCCGTCT CCCAGCGCAA CTCCTTCTCC GACAACAATA ACTGCACCGC CGACGGATAC CGTTACATAC GGAGATGTGA ACGGAGACGG AAGGGTAAAC TCCAGCGATG TGGCATTGTT GAAAAGATAT TTGTTGGGTT TGGTTGAAAA TATCAATAAA GAAGCAGCGG ACGTAAATGT CAGTGGAACT GTAAATTCAA CGGATTTGGC AATTATGAAA AGGTATGTTT TGCGTAGCAT AAGTGAGTTG CCGTATAAAT AA
|
Protein sequence | MKKFLVLLIA LIMIATLLVV PGVQTSAEGS YADLAEPDDD WLHVEGTNIV DKYGNKVWIT GANWFGFNCR ERMLLDSYHS DIIADIELVA DKGINVVRMP IATDLLYAWS QGIYPPSTDT SYNNPALAGL NSYELFNFML ENFKRVGIKV ILDVHSPETD NQGHNYPLWY NTTITEEIFK KAWVWVAERY KNDDTIIGFD LKNEPHTNTG TMKIKAQSAI WDDSNHPNNW KRVAEETALA ILEVHPNVLI FVEGVEMYPK DGIWDDETFD TSPWTGNNDY YGNWWGGNLR GVKDYPINLG KYQSQLVYSP HDYGPIVYEQ DWFKGDFITA NDEQAKRILY EQCWRDNWAY IMEEGISPLL LGEWGGMTEG GHPLLDLNLK YLRCMRDFIL ENKYKLHHTF WCINIDSADT GGLFTRDEGT PFPGGRDLKW NDNKYDNYLY PVLWKTEDGK FIGLDHKIPL GRNGISISQL SNYTPSVTPS PSATPSPTTI TAPPTDTVTY GDVNGDGRVN SSDVALLKRY LLGLVENINK EAADVNVSGT VNSTDLAIMK RYVLRSISEL PYK
|
| |
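The sequences above are displayed in space-separated groups of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch, not the database's own method: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first two and last two groups of the gene sequence are used here to keep the example short.

```python
def clean_sequence(grouped: str) -> str:
    """Join the space-separated 10-character groups into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two and last two groups of the gene sequence, copied from the record.
head = clean_sequence("ATGAAAAAAT TTCTTGTTTT")
tail = clean_sequence("CCGTATAAAT AA")

print(head.startswith("ATG"))  # True: the CDS begins with an ATG start codon
print(tail[-3:])               # TAA: a stop codon in translation table 11
print(gc_fraction(head))       # 0.15 for these 20 bases only
```

Passing the complete gene sequence to `gc_fraction` would give a value to compare against the reported GC content field. The figure printed here covers only the first 20 bases and is not expected to match it.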