Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0043 |
Symbol | |
ID | 4808808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 54373 |
End bp | 56601 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640105452 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036477 |
Protein GI | 125972567 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA TGGGGTTGGG CAGATTTAGC AATAAATTTG TCCTGACTGT TTTGCTGTTC TTTTTGACCG CCGTGCTGCT GCCGTCGTCT TCTTTTGAAT CAAAAGTACA GGCTGCTTCT TCGCCTCGTT ACGGCGGTGC ATATTATAAT TACGGAGAAG CTTTGCAAAA AGCGATTCTT TTTTACAAGG CCAATCGTCT TGGAGATTTG CCCGATGATT ACATCCTGCC TTACAGGGCT GACGCCGCAA TGACCGACGG TCAGGATGTG GGTCTGGATC TTACCGGAGG ATGGGCTGAT GCCGGAGACG GAATCAAATT TACCCATCCC ATGTCCTATG CCGCGGGCCA ATTGGGCTGG GCTGTGTATG AATATCGTCA GGCCTTTGAA AAGGCAGGGC TGTTGGATGA TATTCTGGAC GAAATAAAAT GGGCTACGGA CTTTTTCATT AAAGCTCATC CGGAGCCAAA TGTATTGTAT TATATGTGCG GCTACAATGA TTCCGACCAC TCGGTATGGG TTCCCCACGA ACTTTTGGAT TATGTGACGG ACAGAAAGTC CTTTGTGTTA AACCCGTCAA CCCCGGGCTC GGATGTGGCA GGACAAACTG CGGCGTGTCT GGCTATTGCA TCAATAATAT TTGAGCCGAC AGATCCCGAA TATGCCGAAA CCTGTTTGAC CCATGCAAAG CAGATTTTTG AGTTTGGTGA TAAATACAGA GGCAAAAATC CTCTGGATGT TTTATACCCG TCCGGAGGTT ATCTGGATGA TTTGGCCTGG GGTGCCGTGT GGCTCTACAT AAAAACCGGA GATTCCACTT ATCTTGAGAA AGCAAAATCC TTCCTGCCTG TTACTTCATT GGGCGGAGGG CATACCCATT GCTGGGATGA TGTCAGCTAC GGTGCTGCGC TTAAAATAGC CCAGCTTACT CATGACGAAG GATATGCCGC CATGGTCGAA AAGAACCTTG ATTTCTGGTT GCCCGGAGGA GGAATCACTT ACACTCCGGG AGGCCTGGCA TGGCTTTCAC CGTGGGGTTC TTTGCGTTAT GCATCCACAG CTGCGTTTTT GGCCTTTGTG TGGGCTGACG ACCCAACGGT TGGAACACCG TCAAAGAAAG AAACTTACCG TGCTTTTGCG GAAAAACAAA TAAACTATAT TCTCGGAGAT AACCCTCGAA AAGGAAGTTA TGTGGTGGGA TTTGGCGAGA ATTCTCCAAA ACATCCGCAT CACAGAACGG CTCATGGTTC GTGGGTAAGT ATGCTTGAAG TGCCGAGTTT CCATCGCCAC ATTCTCTACG GCGCATTGGT TGGTGGACCG AGTTCCGATG ACAGCTGGGA AGATGATATT TCCGATTATA CCCGGAATGA AGTGGCAACC GATTACAATG CAGGTTTTGT AGGGGCTTTG GCAAAAATGT ATGACATGTA TGGAGGAGAA CCCCTTGAAA ATTGGCCTCA ACCTGAAGAT TTCAGAGCAC CGGAAGACGA CATTGTTGAA TACTTCTGCC GTGGCTGGAT AATTTACGAA GGCTATGGAA CCTTAAACTT GCTGCTTCAG GTTAACAACC GTTCCGGATG GCCTCCGACA ATGAAGGATA AGTTGTCTGT CCGTTACTTC ATGGATTTAA CTGAAGTGTT TGAGTCCGGA GGAACTGTGG ATGACGTTCA GATAAGTCTT GGACAGAACG AGGGAGCAAA ACTTATAGGT TTGAAGCATT ACAGGGACAA CATTTACTAC TTTACGGTTG ACTTTACGGG AACCATGATT ATGCCTGCCG AGTGGGAGAT GTGTGAAAAA GATGCCCATG TAACAATCAA ATACAGAGAC GGCATAACAG GTTCTAATGA AAACGACTGG TCATATCAGA ACTTGAGAAA GGATCCGGAT TATGATGCTA CATCCTTTGC AGGACTGACT CCTTATATAC CTGTATATGA CAACGGTGTG CTTTTGTGGG GTGAAGAACC GCCTGCCGGC GGTGATGATC CCGGGTCTTC GCCGCCGCCG ACTCCAACTG AGCCGGTAAT TGTGTATGGT GATTTGAACG GCGACGGAAA TATAAATTCC ACGGATTTTA CGATGTTAAA GAGAGCAATA TTGGGTAATC CGGCTCCCGG TACAAATTTG GCAGCCGGTG ATTTGAACAG GGACGGTAAT ACAAATTCAA CAGACTTGAT GATTTTAAGA AGGTATTTGC TAAAGTTAAT TGGTTCGTTA CCTATATAA
|
Protein sequence | MKKMGLGRFS NKFVLTVLLF FLTAVLLPSS SFESKVQAAS SPRYGGAYYN YGEALQKAIL FYKANRLGDL PDDYILPYRA DAAMTDGQDV GLDLTGGWAD AGDGIKFTHP MSYAAGQLGW AVYEYRQAFE KAGLLDDILD EIKWATDFFI KAHPEPNVLY YMCGYNDSDH SVWVPHELLD YVTDRKSFVL NPSTPGSDVA GQTAACLAIA SIIFEPTDPE YAETCLTHAK QIFEFGDKYR GKNPLDVLYP SGGYLDDLAW GAVWLYIKTG DSTYLEKAKS FLPVTSLGGG HTHCWDDVSY GAALKIAQLT HDEGYAAMVE KNLDFWLPGG GITYTPGGLA WLSPWGSLRY ASTAAFLAFV WADDPTVGTP SKKETYRAFA EKQINYILGD NPRKGSYVVG FGENSPKHPH HRTAHGSWVS MLEVPSFHRH ILYGALVGGP SSDDSWEDDI SDYTRNEVAT DYNAGFVGAL AKMYDMYGGE PLENWPQPED FRAPEDDIVE YFCRGWIIYE GYGTLNLLLQ VNNRSGWPPT MKDKLSVRYF MDLTEVFESG GTVDDVQISL GQNEGAKLIG LKHYRDNIYY FTVDFTGTMI MPAEWEMCEK DAHVTIKYRD GITGSNENDW SYQNLRKDPD YDATSFAGLT PYIPVYDNGV LLWGEEPPAG GDDPGSSPPP TPTEPVIVYG DLNGDGNINS TDFTMLKRAI LGNPAPGTNL AAGDLNRDGN TNSTDLMILR RYLLKLIGSL PI
|
| |