Gene Information |
Locus tag | Cthe_0435 |
Symbol | |
ID | 4808363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 547412 |
End bp | 548464 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105849 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001036866 |
Protein GI | 125972956 |
COG category | |
COG ID | |
TIGRFAM ID | |
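
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene span includes the stop codon; the record itself does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the span ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(547412, 548464)
print(length_bp)                  # 1053, matching "Gene Length"
print(protein_length(length_bp))  # 350, matching "Protein Length"
```

Under these assumptions, 1053 bp correspond to 351 codons, of which the last is the stop codon, leaving the 350 aa reported in the record.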
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAGGA AAATTTTGAT TGTGTTTGTA CTGCTGTCAA TTGCAATTTC CCAGTTTTTG TATTTTGAAG TGGGATGTTT TGATGTTTAT GCCTGGAATA AGGCAGTTAT TGGAGATGTA AATGCGGACG GTGTTGTAAA TATCAGTGAT TATGTACTTA TGAAGAGATA TATTCTCCGT ATTATTGCAG ATTTTCCTGC TGACGATGAT ATGTGGGTCG GAGATGTAAA CGGAGATAAT GTTATCAACG ATATTGACTG TAATTATTTG AAAAGATATT TACTTCATAT GATTAGGGAA TTTCCGAAGA ACTCTTACAA TTCGGCACCT ACATTTACGC CCATACCGAC ATTTACACCG ACACCGACGC CGACAAAGGC ACCTGCCGCT CCTGCAAATA CACAATCCGG CATTTTGAAT GACGGATATT TTCCTCCGGG AACTTCGAAG CATGAACTTA TAGCAAGGGC TTCAAGTCTA AAAGTGAGTG AAGTTAAAGC CATTATAAAA AAGCAGGTGG ATGAACACTG GGATGTTATA AGGGACGTGT GTGGGTTTAA GAATAAAGAA GTTGCTTATG CGTTTTTCTT TGGAATGGCT ACCAGAGAGT CCACTTTTAG AGCTGCAACT GAGACCGGAA GCGGGGCTTC ACACGCTTTT GGCCCTTTGC AGACAGCCGA GACGGCTTAT GCAAATGCGA ATCCCAACTA CATGCCCGAG CATAATGTTC CGGAGATGCA CCAATATGAT TTTACGGAAT ACAACTTCTA TGATGTGGGT ATATCCGTAC ATATGGGAAT CAGACATTTT TTGCACTTTG CAAGACTTGC GAAGGAAAAA TACAGCGGCA GGGACATAGC GCGTCACGGC TTGATGGGAT ACAATACAGG TTGGATTGAC GGTGCGGACG AATCTTGGAT TGTAAGATAT GCGGATGAAA CTGCTGCTTT GGGGGCATGG TATCTTAGGA ACAATCATAT GTCCGATGAT GAGTTTACAT GGGATACCGA TCCGAGGGTG GATCGCAGCA ATCCATGGGA GATTTATTAC TAA
|
Protein sequence | MKRKILIVFV LLSIAISQFL YFEVGCFDVY AWNKAVIGDV NADGVVNISD YVLMKRYILR IIADFPADDD MWVGDVNGDN VINDIDCNYL KRYLLHMIRE FPKNSYNSAP TFTPIPTFTP TPTPTKAPAA PANTQSGILN DGYFPPGTSK HELIARASSL KVSEVKAIIK KQVDEHWDVI RDVCGFKNKE VAYAFFFGMA TRESTFRAAT ETGSGASHAF GPLQTAETAY ANANPNYMPE HNVPEMHQYD FTEYNFYDVG ISVHMGIRHF LHFARLAKEK YSGRDIARHG LMGYNTGWID GADESWIVRY ADETAALGAW YLRNNHMSDD EFTWDTDPRV DRSNPWEIYY
|
| |
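
The sequences above are printed in blocks of ten characters. A minimal sketch of how the gene sequence could be cleaned up, checked against the reported GC content, and translated is given below. It assumes translation table 11 shares the standard codon assignments and that an alternative start codon such as GTG is read as methionine at the first position; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database API.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and encodes Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate("GTGAAAAGGA AAATTTTGAT T"))  # MKRKILI
```

Applied to the full gene sequence, `translate` can be compared with the protein sequence in the record, and `gc_fraction` with the reported 42% GC content.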