Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0400 |
Symbol | |
ID | 4808403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 496855 |
End bp | 497883 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105814 |
Product | hypothetical protein |
Protein accession | YP_001036831 |
Protein GI | 125972921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0606591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCAG TAAAAGAATA CAAATGTCCC AGCTGCGGTG CAGGATTGGA TTTTGATCCA CCTTCACAAA AATGGAAATG TAACTACTGC TTCAATGAAT ACGAAAAGGA TCAATTTGAC ATATCCGACA ATGAGGAACT TCTCAATGAG GAAATGCCGG AATTGGATTC ATATATTTGC AACAGCTGTG GAGCCGAATT GATTGCTGAC AATACCACCT CAGCCACCTT TTGTATTTAC TGCAAAAGTC CCGCAGTCAT CAAATCAAGA TTTTCCGGGA GATTCAGACC CAGAAATGTA ATTCCTTTCA GATTGACAAA GGAACAGGCA AAGGAAATCT ATCAAAAGTG GATAAAAAAA CGCATATTTG CTCCCAAAGA ATTTAAATTA AAAGAGGAAG TTGACAAAAT AACCGGAATA TATGCTCCCT TCTGGCTCTT TGACTGCAAA GCGGACGGCT TTATCAGCGG AGAAGCCACC AGAGTCCACT CCTGGAGACA GGGCAACTAC AAATACACTC AAACAAAGTA TTACAGCATT CTAAGAAGGG GACACGCCCG CTATAAGAGA ATTCCTGTGG ATGCATCCAA AAAACTGGAC GATAAATATA TGCATATGAT AGAACCTTTT GATTACAAAG ATTTGAAAGA CTTTTCCATG AAATATATGT CCGGTTTTAT GGCCGAAAGA TATGATGTCG AATCCGCCGA AGCCGTAACT ATTCTTAAAG ACAGAGTAAA AGACTATCTG TCGGAAAGAC TAAGAGGCAC TGTCAGCGGA TATTCTTCCT GCAGTATAAC AAGCAAAAAC ATAAATATAT CCGAAATCGA GCAAAGTTAT TCAATGCTGC CGGTTTATTT ATTGGTAAAC AAATACAAAG ACAAAAGCCA CATATTCATG ATAAACGGAC AAACCGGAAA AGTCGTCGGA GACACTCCGC TTTGTCTTCC GAAACAAATC CTTTTTGCAG TTGCAGTTTT CCTGCTCGTA TGGATTATAG GTGTGTTTGG AGGTGCCTTA TTTGCGTAA
|
Protein sequence | MTSVKEYKCP SCGAGLDFDP PSQKWKCNYC FNEYEKDQFD ISDNEELLNE EMPELDSYIC NSCGAELIAD NTTSATFCIY CKSPAVIKSR FSGRFRPRNV IPFRLTKEQA KEIYQKWIKK RIFAPKEFKL KEEVDKITGI YAPFWLFDCK ADGFISGEAT RVHSWRQGNY KYTQTKYYSI LRRGHARYKR IPVDASKKLD DKYMHMIEPF DYKDLKDFSM KYMSGFMAER YDVESAEAVT ILKDRVKDYL SERLRGTVSG YSSCSITSKN INISEIEQSY SMLPVYLLVN KYKDKSHIFM INGQTGKVVG DTPLCLPKQI LFAVAVFLLV WIIGVFGGAL FA
|
| |