Gene Information |
Locus tag | Cthe_1560 |
Symbol | |
ID | 4810067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1888998 |
End bp | 1889906 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640106978 |
Product | cystathionine beta-synthase (acetylserine-dependent) |
Protein accession | YP_001037979 |
Protein GI | 125974069 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
|
|
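The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1888998
END_BP = 1889906


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 909 302
```

Under these assumptions the result agrees with the Gene Length (909 bp) and Protein Length (302 aa) fields.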
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.180171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATATG ATGATATGCA ACAACTTATA GGAAACACAC CTCTTGTACG TCTTCGCCAT TCCGGTTTTC CTGACGGTGT CCGCGTATAT GCCAAGCTTG AACTGTATAA CCCCAGTGGC AGTGTAAAAG ACCGAATCGG AAAATATATG ATTGAAGATG CGGAAAAAAG AGGTGTTTTG GTTCCCGGCA GCACTATAGT GGAAGGAACT GCGGGAAATA CCGGACTGGG CATTGCTTTT GCCGCCCTTA ACCGGGGTTA CCGGTTAATT ATGGTGGTAC CAACCAAATT TTCACAAGAA AAACAGGCTC TGCTCCGCGC CCTCGGTGCG GAAGTAATTA ACACACCCCG TGAAGAAGGA ATGTTGGGAG CGGAACGAAA AGCGGAAGAA TTGCGGCAGT CCATTCCTAA AGCCGTTTCA CTGGAGCAAT TCCGAAATCC GGCAAATCCG TTGGCTCATT ACGAAACAAC CGGTCCTGAA ATTTGGGAGG ACCTGGGAGC GGACATAGAT TATTTTGTCG CCGGTGCCGG AAGCGGTGGT ACATATGCGG GAATCGTGCG TTACTTAAAA GAACAAAAAC CGGATATCAA AGGTATTCTT GCCGACCCCA TCGGTTCCAC AATGGGAGGA GGAGAGCATG GTGATTATGA CATAGAAGGC ATTGGAAACG ACTTTATACC GGAAACAATG GATATGTCCC TTGTGGACGA GGTTATAAAA GTCAGTGATG ATGAAGCTTT TACCGAGACC AGGCTTCTGG CGCGAAATGA AGGTATTATT GCCGGCTCGT CCTCCGGAGC AAATCTTGCA GCGGTACGCA AGCTTGCCAT GCGGATTCAA CGCGGCACCA TTGTTACAGT GCTGCCTGAC AGGGGTGAAC GCTATCTATC CAAAAACCTG TTTTTGTGA
|
Protein sequence | MIYDDMQQLI GNTPLVRLRH SGFPDGVRVY AKLELYNPSG SVKDRIGKYM IEDAEKRGVL VPGSTIVEGT AGNTGLGIAF AALNRGYRLI MVVPTKFSQE KQALLRALGA EVINTPREEG MLGAERKAEE LRQSIPKAVS LEQFRNPANP LAHYETTGPE IWEDLGADID YFVAGAGSGG TYAGIVRYLK EQKPDIKGIL ADPIGSTMGG GEHGDYDIEG IGNDFIPETM DMSLVDEVIK VSDDEAFTET RLLARNEGII AGSSSGANLA AVRKLAMRIQ RGTIVTVLPD RGERYLSKNL FL
|
| |
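The relationship between the gene sequence, the translation table and the protein sequence can be illustrated with a short sketch. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the alternative start codons of table 11 are not handled here), and the helper names `clean`, `gc_percent` and `translate` are hypothetical. Only the first 30 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table built from the compact NCBI-style string (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGATATATG ATGATATGCA ACAACTTATA"
print(translate(prefix))  # MIYDDMQQLI
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.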