Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1842 |
Symbol | |
ID | 4809388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2186610 |
End bp | 2187902 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107256 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_001038256 |
Protein GI | 125974346 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000076838 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAA GAAAATTGAA ATTTGACACT TTGCAGGTCC ATGCAGGGCA GAAGCCCGAC CCTACTACAG GTTCAAGAGC GGTTCCGATT TACCAGACCA CGTCCTATGT GTTCAACAGT CCGGAGCATG CGGCAAATCT CTTTGCGTTA AAGGAGCCCG GCAATATTTA CACAAGAATT ATGAACCCCA CAACTGATGT TTTTGAACAG AGAATGGCCG CTTTGGAAGG AGGAGTCGGA GCGCTGGCGG TGGCTTCAGG TTCGGCGGCT ATTACCTATG CAATACTCAA TATAGCCGGA GCGGGAGATG AAATTGTTTC GGCCAGCACT CTTTACGGAG GAACATATAA TCTTTTTGCG GCAACTTTAC CGAGGATTGG AATTAAAACT GTTTTTGTAG ACCCTGACGA TCCGGAAAAT TTCAGAAAAG CCATTAATGA AAAGACAAAG GCTCTTTATA TTGAGTCTCT TGGAAACCCC GGAATTAACA TAGTTGATAT TGAGGCGGTC GCAAAAATTG CCCATGAGAA CGGTATACCG CTTATTGTTG ACAATACCTT TGGTACTCCG TATCTTATCA GGCCTTTTGA GTTTGGAGCC GATATTGTTG TGCATTCTGC GACGAAGTTC ATCGGCGGTC ACGGTACTTC CATAGGAGGA GTTATTGTTG ATTCGGGAAA ATTTGACTGG GCCGGAAGCG GAAAATTCCC GGTTCTTACC GAGCCGGATC CAAGCTATCA CGGCTTAAAA TATGTTGAGG CAGTAGGACC TCTTGCATAC ATTATCAGAG CCAGAGTGCA GCTTTTGAGA GATACAGGTG CGTGCATAAG TCCGTTTAAT TCATTCCTTC TCCTTCAGGG ACTGGAGACT CTGTCACTGA GGGTTGAAAG GCATGTTTCA AATGCAAAAA AGATTGCCGA GTATTTGCAA AATCATCCGA AAGTGGCGTG GGTAAATTAT CCGAGCCTTA AAGGCAACAA ATATTATGAC CTTGCTCAAA AATACTTCCC GAAAGGAGCA GGTTCAATAT TTACTTTCGG AATAAAGGGC GGATATGAAG CGGCAAAGAA ATTCATTGAA AACCTTGAAA TATTCTCACT TCTTGCCAAT GTTGCCGATG CAAAATCCCT GGTTATTCAT CCGGCGTCGA CAACCCATTC CCAGCTTTCG GAGGAGGAGC AAAGATCCTG CGGAGTTACA CCGGATCAAA TCAGGCTTTC AATCGGAATT GAAGACGTCG ACGACCTTAT TTATGATATT GAACAGGCAC TGGAAAAGGC TGTGGGGGCA TAA
|
Protein sequence | MAERKLKFDT LQVHAGQKPD PTTGSRAVPI YQTTSYVFNS PEHAANLFAL KEPGNIYTRI MNPTTDVFEQ RMAALEGGVG ALAVASGSAA ITYAILNIAG AGDEIVSAST LYGGTYNLFA ATLPRIGIKT VFVDPDDPEN FRKAINEKTK ALYIESLGNP GINIVDIEAV AKIAHENGIP LIVDNTFGTP YLIRPFEFGA DIVVHSATKF IGGHGTSIGG VIVDSGKFDW AGSGKFPVLT EPDPSYHGLK YVEAVGPLAY IIRARVQLLR DTGACISPFN SFLLLQGLET LSLRVERHVS NAKKIAEYLQ NHPKVAWVNY PSLKGNKYYD LAQKYFPKGA GSIFTFGIKG GYEAAKKFIE NLEIFSLLAN VADAKSLVIH PASTTHSQLS EEEQRSCGVT PDQIRLSIGI EDVDDLIYDI EQALEKAVGA
|
| |