Gene Cthe_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1842 
Symbol 
ID4809388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2186610 
End bp2187902 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content45% 
IMG OID640107256 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001038256 
Protein GI125974346 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000076838 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAAA GAAAATTGAA ATTTGACACT TTGCAGGTCC ATGCAGGGCA GAAGCCCGAC 
CCTACTACAG GTTCAAGAGC GGTTCCGATT TACCAGACCA CGTCCTATGT GTTCAACAGT
CCGGAGCATG CGGCAAATCT CTTTGCGTTA AAGGAGCCCG GCAATATTTA CACAAGAATT
ATGAACCCCA CAACTGATGT TTTTGAACAG AGAATGGCCG CTTTGGAAGG AGGAGTCGGA
GCGCTGGCGG TGGCTTCAGG TTCGGCGGCT ATTACCTATG CAATACTCAA TATAGCCGGA
GCGGGAGATG AAATTGTTTC GGCCAGCACT CTTTACGGAG GAACATATAA TCTTTTTGCG
GCAACTTTAC CGAGGATTGG AATTAAAACT GTTTTTGTAG ACCCTGACGA TCCGGAAAAT
TTCAGAAAAG CCATTAATGA AAAGACAAAG GCTCTTTATA TTGAGTCTCT TGGAAACCCC
GGAATTAACA TAGTTGATAT TGAGGCGGTC GCAAAAATTG CCCATGAGAA CGGTATACCG
CTTATTGTTG ACAATACCTT TGGTACTCCG TATCTTATCA GGCCTTTTGA GTTTGGAGCC
GATATTGTTG TGCATTCTGC GACGAAGTTC ATCGGCGGTC ACGGTACTTC CATAGGAGGA
GTTATTGTTG ATTCGGGAAA ATTTGACTGG GCCGGAAGCG GAAAATTCCC GGTTCTTACC
GAGCCGGATC CAAGCTATCA CGGCTTAAAA TATGTTGAGG CAGTAGGACC TCTTGCATAC
ATTATCAGAG CCAGAGTGCA GCTTTTGAGA GATACAGGTG CGTGCATAAG TCCGTTTAAT
TCATTCCTTC TCCTTCAGGG ACTGGAGACT CTGTCACTGA GGGTTGAAAG GCATGTTTCA
AATGCAAAAA AGATTGCCGA GTATTTGCAA AATCATCCGA AAGTGGCGTG GGTAAATTAT
CCGAGCCTTA AAGGCAACAA ATATTATGAC CTTGCTCAAA AATACTTCCC GAAAGGAGCA
GGTTCAATAT TTACTTTCGG AATAAAGGGC GGATATGAAG CGGCAAAGAA ATTCATTGAA
AACCTTGAAA TATTCTCACT TCTTGCCAAT GTTGCCGATG CAAAATCCCT GGTTATTCAT
CCGGCGTCGA CAACCCATTC CCAGCTTTCG GAGGAGGAGC AAAGATCCTG CGGAGTTACA
CCGGATCAAA TCAGGCTTTC AATCGGAATT GAAGACGTCG ACGACCTTAT TTATGATATT
GAACAGGCAC TGGAAAAGGC TGTGGGGGCA TAA
 
Protein sequence
MAERKLKFDT LQVHAGQKPD PTTGSRAVPI YQTTSYVFNS PEHAANLFAL KEPGNIYTRI 
MNPTTDVFEQ RMAALEGGVG ALAVASGSAA ITYAILNIAG AGDEIVSAST LYGGTYNLFA
ATLPRIGIKT VFVDPDDPEN FRKAINEKTK ALYIESLGNP GINIVDIEAV AKIAHENGIP
LIVDNTFGTP YLIRPFEFGA DIVVHSATKF IGGHGTSIGG VIVDSGKFDW AGSGKFPVLT
EPDPSYHGLK YVEAVGPLAY IIRARVQLLR DTGACISPFN SFLLLQGLET LSLRVERHVS
NAKKIAEYLQ NHPKVAWVNY PSLKGNKYYD LAQKYFPKGA GSIFTFGIKG GYEAAKKFIE
NLEIFSLLAN VADAKSLVIH PASTTHSQLS EEEQRSCGVT PDQIRLSIGI EDVDDLIYDI
EQALEKAVGA