Gene Athe_1124 details

Gene Information

Locus tag: Athe_1124
Symbol:
ID: 7408706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1215621
End bp: 1216898
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 40%
IMG OID: 643715490
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_002572998
Protein GI: 222529116
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
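
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1215621
end_bp = 1216898
gene_length_bp = 1278
protein_length_aa = 425

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon that is not translated.
codons = gene_length_bp // 3
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1278 bp corresponds to 426 codons, of which 425 encode amino acids.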


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000369055
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAAA GAAAATACGG TTTTGATACT CTTCAACTTC ATGCAGGGCA GTTTGTTGAC 
AGAGAGACAA AATCAAGAGC TGTTCCAATT TACCAGACAA CCTCTTACAT TTTCGAAACA
CCTGAAGAGG CAGCAGATTT GTTTGCACTC AAAAAAGCGG GAAATATCTA TACAAGAATA
GGAAATCCGA CAACAGATGT TTTAGAAAAA AGGATTGCAG CACTGGACGG TGGGGTTGGG
GCTGTTGCAA CCTCTTCAGG GCAGGCTGCT ATCACTTATG CCATTTTAAA TATCGCAAGA
AGCGGTGATG AGGTTGTTGC AGCATCCACT TTATACGGCG GAACATACGC CCTTTTTGCT
CACACATTAA GAAAGCTTGG CATCACAGTA AAATTTGTAA ATCCTGATTA TCCAGAAGAG
TTCGAAAAGG CAATCACAGA CAAGACAAAG GCTATATTTG TAGAAACTCT TGGAAATCCC
AATATAAACA TACCTGATTT TGAAGCGATA GCTGAAATTG CCCACAAGCA TGGGATTCCA
TTTATTGTTG ACAACACGTT TGCAACCCCG TATCTATTTC GTCCTATTGA ACATGGGGCA
GACATTGTTG TGTATTCAAT GACAAAGTTT TTGGGCGGGC ATGGAACATC AATTGCAGGG
ATTGTTGTTG ACTCTGGCAA GTTTGAGTGG AACGAAAAAT TCCCAGATTT GATTCAGCCA
GACCCAAGCT ATCATGGACT TATTTACACA AAAGAGTTTG GAAATGCTGC ATATATTGCA
AAACTAAGAC TTACGCTCTT GCGCGACATT GGTGCATGCC TCTCCCCATT TAATTCGTTC
TTGATACTTC TTGGTGTTGA GACACTCTCT TTGAGAATGC AAAAACATGT TGACAATGCA
ATGAAACTTG CCAAGTTTCT AAATGACCAT CCAAAGGTTG AGTGGGTGAA CTATCCAGCT
TTAGAAGGTA ACAAGTATTA TGAGCTTTAC AAGAAGTATC TTCCAAAAGG ACCGGGTGCA
ATCTTCACAT TTGGACCAAA AGGTGGCTAC GATGCTGCAA AGAAGATTAT AAACAATGTA
AAGCTCTTTT CACACCTTGC TAATGTTGGC GATGCAAAGT CGCTTATAAT TCATCCTGCA
TCAACAACCC ATCAACAGCT AACAGAAGAA GAGCAAAGAG CAGCTGGAGT TTTGCCAGAA
ATGATTAGAC TCTCTGTAGG TATTGAAGAT ATTGAAGATT TAATATATGA TATTGAGAGT
GCACTAAATA AAGTGTAA
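
The gene sequence is displayed in space-separated groups of 10 bases, so the whitespace has to be stripped before any computation. As a rough sketch, the GC content field could be recomputed as below; `gc_percent` is a hypothetical helper, not part of any tool behind this record, and how the record rounds its 40% figure is not stated.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First displayed line of the gene sequence (60 bases), used here as sample input.
first_line = "ATGGAAGAAA GAAAATACGG TTTTGATACT CTTCAACTTC ATGCAGGGCA GTTTGTTGAC"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full 1278-base sequence to the same function gives the whole-gene value to compare against the GC content field.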
 
Protein sequence
MEERKYGFDT LQLHAGQFVD RETKSRAVPI YQTTSYIFET PEEAADLFAL KKAGNIYTRI 
GNPTTDVLEK RIAALDGGVG AVATSSGQAA ITYAILNIAR SGDEVVAAST LYGGTYALFA
HTLRKLGITV KFVNPDYPEE FEKAITDKTK AIFVETLGNP NINIPDFEAI AEIAHKHGIP
FIVDNTFATP YLFRPIEHGA DIVVYSMTKF LGGHGTSIAG IVVDSGKFEW NEKFPDLIQP
DPSYHGLIYT KEFGNAAYIA KLRLTLLRDI GACLSPFNSF LILLGVETLS LRMQKHVDNA
MKLAKFLNDH PKVEWVNYPA LEGNKYYELY KKYLPKGPGA IFTFGPKGGY DAAKKIINNV
KLFSHLANVG DAKSLIIHPA STTHQQLTEE EQRAAGVLPE MIRLSVGIED IEDLIYDIES
ALNKV
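
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below hard-codes the standard amino acid assignments in NCBI order (bases T, C, A, G); translation table 11 uses the same assignments for sense codons and differs from the standard table only in its permitted start codons, which does not matter here because the gene starts with ATG. The `translate` function is illustrative, not the method used to build this record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(seq: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First 30 bases and the final 18 bases of the gene sequence above.
print(translate("ATGGAAGAAA GAAAATACGG TTTTGATACT"))  # MEERKYGFDT
print(translate("GCACTAAATA AAGTGTAA"))               # ALNKV*
```

The two outputs match the first and last residues of the protein sequence, with the trailing `*` corresponding to the TAA stop codon.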