Gene Information |
Locus tag | Athe_1124 |
Symbol | |
ID | 7408706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1215621 |
End bp | 1216898 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643715490 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_002572998 |
Protein GI | 222529116 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
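
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its convention) and that the CDS is complete and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1215621
END_BP = 1216898
GENE_LENGTH_BP = 1278
PROTEIN_LENGTH_AA = 425


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues in a complete, unspliced CDS.

    Assumes the CDS length is a multiple of three and that the final
    codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print(computed_length == GENE_LENGTH_BP)       # gene length agrees with the coordinates
print(computed_protein == PROTEIN_LENGTH_AA)   # protein length agrees with the gene length
```

Under these assumptions the listed values are mutually consistent: 1216898 - 1215621 + 1 = 1278 bp, and 1278 / 3 - 1 = 425 aa.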
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000369055 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAA GAAAATACGG TTTTGATACT CTTCAACTTC ATGCAGGGCA GTTTGTTGAC AGAGAGACAA AATCAAGAGC TGTTCCAATT TACCAGACAA CCTCTTACAT TTTCGAAACA CCTGAAGAGG CAGCAGATTT GTTTGCACTC AAAAAAGCGG GAAATATCTA TACAAGAATA GGAAATCCGA CAACAGATGT TTTAGAAAAA AGGATTGCAG CACTGGACGG TGGGGTTGGG GCTGTTGCAA CCTCTTCAGG GCAGGCTGCT ATCACTTATG CCATTTTAAA TATCGCAAGA AGCGGTGATG AGGTTGTTGC AGCATCCACT TTATACGGCG GAACATACGC CCTTTTTGCT CACACATTAA GAAAGCTTGG CATCACAGTA AAATTTGTAA ATCCTGATTA TCCAGAAGAG TTCGAAAAGG CAATCACAGA CAAGACAAAG GCTATATTTG TAGAAACTCT TGGAAATCCC AATATAAACA TACCTGATTT TGAAGCGATA GCTGAAATTG CCCACAAGCA TGGGATTCCA TTTATTGTTG ACAACACGTT TGCAACCCCG TATCTATTTC GTCCTATTGA ACATGGGGCA GACATTGTTG TGTATTCAAT GACAAAGTTT TTGGGCGGGC ATGGAACATC AATTGCAGGG ATTGTTGTTG ACTCTGGCAA GTTTGAGTGG AACGAAAAAT TCCCAGATTT GATTCAGCCA GACCCAAGCT ATCATGGACT TATTTACACA AAAGAGTTTG GAAATGCTGC ATATATTGCA AAACTAAGAC TTACGCTCTT GCGCGACATT GGTGCATGCC TCTCCCCATT TAATTCGTTC TTGATACTTC TTGGTGTTGA GACACTCTCT TTGAGAATGC AAAAACATGT TGACAATGCA ATGAAACTTG CCAAGTTTCT AAATGACCAT CCAAAGGTTG AGTGGGTGAA CTATCCAGCT TTAGAAGGTA ACAAGTATTA TGAGCTTTAC AAGAAGTATC TTCCAAAAGG ACCGGGTGCA ATCTTCACAT TTGGACCAAA AGGTGGCTAC GATGCTGCAA AGAAGATTAT AAACAATGTA AAGCTCTTTT CACACCTTGC TAATGTTGGC GATGCAAAGT CGCTTATAAT TCATCCTGCA TCAACAACCC ATCAACAGCT AACAGAAGAA GAGCAAAGAG CAGCTGGAGT TTTGCCAGAA ATGATTAGAC TCTCTGTAGG TATTGAAGAT ATTGAAGATT TAATATATGA TATTGAGAGT GCACTAAATA AAGTGTAA
Protein sequence | MEERKYGFDT LQLHAGQFVD RETKSRAVPI YQTTSYIFET PEEAADLFAL KKAGNIYTRI GNPTTDVLEK RIAALDGGVG AVATSSGQAA ITYAILNIAR SGDEVVAAST LYGGTYALFA HTLRKLGITV KFVNPDYPEE FEKAITDKTK AIFVETLGNP NINIPDFEAI AEIAHKHGIP FIVDNTFATP YLFRPIEHGA DIVVYSMTKF LGGHGTSIAG IVVDSGKFEW NEKFPDLIQP DPSYHGLIYT KEFGNAAYIA KLRLTLLRDI GACLSPFNSF LILLGVETLS LRMQKHVDNA MKLAKFLNDH PKVEWVNYPA LEGNKYYELY KKYLPKGPGA IFTFGPKGGY DAAKKIINNV KLFSHLANVG DAKSLIIHPA STTHQQLTEE EQRAAGVLPE MIRLSVGIED IEDLIYDIES ALNKV
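
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last few codons of the record are used as input here.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
gene_prefix = "ATGGAAGAAA GAAAATACGG TTTTGATACT"
print(translate(gene_prefix))  # MEERKYGFDT

# Last three codons of the gene sequence, ending in the TAA stop codon.
gene_suffix = "AAAGTGTAA"
print(translate(gene_suffix))  # KV
```

The two printed fragments match the first ten residues and the final two residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.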