Gene Information |
Locus tag | Arth_0531 |
Symbol | |
ID | 4446981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 564983 |
End bp | 566317 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639688328 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_830030 |
Protein GI | 116669097 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
|
|
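The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon; neither is stated in the record, but both are consistent with the listed Gene Length and Protein Length.

```python
# Values copied from the Gene Information table.
start_bp = 564983
end_bp = 566317

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS is read in codons of 3 bp. Assumption: the final codon is
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp)     # 1335
print(remainder)          # 0
print(protein_length_aa)  # 444
```

Both results agree with the Gene Length (1335 bp) and Protein Length (444 aa) rows.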
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACC GCAAGTTCGG CTTCCGCACC CGGGCCCTTC ACGCCGGCGG CACACCCGAC GCCGAGCACG GCGCGCGGGC CGTCCCGATC TACCAGACCA CGTCCTTCGT CTTCAAGGAC ACCCAGGACG CCGCCAACCT CTTCGCCCTG CAGAAGTACG GCAACATCTA CTCGCGCATC GGCAACCCCA CGGTGGCGGC GTTCGAAGAG CGCATTGCCT CCCTGGAGGG CGGCATCGGA GCCGTTGCGA CGTCGTCGGG CATGGCCGCG GAGTTCATCA CCTTTGCCGC GCTCACCCAG GCAGGCGACC ACATCGTGGC GGCCTCCCAG CTGTACGGCG GCACGGTCAC CCAGCTCGAC GTCACGTTGC GCCGCTTCGG GGTGGACACC ACGTTCGTCC CCGGCACCGA CCCGGCGGAC TACGCCGCCG CGGTCCGGGA GAACACCAAG GCGATCTTCG TCGAGGTGGT GGCCAACCCG TCGTCGGAAG TCCAGGACCT TGAGGGGCTG GCAAAGGTGG CGCGCGACGC CGGCATTCCG TTGGTCGTCG ACGCCACCTT GAGCACGCCG TACCTGGTGC GGCCGATCGA GCACGGGGCG GACATCGTCA TCCACTCCGC CACCAAGTTC CTCGGCGGAC ACGGCACCAC CCTCGGCGGC GTGATCGTCG AGAGCGGCCG GTTCAACTGG GGCAACGGCA AGTTCCCCAC CATGACCGAG CCCGTGGCCT CCTACGGCAA CGTCTCCTGG TGGGGCAACT TCGGTGAGTA TGGCTTCCTG ACCAAGCTCC GCTGCGAGCA GCTGCGGGAT ATCGGCCCCG CACTCTCTCC GCAGTCCGCG TTCCAGCTGC TGCAGGGCGT GGAAACCCTT CCGCAGCGCC TCGACGAGCA CCTGAAGAAC GCCCAGGCCG TGGCCGAATG GCTCGAAGCG GACGAGCGCG TGGCGTACGT CAACTTCTCC GGATTGCCGT CGCACCCGCA CTTCGAGCGG GCGCAGAAGT ACCTGCCCCT GGGTCCAGGC TCGGTATTCT CCTTCGGTGT CAAGGGCGGC CGCGCAGCCG GGCAGAAATT CATCGAGGCA CTCCAGCTGG CCTCGCACCT GGCCAACGTC GGCGACTCCC GTACTCTCGT GATCCACCCC GGCTCCACCA CCCACCAGCA GCTGAGCCCG GCCCAGCTTG AGTCTGCGGG AGTACCGGAA GACCTGGTGC GGATTTCGAT CGGGCTCGAG GACCTCGAGG ACATCCTCTG GGACCTCGAC CAGGCGCTGG ACGCTGCGTC TACACCGGTG GTCGAGCTTG TCGAGGCCGA CACCTGCACG ATCGGAGCGA ACTGA
|
Protein sequence | MADRKFGFRT RALHAGGTPD AEHGARAVPI YQTTSFVFKD TQDAANLFAL QKYGNIYSRI GNPTVAAFEE RIASLEGGIG AVATSSGMAA EFITFAALTQ AGDHIVAASQ LYGGTVTQLD VTLRRFGVDT TFVPGTDPAD YAAAVRENTK AIFVEVVANP SSEVQDLEGL AKVARDAGIP LVVDATLSTP YLVRPIEHGA DIVIHSATKF LGGHGTTLGG VIVESGRFNW GNGKFPTMTE PVASYGNVSW WGNFGEYGFL TKLRCEQLRD IGPALSPQSA FQLLQGVETL PQRLDEHLKN AQAVAEWLEA DERVAYVNFS GLPSHPHFER AQKYLPLGPG SVFSFGVKGG RAAGQKFIEA LQLASHLANV GDSRTLVIHP GSTTHQQLSP AQLESAGVPE DLVRISIGLE DLEDILWDLD QALDAASTPV VELVEADTCT IGAN
|
| |
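The Gene sequence and Protein sequence rows can be checked against each other, and against the GC content row, with a short script. This is a minimal sketch, not the method used to produce the record. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The gene sequence starts with ATG and ends with TGA, so the sketch assumes it is already given in coding orientation even though the Strand field is `-`. Only a short excerpt is used here; paste the full Gene sequence value to check the whole record.

```python
# Codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the Gene sequence row, in the record's blocked layout.
excerpt = "ATGGCGGACC GCAAGTTC"
print(translate(excerpt))              # MADRKF
print(round(gc_fraction(excerpt), 2))  # 0.61
```

The excerpt translates to MADRKF, the first six residues of the Protein sequence row. Running `gc_fraction` on the full gene sequence gives the value to compare with the GC content row.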