Gene Arth_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0531 
Symbol 
ID4446981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp564983 
End bp566317 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID639688328 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_830030 
Protein GI116669097 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGACC GCAAGTTCGG CTTCCGCACC CGGGCCCTTC ACGCCGGCGG CACACCCGAC 
GCCGAGCACG GCGCGCGGGC CGTCCCGATC TACCAGACCA CGTCCTTCGT CTTCAAGGAC
ACCCAGGACG CCGCCAACCT CTTCGCCCTG CAGAAGTACG GCAACATCTA CTCGCGCATC
GGCAACCCCA CGGTGGCGGC GTTCGAAGAG CGCATTGCCT CCCTGGAGGG CGGCATCGGA
GCCGTTGCGA CGTCGTCGGG CATGGCCGCG GAGTTCATCA CCTTTGCCGC GCTCACCCAG
GCAGGCGACC ACATCGTGGC GGCCTCCCAG CTGTACGGCG GCACGGTCAC CCAGCTCGAC
GTCACGTTGC GCCGCTTCGG GGTGGACACC ACGTTCGTCC CCGGCACCGA CCCGGCGGAC
TACGCCGCCG CGGTCCGGGA GAACACCAAG GCGATCTTCG TCGAGGTGGT GGCCAACCCG
TCGTCGGAAG TCCAGGACCT TGAGGGGCTG GCAAAGGTGG CGCGCGACGC CGGCATTCCG
TTGGTCGTCG ACGCCACCTT GAGCACGCCG TACCTGGTGC GGCCGATCGA GCACGGGGCG
GACATCGTCA TCCACTCCGC CACCAAGTTC CTCGGCGGAC ACGGCACCAC CCTCGGCGGC
GTGATCGTCG AGAGCGGCCG GTTCAACTGG GGCAACGGCA AGTTCCCCAC CATGACCGAG
CCCGTGGCCT CCTACGGCAA CGTCTCCTGG TGGGGCAACT TCGGTGAGTA TGGCTTCCTG
ACCAAGCTCC GCTGCGAGCA GCTGCGGGAT ATCGGCCCCG CACTCTCTCC GCAGTCCGCG
TTCCAGCTGC TGCAGGGCGT GGAAACCCTT CCGCAGCGCC TCGACGAGCA CCTGAAGAAC
GCCCAGGCCG TGGCCGAATG GCTCGAAGCG GACGAGCGCG TGGCGTACGT CAACTTCTCC
GGATTGCCGT CGCACCCGCA CTTCGAGCGG GCGCAGAAGT ACCTGCCCCT GGGTCCAGGC
TCGGTATTCT CCTTCGGTGT CAAGGGCGGC CGCGCAGCCG GGCAGAAATT CATCGAGGCA
CTCCAGCTGG CCTCGCACCT GGCCAACGTC GGCGACTCCC GTACTCTCGT GATCCACCCC
GGCTCCACCA CCCACCAGCA GCTGAGCCCG GCCCAGCTTG AGTCTGCGGG AGTACCGGAA
GACCTGGTGC GGATTTCGAT CGGGCTCGAG GACCTCGAGG ACATCCTCTG GGACCTCGAC
CAGGCGCTGG ACGCTGCGTC TACACCGGTG GTCGAGCTTG TCGAGGCCGA CACCTGCACG
ATCGGAGCGA ACTGA
 
Protein sequence
MADRKFGFRT RALHAGGTPD AEHGARAVPI YQTTSFVFKD TQDAANLFAL QKYGNIYSRI 
GNPTVAAFEE RIASLEGGIG AVATSSGMAA EFITFAALTQ AGDHIVAASQ LYGGTVTQLD
VTLRRFGVDT TFVPGTDPAD YAAAVRENTK AIFVEVVANP SSEVQDLEGL AKVARDAGIP
LVVDATLSTP YLVRPIEHGA DIVIHSATKF LGGHGTTLGG VIVESGRFNW GNGKFPTMTE
PVASYGNVSW WGNFGEYGFL TKLRCEQLRD IGPALSPQSA FQLLQGVETL PQRLDEHLKN
AQAVAEWLEA DERVAYVNFS GLPSHPHFER AQKYLPLGPG SVFSFGVKGG RAAGQKFIEA
LQLASHLANV GDSRTLVIHP GSTTHQQLSP AQLESAGVPE DLVRISIGLE DLEDILWDLD
QALDAASTPV VELVEADTCT IGAN