Gene TM1040_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0585 
Symbol 
ID4076150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp624873 
End bp626165 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content61% 
IMG OID638005882 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_612580 
Protein GI99080426 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.528029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000697093 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGACA GCCCGACCTA TGGCTTTGAT ACGCTACAAA TTCACGCTGG CGCCCGCCCT 
GATCCGGCAA CCGGCGCGCG GCAGACGCCG ATCTATCAAT CCACTGCCTA TGTGTTTCGT
GACGCCGACC ACGCGGCGGC GCTCTTTAAC CTGCAGGAAG TAGGCTACAT CTATTCACGC
CTCACCAACC CGACCGTTGC GGTCCTGCAA GAGCGTCTCG CCACGCTCGA GGGTGGTGTC
GGCGCGGTCT GCTGCTCGTC GGGGCACGCC GCACAGATCA TGGCCCTGTT CCCCCTGATG
GGACCCGGCA AGAACGTGGT TGCCTCCACC CGCCTTTACG GGGGCACGGT GACGCAGTTC
AGCCAGACCA TCAAACGCTT CGGCTGGTCC GCAAAATTTG TTGATTTTGA CGATCTCGAT
GCGCTGCGCG AGGCGATCGA CGATGACACG CGGGCCGTCT TCTGCGAGTC CGTTGCCAAC
CCCGGTGGCT ATGTCACCGA CATCCGCGCG GTCGCCGATG TGGCAGATGC CGCAGGCCTG
CCCCTGATCG TCGACAACAC CTCTGCAACG CCCTACCTGT GCCAGCCGAT TTCCCAGGGC
GCGACCTTGG TAGTGCATTC CACAACCAAA TACCTCACCG GCAATGGTAC CGTAACCGGC
GGCTGCATCG TGGATTCGGG CAAGTTCGAC TGGTCCGCCT CCGACAAGTT CCCCTCGCTC
TCAGCTCCGG AACCCGCCTA TCACGGGCTC AAGTTCCACG AGACCTTCGG CAATCTTGCC
TTCACCTTCC ACGGCATTGC CATCGGGCTG CGTGATCTGG GCATGACCAT GAACCCGCAA
GCCGCACATT ACACCCTGAT GGGCATTGAG ACCCTGTCGC TTCGGATGCA GCGCCATGTG
GAAAATGCTG TGAAAGTCGC GACTTGGCTC GAAAACGACC CGCGCGTGGA TTACGTGACC
TATGCGGGTC TCCCCTCCTC GCCATATGCC GAGCGCGCCC AGCGTTGCTA TCCCAAGGGC
ACCGGCGGGC TGTTCACAGT CGCCATCAAA GGCGGCTATG ACGCTTGTGT CAAACTGGTC
GACAGCCTCG AGATCTTCTC GCATGTGGCC AATCTTGGCG ACACCCGGTC ACTCATCATC
CATTCCGCGT CCACGACTCA CCGCCAGCTT ACGCCCGAAC AGCAAGAAGC AGCGGGCGCT
GGCCCCAATG TGGTGCGTGT GTCCATCGGC ATCGAGGACG CAGACGATTT GATCCGCGAT
CTCGATCAAG CTCTGGCCAA GGCCTGCGGC TGA
 
Protein sequence
MSDSPTYGFD TLQIHAGARP DPATGARQTP IYQSTAYVFR DADHAAALFN LQEVGYIYSR 
LTNPTVAVLQ ERLATLEGGV GAVCCSSGHA AQIMALFPLM GPGKNVVAST RLYGGTVTQF
SQTIKRFGWS AKFVDFDDLD ALREAIDDDT RAVFCESVAN PGGYVTDIRA VADVADAAGL
PLIVDNTSAT PYLCQPISQG ATLVVHSTTK YLTGNGTVTG GCIVDSGKFD WSASDKFPSL
SAPEPAYHGL KFHETFGNLA FTFHGIAIGL RDLGMTMNPQ AAHYTLMGIE TLSLRMQRHV
ENAVKVATWL ENDPRVDYVT YAGLPSSPYA ERAQRCYPKG TGGLFTVAIK GGYDACVKLV
DSLEIFSHVA NLGDTRSLII HSASTTHRQL TPEQQEAAGA GPNVVRVSIG IEDADDLIRD
LDQALAKACG