Gene Rsph17029_1095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1095 
Symbol 
ID4895095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1131502 
End bp1132794 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content67% 
IMG OID640111681 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001042977 
Protein GI126461863 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.337792 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCAG ACCGCAAACT CGGCTTCGAC ACGCTGCAGA TCCACGCCGG GGCCAAGCCG 
GATCCGGCGA CGGGCGCCCG GCAGGTGCCG ATCTACCAGA CGACCGCCTA TGTGTTCCGC
GACGCCGAGC ACGCGGCACG TCTCTTCAAT CTGGAAGAGG TGGGCTATAT CTACTCGCGC
CTTACCAACC CGACCGTCAT GGCTCTGGCC GAGAGGGTGG CCGCGCTGGA GGGAGGCGCG
GGCGCCGTCT GCTGCTCCTC GGGGCATGCC GCGCAGATCA TGGCGCTCTT CCCGCTGATG
GCACCCGGCC GCAACATCGT GGCTTCGACA CGTCTCTACG GCGGCACGAT CACACAATTC
TCGCAGACGA TCAGGCGGTT CGGCTGGTCG GCCAAGTTCG TGGACTTCGA CGATCCCGCC
GCCATCGAGG CCGCGATCGA CTCGGATACG CGCGCCCTCT TCTGCGAGAC CATTGCCAAC
CCCGGCGGCG TCATCACGGA TCTCGATGCG GTCTCGGCCA TCGCGGACAA GATGGGCCTG
CCGCTCATCG TGGACAACAC CACTGCCACG CCTTGGCTCT GCCGCCCGAT CGAGCATGGC
GCGACACTCG TCGTTCATTC CGCAACGAAA TACCTGACCG GCAATGGCAC GGTGACCGGC
GGCGTGATCG TGGACTCGGG CAAGTTCGAC TGGTCCGCGT CGGACAAGTT CCCGAGCCTG
TCGCAGCCCG AGCCGGCCTA CCATGGCCTC GTCTTCCACA AGGCGCTGGG GCCGATGGCC
TACACGTTCC ACTCCATCGC CGTGGGCCTG CGCGATCTCG GCATGACCAT GAACCCGCAG
GGGGCGCATT ACACGCTGAT GGGGATCGAG ACCCTCAGCC TGCGCATGGC CCGGCATGTC
GAGAACGCGC AGAAGGTGGC CGCCTGGCTG GAGCAGGACC CGCGGGTGGA ATTCGTGAGC
TACGCAGGAT TGCCCTCCTC GCCCTGGCAC GGCCGCGTCG CGCGGATCTG CCCGAAGGGG
GCCGGAGCGC TCTTCACCTT CGCGGTCAAG GGCGGCTACG ACGCGTGCGT GGCGCTCGTC
GATGCGCTGC AGCTGTTCAG CCATGTCGCC AACCTCGGCG ATACACGGTC GCTTGTGATC
CACTCGGCCT CCACCACCCA TCGCCAGCTC ACGCCCGAGC AGCAGGTGGC GGCCGGCGCA
GCGCCGAATG TCGTGCGCAT CTCGATCGGG ATCGAGGATG CCGACGATCT GATCGCGGAC
CTGGATCAGG CCCTAGCCAA GGCGACGGCC TGA
 
Protein sequence
MSSDRKLGFD TLQIHAGAKP DPATGARQVP IYQTTAYVFR DAEHAARLFN LEEVGYIYSR 
LTNPTVMALA ERVAALEGGA GAVCCSSGHA AQIMALFPLM APGRNIVAST RLYGGTITQF
SQTIRRFGWS AKFVDFDDPA AIEAAIDSDT RALFCETIAN PGGVITDLDA VSAIADKMGL
PLIVDNTTAT PWLCRPIEHG ATLVVHSATK YLTGNGTVTG GVIVDSGKFD WSASDKFPSL
SQPEPAYHGL VFHKALGPMA YTFHSIAVGL RDLGMTMNPQ GAHYTLMGIE TLSLRMARHV
ENAQKVAAWL EQDPRVEFVS YAGLPSSPWH GRVARICPKG AGALFTFAVK GGYDACVALV
DALQLFSHVA NLGDTRSLVI HSASTTHRQL TPEQQVAAGA APNVVRISIG IEDADDLIAD
LDQALAKATA