Gene RSP_2431 details

Gene Information

Locus tag: RSP_2431
Symbol:
ID: 3720028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007493
Strand:
Start bp: 1064635
End bp: 1065927
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 67%
IMG OID: 640070612
Product: putative O-acetylhomoserine sulfhydrylase
Protein accession: YP_352493
Protein GI: 77462989
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
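
The length fields in this record follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither convention is stated in the record), the arithmetic can be checked like this. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 1064635
END_BP = 1065927


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1293
print(protein_length(length_bp))  # 430
```

Both results agree with the Gene Length and Protein Length values listed in the record.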


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTCAG ACCGCAAACT CGGCTTCGAC ACGCTGCAGA TCCACGCCGG GGCCAAGCCG 
GATCCGGCGA CGGGCGCCCG GCAGGTGCCG ATCTACCAGA CGACCGCCTA TGTGTTCCGC
GACGCCGAGC ACGCGGCACG TCTCTTCAAT CTGGAAGAGG TGGGCTATAT CTACTCGCGC
CTTACCAACC CGACGGTCAT GGCTCTGGCC GAGAGGGTGG CCGCGCTGGA GGGAGGCGCG
GGCGCCGTCT GCTGCTCCTC GGGGCATGCC GCGCAGATCA TGGCGCTCTT CCCGCTGATG
GCACCCGGCC GCAACATCGT GGCCTCGACA CGTCTCTACG GCGGCACGAT CACACAATTC
TCGCAGACGA TCAGGCGGTT CGGCTGGTCG GCCAAGTTCG TGGACTTCGA CGATCCCGCC
GCCATCGAGG CCGCGATCGA CTCGGATACG CGCGCCCTCT TCTGCGAGAC CATTGCCAAC
CCCGGCGGCG TCATCACGGA TCTCGATGCG GTCTCGGCCA TCGCGGACAG GATGGGCCTG
CCGCTCATCG TGGACAACAC CACTGCCACG CCCTGGCTCT GCCGCCCCAT CGAGCATGGC
GCGACGCTCG TCGTTCATTC CGCAACGAAA TACCTGACCG GCAATGGCAC AGTGACCGGC
GGCGTGATCG TGGACTCGGG CAAGTTCGAC TGGTCGGCGT CGGACAAGTT CCCGAGCCTG
TCGCAGCCCG AGCCGGCCTA CCATGGCCTC GTCTTCCACA AGGCGCTGGG GCCAATGGCC
TACACGTTCC ACTCCATCGC TGTGGGCCTG CGCGATCTCG GCATGACCAT GAACCCGCAG
GGGGCGCATT ACACGCTGAT GGGGATCGAG ACCCTCAGCC TGCGCATGGC CCGGCATGTC
GAGAACGCGC AGAAGGTGGC CGCCTGGCTG GAGCAGGACC CGCGGGTGGA ATTCGTGAGC
TACGCAGGAT TGCCCTCCTC GCCCTGGCAC GGCCGCGTCG CGCGGATCTG CCCGAAGGGG
GCCGGAGCGC TCTTTACCTT CGCGGTCAAG GGCGGCTACG ACGCGTGCGT GGCGCTCGTC
GATGCGCTGC AGCTGTTCAG CCATGTCGCC AACCTCGGCG ATACACGGTC GCTTGTGATC
CACTCGGCCT CCACCACCCA TCGCCAGCTC ACGCCCGAGC AGCAGGTGGC GGCCGGCGCA
GCGCCGAATG TCGTGCGCAT CTCGATCGGA ATCGAGGATG CCGACGATCT GATCGCGGAC
CTGGATCAGG CCCTCGCCAA GGCGACGGCC TGA
 
Protein sequence
MSSDRKLGFD TLQIHAGAKP DPATGARQVP IYQTTAYVFR DAEHAARLFN LEEVGYIYSR 
LTNPTVMALA ERVAALEGGA GAVCCSSGHA AQIMALFPLM APGRNIVAST RLYGGTITQF
SQTIRRFGWS AKFVDFDDPA AIEAAIDSDT RALFCETIAN PGGVITDLDA VSAIADRMGL
PLIVDNTTAT PWLCRPIEHG ATLVVHSATK YLTGNGTVTG GVIVDSGKFD WSASDKFPSL
SQPEPAYHGL VFHKALGPMA YTFHSIAVGL RDLGMTMNPQ GAHYTLMGIE TLSLRMARHV
ENAQKVAAWL EQDPRVEFVS YAGLPSSPWH GRVARICPKG AGALFTFAVK GGYDACVALV
DALQLFSHVA NLGDTRSLVI HSASTTHRQL TPEQQVAAGA APNVVRISIG IEDADDLIAD
LDQALAKATA
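
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below, which is not the tool that produced this page, strips the 10-base block spacing, computes a GC fraction, and translates up to the first stop codon. It assumes the amino acid assignments of NCBI translation table 11, which are the same as the standard code (table 11 differs in its permitted start codons). The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGCTCAG ACCGCAAACT CGGCTTCGAC"))  # MSSDRKLGFD
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.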