Gene RPD_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1104 
Symbol 
ID4021580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1254918 
End bp1256222 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content64% 
IMG OID637961296 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_568243 
Protein GI91975584 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.394848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACG AAACGATTGC AATTCATGCC GGCTACGATC CCGATCCGAC GACCAAGGCG 
GTCGCGGTTC CGATCTATCA GACAGCCTCC TATGCATTCG ACAGCGCCGA TCACGGCGCC
GCTTTGTTCA ATCTCGAAAC CGAGGGCTAT CGCTACTCGC GGATCGCAAA TCCGACCAGC
AACGTGCTGG AAAAGCGCGT CGCCGAGCTT GAAGGCGGGG TGGGGGCGCT CGCGGTCGCC
AGCGGGCAGG CTGCACTGCA CTACGCCTTC GTGAACCTCG CCGATCACGG CGGCAATATC
GTTTCGGTGC CGCAGCTCTA CGGCACGACG CACACGCTGC TGTCGCACGT TCTGCCGCGG
CAGGGGATTA TCGGTCGCTT CGCCGAGAGC GACCAGCCCG ATGCGATCGA GCGTCTGATC
GACGAGAATA CGCGCGCCGT CTTCTGCGAG ACCATCGGCA ATCCCGCGGG GAACATCTGC
GACATCGAGC GCATCGCCGA GGTGGCGCAC CGGCACGGCG TGCCGCTGAT CGTCGACAAC
ACCGTTGCGA CTCCGATCCT GCTCAAGCCG ATCGACTACG GCGCGGACAT CGTCGTGCAT
TCGCTGACCA AGTTTCTCGG CGGCCACGGC ACCACGCTCG GCGGCGCGAT CGTCGACAGC
GGCCGGTTCG ACTGGTCGGC GCAGCCGCAG CGCTTTCCCG CCTTCAACCA GCCGGATCAT
TCCTATCACG GCATGATCTA CAGCGAACAT TTCGGTCCGC GCGCCTATAT CGAGCGCGCC
CGCAGCGTGT TTCAGCGTAC CATGGGGTCG GTGCTGTCGC CGTTCAGCGC GTTTCTGCTG
CTGCAGGGGA TCGAGACCGT CGCGCTGCGG ATGGAGCGGC ACGTCGAGAA TGCCCGCAAG
GTGGCCGAGT TCCTGCGCGA CGATCCGCGC GTCGCCTGGG TGAACTACAC CGGATTCCCC
GACAGTCCGT ATTACGAACT GGTCCAGAAA TATCTCGGCG GCCAGGCGTC GTCGCTGTTC
ACCTTCGGCA TCAAAGGCGG CCTCGAGGCC GGCAAGAGCT TCTACGACGC GCTCAAGCTG
GTCACCCGGC TGGTCAATAT CGGTGACGCC AAGTCGCTGG CCTGTCATCC GGCGTCGACG
ACGCATCGCC AGATGTCGGC CGAACAGCAG CGCATCGCCG GCGTACTGCC GGAAACGATC
CGGCTGTCGA TCGGGATCGA GCACATCTCC GATATTATCG CCGATCTCGA CCAGGCGCTG
GCGCAGGCCA GCGGTCAACA GCCGCGCCTT CTGGCCGCGG AATAG
 
Protein sequence
MRNETIAIHA GYDPDPTTKA VAVPIYQTAS YAFDSADHGA ALFNLETEGY RYSRIANPTS 
NVLEKRVAEL EGGVGALAVA SGQAALHYAF VNLADHGGNI VSVPQLYGTT HTLLSHVLPR
QGIIGRFAES DQPDAIERLI DENTRAVFCE TIGNPAGNIC DIERIAEVAH RHGVPLIVDN
TVATPILLKP IDYGADIVVH SLTKFLGGHG TTLGGAIVDS GRFDWSAQPQ RFPAFNQPDH
SYHGMIYSEH FGPRAYIERA RSVFQRTMGS VLSPFSAFLL LQGIETVALR MERHVENARK
VAEFLRDDPR VAWVNYTGFP DSPYYELVQK YLGGQASSLF TFGIKGGLEA GKSFYDALKL
VTRLVNIGDA KSLACHPAST THRQMSAEQQ RIAGVLPETI RLSIGIEHIS DIIADLDQAL
AQASGQQPRL LAAE