Gene RPC_4056 details


Gene Information

Locus tag: RPC_4056
Symbol:
ID: 3969305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4506287
End bp: 4507594
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 66%
IMG OID: 637927160
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_533901
Protein GI: 90425531
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
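
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4506287
end_bp = 4507594

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1308 435
```

Under these assumptions the computed values agree with the reported Gene Length (1308 bp) and Protein Length (435 aa).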


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.326439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAA AGCTTCATCC CGATACGCTC GCGCTTCACG CCGGCTGGCG CGCCGACCCG 
GCCACCGGCT CAGTCGCGGT GCCGATCTTT CAGACCACCT CGTACCAGTT CCACAACACC
GAGCACGCCG CCAACTTGTT CGCGCTGAAG GAACTCGGCA ACATCTACAC GCGGATCGGC
AACCCGACCA ATGACGTGCT GGAGCAGCGC GTGGCGGCGC TTGAAGGCGG CGTCGCGGCG
CTCGCGGTGT CGTCGGGCCA AGCCGCTTCG GCGTTCTCGC TGCAGAATCT TGCCCGGGTC
GGCGACAACG TCGTCAGTTC CACCGACCTC TATGGCGGCA CCTGGAATCT GTTCGCCAAC
ACGCTGAAGG ACCAGGGCAT CGAAGTGCGC TTCGTCGACC CGGCGGATCC CGAAGCCTTC
GCCCGCGCCA CCGACGATCG CACCCGCGCC TACTACGCCG AAACCCTGCC GAACCCGAAG
CTGGCGGTGT TTCCGATCGC CGAAGTCGCG GCGATCGGCC GCAAGTTCGG CATTCCGCTG
ATCGTCGACA ACACCGCCGC CCCGTTGCTG GTGCGTCCGT TCGATCATGG CGCGGCGGTC
GTGGTGTATT CGGCCACCAA ATATCTCAGC GGCCACGGCA CCTCGATCGG CGGCCTGATC
GTCGACGGCG GCAATTTCGA CTGGGAGAAA TTCCCGGAAC GCCAGCCGGC GCTGAATACG
CCCGATCCGA GCTATCACGG CGCGGTCTGG GTCGAGGCGG TCAAGCCGAT CGGCCCGGTC
GCCTACATCA TCAAGGCCCG CACCACGCTG TTGCGCGACA TCGGCTCGGC GCTGTCGCCG
TTCAACGCGT TCCAGATCAT TCAGGGCCTT GAAACCCTGC CCTTGCGCAT CGAGCGCCAC
GTGCAGAACG CGCAAGCCGT CGCCGACTTC CTGGAGAAGC GCCCCGAGGT CACCAAGGTG
ATCCATCCCT CCAAGTTGAC CGGGGTCGCC CGCGAGCGCG CCGACAAATA TCTCAACGGC
AAGTTCGGCG GCCTGGTCGG CTTTGAACTC GCCGGCGGCA AGGAGGCCGG GCGCAAATTC
ATCGACGCGC TGCAGCTGCT GTACCACGTC GCCAATATCG GCGATGCTCG CAGTCTGGCG
ATCCATCCGG CCTCGACCAC GCATTCGCAG CTCTCGGTCG AGGACCAACT CGCCACCGGC
GTGTCGGACG GCTACGTGCG GCTGTCGGTC GGCCTCGAGC ACATCGACGA CATCATCGCC
GATCTGGAAA CCGGCCTTGC CGCGGGACGT CTGGCCGCCG CCGCGTAA
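
The GC content field of this record is a property of the gene sequence shown here. One way such a percentage could be computed is sketched below in Python. The function name `gc_content` is hypothetical, and the sketch assumes the sequence contains only A, C, G and T plus the spaces and line breaks used for grouping.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)

# Usage on the final line of the gene sequence only (not the whole gene).
last_line = "GATCTGGAAA CCGGCCTTGC CGCGGGACGT CTGGCCGCCG CCGCGTAA"
print(f"{gc_content(last_line):.0%}")
```

Passing all of the sequence lines as one string would give the value for the whole gene, which can then be compared with the reported 66%.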
 
Protein sequence
MTRKLHPDTL ALHAGWRADP ATGSVAVPIF QTTSYQFHNT EHAANLFALK ELGNIYTRIG 
NPTNDVLEQR VAALEGGVAA LAVSSGQAAS AFSLQNLARV GDNVVSSTDL YGGTWNLFAN
TLKDQGIEVR FVDPADPEAF ARATDDRTRA YYAETLPNPK LAVFPIAEVA AIGRKFGIPL
IVDNTAAPLL VRPFDHGAAV VVYSATKYLS GHGTSIGGLI VDGGNFDWEK FPERQPALNT
PDPSYHGAVW VEAVKPIGPV AYIIKARTTL LRDIGSALSP FNAFQIIQGL ETLPLRIERH
VQNAQAVADF LEKRPEVTKV IHPSKLTGVA RERADKYLNG KFGGLVGFEL AGGKEAGRKF
IDALQLLYHV ANIGDARSLA IHPASTTHSQ LSVEDQLATG VSDGYVRLSV GLEHIDDIIA
DLETGLAAGR LAAAA
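
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that translation, applied to the first 60 bases only. The helper name `translate` and the compact codon-table construction are illustrative and not part of this record. Table 11 uses the standard codon-to-amino-acid assignments; its alternative start codons are not handled here.

```python
from itertools import product

# Standard codon assignments, listed in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)

first_line = "ATGACCAGAA AGCTTCATCC CGATACGCTC GCGCTTCACG CCGGCTGGCG CGCCGACCCG"
print(translate(first_line))  # prints: MTRKLHPDTLALHAGWRADP
```

The output matches the first 20 residues of the protein sequence listed in this record.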