Gene RPB_1360 details

Gene Information

Locus tag: RPB_1360
Symbol:
ID: 3908465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1550300
End bp: 1551607
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 66%
IMG OID: 637883254
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_484981
Protein GI: 86748485
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
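
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the gene length counts a single terminal stop codon that is not part of the protein. The function names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1550300
END_BP = 1551607
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the derived values with the ones reported in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported values agree: 1551607 - 1550300 + 1 = 1308 bp, and 1308 / 3 - 1 = 435 aa.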


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.364939
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.516055
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAA AGCTTCACCC TGAAACCCTC GCGCTGCACG CGGGCTGGCG CGCCGATCCT 
TCGACCGGCT CGGTCGCAGT GCCGATCTTT CAAACGACGT CCTATCAATT CAACAACACA
GAGCACGCGG CCAACCTGTT CGCACTGAAG GAACTCGGCA ACATCTACAC CCGGATCGGC
AACCCAACGA CCGACGTGCT GGAAAAGCGC GTCGCGGCGC TGGAAGGCGG CGTCGCGGCG
CTCGCAGTGG CGTCGGGCCA GGCGGCTTCG GCCTTTGCGA TCCAGAACCT CGCTCGCGTC
GGCGACAACG TCGTCAGTTC GACCGATCTC TATGGCGGCA CCTGGAATCT GTTCGCCAAC
ACGCTGAAGG ACCAGGGCAT CGAAGTCCGC TTCGTCGATC CGGCCGATCC GCAAGCCTTC
GAGCGCGCCA CCGACGACCG CACCCGCGCC TATTACGCCG AGACGCTGCC GAACCCGAAA
CTCGCGGTGT TTCCGATCGC CGAAGTGGCC GCGATCGGCC GCAAATTCGG CATTCCGCTG
ATCGTCGACA ACACCGCCGC GCCGTTGCTG GTGAAGCCGC TCGAACACGG CGCCGCGATC
GTGGTGTATT CAGCGACCAA ATATCTCGGC GGCCACGGCA CCTCGATCGG CGGCTTGATC
GTCGACGGCG GCAATTTCGA CTGGGAGACA TTCCCGCAGC GTCAGCCGGC GCTGAACACC
CCGGATCCGA GCTATCACGG CGCCGTCTGG GTCGAGGCGG TGAAGCCGAT CGGCCCGGTC
GCCTACATCA TCAAGGCGCG CACCACGCTG CTGCGCGACA TCGGCTCGGC GCTGTCGCCG
TTCAACGCCT TCCAGATCCT GCAGGGCCTG GAAACGCTGC CGCTGCGAAT CGAACGTCAC
GTGCAGAACG CCCAGGCGGT CGCCGACTAT CTCGAAAAGC GGCCCGAGGT CGTCAAGGTG
ATCCATCCGT CGAAGCTGAG CGGCGTCGCG CGAGAGCGCG CCGACAAATA CCTCAAGGGC
AAGTTCGGCG GCCTGGTCGG CTTCGAGCTC GCGGGCGGCC TCGAGGCCGG GCGCAAATTC
ATCGACGCGC TGCAACTGTT GTATCACGTC GCCAATATCG GCGACGCGCG CAGCCTCGCG
ATCCATCCGG CGACGACCAC GCACTCGCAG CTTTCCGCCG AGGACCAGCT CGCGACCGGC
GTGTCGGACG GCTATGTCCG GCTGTCGGTC GGCCTCGAAC ACATCGACGA CATCATCGTC
GATCTGGAGC GCGGCCTCGC CGCGGGACGC CTCGCCAAGG CGGCGTAA
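
A GC percentage like the one reported under Gene Information can be computed from a sequence laid out in space-separated blocks of ten. This is a minimal sketch with a hypothetical helper, gc_percent; whether the database derives and rounds its figure this way is an assumption.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGACCGAAA AGCTTCACCC TGAAACCCTC GCGCTGCACG CGGGCTGGCG CGCCGATCCT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence, with its line breaks and spaces left in place, gives the value to compare with the reported GC content.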
 
Protein sequence
MTEKLHPETL ALHAGWRADP STGSVAVPIF QTTSYQFNNT EHAANLFALK ELGNIYTRIG 
NPTTDVLEKR VAALEGGVAA LAVASGQAAS AFAIQNLARV GDNVVSSTDL YGGTWNLFAN
TLKDQGIEVR FVDPADPQAF ERATDDRTRA YYAETLPNPK LAVFPIAEVA AIGRKFGIPL
IVDNTAAPLL VKPLEHGAAI VVYSATKYLG GHGTSIGGLI VDGGNFDWET FPQRQPALNT
PDPSYHGAVW VEAVKPIGPV AYIIKARTTL LRDIGSALSP FNAFQILQGL ETLPLRIERH
VQNAQAVADY LEKRPEVVKV IHPSKLSGVA RERADKYLKG KFGGLVGFEL AGGLEAGRKF
IDALQLLYHV ANIGDARSLA IHPATTTHSQ LSAEDQLATG VSDGYVRLSV GLEHIDDIIV
DLERGLAAGR LAKAA