Gene RPB_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1001 
Symbol 
ID3909298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1146374 
End bp1147678 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content66% 
IMG OID637882894 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_484622 
Protein GI86748126 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAACG AGACGATCGC CATTCACGCC GGCTACGATC CCGATCCGAC CACCAAGGCG 
GTCGCGGTCC CGATCTACCA GACCGCATCC TACGCCTTCG ACAGCGCCGA CCACGGCGCC
GCATTGTTCA ATCTCGAGAC CGAAGGCTAT CGCTATTCGC GGATCGCCAA TCCGACCAGC
GCTGTGCTGG AGAAGCGCGT CGCCGAGCTC GAGGGTGGCG TCGGCGCGCT CGCGGTCGCC
AGCGGCCAGG CGGCGCTGCA CTACGCCTTC GTCAACGTCG CCGATCACGG CGGCAACATC
GTCTCGGTGC CGCAGCTCTA CGGCACCACG CACACGCTGC TCTCCCACAT CCTGCCGCGC
CAGGGCATCC ACGGCCGCTT CGCCGAGAAC GACAGTGCCG CGGCGATCGA GAAGCTCATC
GATGCCGATA CCCGCGCGGT GTTCTGCGAG ACCATCGGCA ATCCCGCCGG CAATGTCTGC
GATATCGAGC GCATCGCCGA GGTGGCGCAT CGGCACGGCG TGCCGCTGAT CGTCGACAAC
ACGGTGGCGA CGCCGATCCT GATGAAGCCG TTCGACCACG GCGCCGACAT CGTGGTGCAT
TCGCTGACCA AGTTCCTCGG CGGCCACGGC ACGACGCTGG GCGGCGCGAT CGTCGACAGC
GGCCGGTTCG ACTGGGCGGC GCAGCCGCAG CGCTTTCCGG CGTTCAACCA GCCCGATCAT
TCCTATCACG GCATGGTCTA TGCCGAGCGG TTCGGCCCGA CGGCCTATAT CGAGCGCGCC
CGCAGCATCT ATCAGCGCAC CATGGGGTCG GTGCTGTCGC CGTTCAATGC GTTCCTGCTG
CTGCAGGGCA TCGAGACCGT CGCGCTGCGG ATGGAACGCC ACGTCGAGAA CGCCCGCAAG
GTGGCGGAGT TCCTGCGCGA CGATCCGCGC GTCGCCTGGG TGAACTACAC CGGCTTCCCG
GACAGTCCGT ATTACGAGCT GGTGCAGAAA TATCTCGGCG GCCGGGCGTC GTCGCTGTTC
ACCTTCGGCA TCAAGGGCGG CCTCGAGGCC GGCAAGAATT TCTACGATTC GCTGCGGCTG
ATCACCCGGC TGGTCAATAT CGGCGACGCC AAATCGCTCG CCTGCCATCC GGCCTCGACC
ACGCATCGGC AGATGTCCGC CGAGCAGCAG CGCACCGCCG GCGTGCTGCC GGAGACGATC
CGGCTGTCGA TCGGCATCGA ACACATCGCC GACATCATCG AAGATCTCGA TCAGGCGCTG
GCGCAGGCCG GTGGCCGGCG CACGCAACTG ATCGCGGCGG AATAA
 
Protein sequence
MRNETIAIHA GYDPDPTTKA VAVPIYQTAS YAFDSADHGA ALFNLETEGY RYSRIANPTS 
AVLEKRVAEL EGGVGALAVA SGQAALHYAF VNVADHGGNI VSVPQLYGTT HTLLSHILPR
QGIHGRFAEN DSAAAIEKLI DADTRAVFCE TIGNPAGNVC DIERIAEVAH RHGVPLIVDN
TVATPILMKP FDHGADIVVH SLTKFLGGHG TTLGGAIVDS GRFDWAAQPQ RFPAFNQPDH
SYHGMVYAER FGPTAYIERA RSIYQRTMGS VLSPFNAFLL LQGIETVALR MERHVENARK
VAEFLRDDPR VAWVNYTGFP DSPYYELVQK YLGGRASSLF TFGIKGGLEA GKNFYDSLRL
ITRLVNIGDA KSLACHPAST THRQMSAEQQ RTAGVLPETI RLSIGIEHIA DIIEDLDQAL
AQAGGRRTQL IAAE