Gene RPD_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0903 
Symbol 
ID4021377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1016203 
End bp1017429 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content66% 
IMG OID637961093 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_568042 
Protein GI91975383 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGA CCAAATCGCC CGATCCGAAG CCCTCCATCG CCATCCCCGA CACCGCTCAC 
TATCGCCGCG AGACCCGCCT GGTGCATTCC GGCAGCCTAC GCTCGCAATA TGGCGAGACC
TCCGAGGCGC TGTTTCTGAC CCAGGGCTTC GTCTACGACA GCGCCGAGCA ATGCGAAGCG
CGCTTCACCG GCGACGATCC CGGCTTCCAA TATTCGCGAT TCTCCAACCC GACGGTGTTC
AGCTTCGAGC AACGGATGGC GGAGTTCGAA GGCGCCGAGG CGGCGCGCGC CACCGCCACC
GGCATGGCCG CGGTGACCGC CGCGATGCTG GCGCCGCTGC GCGCCGGCGA TCACGTCGTC
GCCTCCAAGG CGATGTTCGG ATCGTGCCGC TACGTGGTTG AGGACCTGCT GCCGCGCTAC
GGCATCGAAT CGACGCTGGT CGACGGCCTC GATCTCGACC AATGGCAGCG CGCGGTTCGG
CCGAACACCA AGACGTTCTT TCTGGAAAGC CCGACCAATC CGACGCTCGA TGTGCTCGAC
ATCGGCGCTA TTGCGCAGAT CGCCCATGCC GGCGGCGCGC GCCTCGTCGT CGACAACGTG
TTCGCGACGC CGATCTGGCA GAGCCCGCTC GAACTCGGCG CCGACGTCGT GGTGTATTCC
GCGACCAAGC ACATCGACGG CCAGGGCCGC TGTCTCGGCG GCGTGGTGCT GGCGTCGCAG
GCGTTCATCG AGGAGCATAT CCAGATGTAT CTGCGCCAGA CCGGTCCGTC GCTGTCGCCG
TTCAACGCCT GGGTGCTGCT GAAGGGCCTG GAGACGCTGT CGGTCCGCGT CCGGCAGCAG
ACCGAGACCG CGGCGGCGAT CGCCGACGCG CTGGCGATTC ATCCCAAGGT CGCGCGGCTG
ATCTATCCCG GCCGCGCCGA TCATCCGCAG GCGGAGACCG TCAAGAAGCA GATGGGCGCC
GGCTCGACGC TGGTGGGCTT CGAGGTCAAG GGCGGCAAGG CCGAGGCGTT CCGCTTCCTC
AACGCGCTGA AGCTGGTGAA GATCAGCAAC AATCTCGGCG ACGCCAAGAG CCTGGTCACC
CACCCGGCGA CCACCACACA TCAGCGGCTG AAGCCGGAAG CCCGCGCCGA ACTCGGCATC
AGCGAGGGCT TCATCCGCTA TTCGGCCGGG CTGGAGCACA AGGACGATCT GATCGAGGAT
CTGATCGCCG CGCTCGATCA CGTGTGA
 
Protein sequence
MSETKSPDPK PSIAIPDTAH YRRETRLVHS GSLRSQYGET SEALFLTQGF VYDSAEQCEA 
RFTGDDPGFQ YSRFSNPTVF SFEQRMAEFE GAEAARATAT GMAAVTAAML APLRAGDHVV
ASKAMFGSCR YVVEDLLPRY GIESTLVDGL DLDQWQRAVR PNTKTFFLES PTNPTLDVLD
IGAIAQIAHA GGARLVVDNV FATPIWQSPL ELGADVVVYS ATKHIDGQGR CLGGVVLASQ
AFIEEHIQMY LRQTGPSLSP FNAWVLLKGL ETLSVRVRQQ TETAAAIADA LAIHPKVARL
IYPGRADHPQ AETVKKQMGA GSTLVGFEVK GGKAEAFRFL NALKLVKISN NLGDAKSLVT
HPATTTHQRL KPEARAELGI SEGFIRYSAG LEHKDDLIED LIAALDHV