Gene Rsph17029_0500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0500 
Symbol 
ID4897245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp522651 
End bp523832 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID640111084 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001042388 
Protein GI126461274 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.960396 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.966435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGG ACTGGAAGAC AAGGACGCAA CTCGTCCACG GGGGCAGCCG CCGGAGCCAG 
TATGGCGAAA TGGCCGAGGC GATCTTCCTG ACCCAGGGCT TCGTCTACGA CTCGGCCGAA
CAGGCCGAGG CGCGCTTCAT CGAGACCGGC GCCGACGAAT TCATCTATGC CCGCTACGGC
AACCCCACGA CGCGCATGTT CGAAGAGCGC ATCGCGGCCG TCGAGGGCAC CGAGGATGCG
TTCGCCACCG CCTCGGGCAT GGCCGCGATC CACGGCGTGC TCACCTCCAT TGTGCGGGCG
GGCGATCATC TGGTGGCGGC ACGCGCTCTT TTCGGCTCCT GCATCTACAT CCTCGAGGAG
GTGCTGGGCC GGTTCGGCGT CGAGGTGACC TTCGTCGACG GCACCGATCT CGATCAGTGG
CGAGCGGCGG TGCGGCCCGG CACGAAGGCC GTGTTCTTCG AGTCGGTCTC GAACCCGACG
CTCGAGGTGG CCGACATCGG CGCCATCGCC GAGATCGCCC ATGCCGTGGG CGCGCTCGTC
ATCGTGGACA ATGTCTTCGC GACGCCCGTC TTCTCGACGG CGGTGCGGCA GGGCGCGGAT
GTGGTGATCT ATTCGGCCAC CAAGCACATC GACGGGCAGG GGCGCGCGCT CGGCGGCGTG
GTCTGCGCCT CGCAGGCCTT CATCCGCAAG GTGCTCGAAC CCTTCATGAA GCACACCGGC
GGCTCGATGA GCCCCTTCAA CGCCTGGCTC ATGCTGAACG GGATGGCGAC GCTCGACCTG
CGCTGCCGCG CGATGGCGGA CACGGCCGAG AAGATCGCCC GCGCGCTCGA GGGCCATCCC
CAGCTCGGCC GCGTGATCCA TCCCGCGCTG GAAAGCCACC CGCAGCACGA CATGGCCAAG
GCGCAGATGG AACGTCCCGG CACGATGATC GCGCTCGACC TCGCCGGGGG CAAGGAGGCG
GCCTTCCGCT TCCTCGACGC CCTGAAGATC GTGAAGATCT CGAACAATCT GGGCGATGCC
CGCTCGATCG CGACCCACCC GGCGACGACC ACCCACCAGC GTCTCTCCGA CGCGCAGAAG
GCCCATCTCG GCATCACGCC CGGACTCGTG CGGCTGTCGG TGGGGCTCGA GGATGCGGAC
GACCTGATCG CCGATCTGAA ACAGGCGCTC GCGGTGATCT GA
 
Protein sequence
MTKDWKTRTQ LVHGGSRRSQ YGEMAEAIFL TQGFVYDSAE QAEARFIETG ADEFIYARYG 
NPTTRMFEER IAAVEGTEDA FATASGMAAI HGVLTSIVRA GDHLVAARAL FGSCIYILEE
VLGRFGVEVT FVDGTDLDQW RAAVRPGTKA VFFESVSNPT LEVADIGAIA EIAHAVGALV
IVDNVFATPV FSTAVRQGAD VVIYSATKHI DGQGRALGGV VCASQAFIRK VLEPFMKHTG
GSMSPFNAWL MLNGMATLDL RCRAMADTAE KIARALEGHP QLGRVIHPAL ESHPQHDMAK
AQMERPGTMI ALDLAGGKEA AFRFLDALKI VKISNNLGDA RSIATHPATT THQRLSDAQK
AHLGITPGLV RLSVGLEDAD DLIADLKQAL AVI