Gene Rsph17025_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0638 
Symbol 
ID5082982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp637590 
End bp638771 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640482195 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001166849 
Protein GI146276690 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.711879 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGG ACTGGAAGAC ACGGACGCAG CTCGTCCACG GGGGCAGCCG GCGGAGCCAG 
TATGGCGAGA TGGCCGAGGC GATCTTTCTC ACGCAGGGCT TCGTCTATGA CACGGCCGGG
CAGGCCGAGG CCCGCTTCAT CGAGACGGGC GAGGACGAGT TCATCTACGC CCGCTACGGC
AACCCCACCA CCCGCATGTT CGAAGAGCGC ATCGCAGCCG TCGAGGGCAC CGAGGATGCC
TTTGCCACCG CCTCGGGCAT GGCGGCGATC CACGGCGTCC TGACCTCGAT CGTGCGGGCG
GGCGATCATC TGGTGGCGGC GCGCGCGCTC TTCGGCTCGT GCATCTACAT CCTCGAGGAG
GTGCTGGGCC GGTTCGGGGT CGAGGTGACC TTCGTCGATG GCACCGACCT TGACCAGTGG
CGCGCGGCGG TTCGGCCGGG CACCAAGGCC GTGTTCTTCG AGTCGGTCTC GAACCCGACG
CTCGAGGTGG CCGACATCGG CGCCATCGCC GCGATCGCCC ATGCGGTGGG CGCGCTGGTG
ATCGTGGACA ATGTCTTCAC CACCCCCGTC TTTTCGACCG CGGTCCGGCA GGGTGCGGAT
GTGGTGATCT ATTCTGCCAC CAAACATATC GACGGGCAGG GACGCGCGCT CGGCGGCGTG
GTCTGCGCCT CGCAGCACTT CATCCGCAAG GTGCTCGAGC CCTTCATGAA ACATACCGGC
GGCTCGATGA GCCCCTTCAA CGCCTGGCTC ATGCTGAACG GCATGGCGAC GCTCGACCTG
CGCTGCCGGG CGATGGCCGA CACGGCCGAG AAGATCGCCC GGGCGCTCGA GGACGATGCG
CGGCTCGTGC GCGTGATCCA CCCCTCGCTG AAGAGCCATC CGCAGCACGA GATGGCGAAG
GCGCAGATGG ACCGCCCCGG CACGATGATC GCGCTTGATC TGGCCGCGGG CAAGGAGGCG
GCCTTCCGCT TCCTCGACGC GCTGAAGATC ATCAAGATCT CGAACAATCT CGGCGACGCG
CGCTCGATCG CGACCCATCC GGCGACAACC ACCCACCAGC GGCTGTCGGA CGCGCAGAAG
GCCCATCTCG GCATCACGCC GGGCCTCGTG CGGCTGTCGG TGGGGCTCGA GGATGCGGAC
GATCTGATTG CCGACCTGAA ACAGGCGCTC GACGTGATCT GA
 
Protein sequence
MTKDWKTRTQ LVHGGSRRSQ YGEMAEAIFL TQGFVYDTAG QAEARFIETG EDEFIYARYG 
NPTTRMFEER IAAVEGTEDA FATASGMAAI HGVLTSIVRA GDHLVAARAL FGSCIYILEE
VLGRFGVEVT FVDGTDLDQW RAAVRPGTKA VFFESVSNPT LEVADIGAIA AIAHAVGALV
IVDNVFTTPV FSTAVRQGAD VVIYSATKHI DGQGRALGGV VCASQHFIRK VLEPFMKHTG
GSMSPFNAWL MLNGMATLDL RCRAMADTAE KIARALEDDA RLVRVIHPSL KSHPQHEMAK
AQMDRPGTMI ALDLAAGKEA AFRFLDALKI IKISNNLGDA RSIATHPATT THQRLSDAQK
AHLGITPGLV RLSVGLEDAD DLIADLKQAL DVI