Gene Rpal_5251 details


Gene Information

Locus tag: Rpal_5251
Symbol:
ID: 6412952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5663168
End bp: 5664379
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 66%
IMG OID: 642715141
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_001994213
Protein GI: 192293608
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
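
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon that is not counted in the protein. Both readings are assumptions, not something the record states. A minimal sketch of the arithmetic, with variable names chosen only for illustration:

```python
# Values copied from the record.
start_bp = 5663168
end_bp = 5664379

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```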


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.903252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGAGT CCAAACCGAC CATTCCCGTT CCCGCCACCG CGCATTTCCG CCGCGAAACC 
CGCCTGGTGC ATTCCGGCAG CTTGCGCTCG CAGTTCGGCG AGACCTCCGA GGCGTTGTTC
CTGACTCAGG GCTTCGTCTA CGAAAGCGCC GAGCAGTGCG AGGCGCGCTT CACCGGCGAC
GACGCCGGCT TCCAATATTC CCGGTTCTCG AACCCGACCG TGTTCAGCTT CGAACAGCGG
ATGGCGGAGT TCGAAGGCGC AGAAGCGGCG CGCTCGACCG CCACCGGCAT GGCCGCGGTG
ACGGCCGCGA TGCTGGCGCC GCTGCGGGCA GGCGATCACG TCGTCGCCTC CAAGGCGATG
TTCGGCTCGT GCCGCTACGT CGTCGAAGAC CTGCTGCCGC GCTACGGCAT CGAGTCGACG
CTGGTCGACG GCCTCGATCT CGACCAATGG CAGCGTGCGG TCCGGCCGAA CACCAAGACA
TTCTTCCTGG AGAGCCCGAC CAACCCGACC CTCGACGTGC TCGACATCGG CGCGATCGCG
GAGATCGCCC ATGCGGCGGG CGCGCGCCTC GTGGTCGACA ACGTGTTCGC AACGCCGATC
TGGCAAAGCC CGCTGCAGCT TGGTGCCGAC GTCGTGGTTT ACTCCGCCAC CAAGCACATC
GACGGACAGG GCCGCTGTCT CGGCGGCGTG GTGCTGTCGT CGCAGGCGTT CATCGAAGAA
CACATCCAGA TGTATCTGCG CCAGACCGGC CCGTCGCTGT CGCCGTTCAA CGCCTGGGTG
CTGCTGAAGG GCCTGGAAAC GCTGGCGGTC CGTGTCGAGA AGCAGACCTC CAATGCGGCC
GCGATCGCCG ATGCGCTGGC CGGCCACCCG AAGGTGCCGC GCCTGGTTTA TCCGGGCCGG
GCCGATCACC CGCAGGCCGC GACGGTCAAG AAGCAGATGG GCGCAGGCTC GACCCTGGTC
GGCTTCGAGG TGAAGGGCGG CAAGGCCGAA GCGTTCCGCT TCCTCAATGC CCTGAAGCTG
GTGAAGATCA GCAACAATCT CGGCGACGCC AAGAGCCTCG TCACCCACCC GGCCACGACG
ACGCATCAGC GGCTGAAGCC GGAAGCGCGC GCCGAACTCG GCATCAGCGA AGGCTTCATC
CGATTGTCGG CCGGCCTCGA ACACAGAGAC GATCTGATCG AGGACCTGCT CGCCGCGCTC
GACAAGGTGT GA
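
The GC content reported for this gene can in principle be recomputed from the gene sequence. The sketch below is one way to do that. `gc_content` is a hypothetical helper, not part of any database tooling, and it assumes the sequence is pasted in the 10-base blocks used here, so whitespace is discarded before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: the first two blocks of the gene sequence, as an illustration only.
# Paste the full sequence to compare against the record's GC content field.
fragment = "ATGTCCGAGT CCAAACCGAC"
print(round(gc_content(fragment), 1))
```

Rounding conventions are not stated in the record, so a recomputed value may differ from the reported percentage in the last digit.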
 
Protein sequence
MSESKPTIPV PATAHFRRET RLVHSGSLRS QFGETSEALF LTQGFVYESA EQCEARFTGD 
DAGFQYSRFS NPTVFSFEQR MAEFEGAEAA RSTATGMAAV TAAMLAPLRA GDHVVASKAM
FGSCRYVVED LLPRYGIEST LVDGLDLDQW QRAVRPNTKT FFLESPTNPT LDVLDIGAIA
EIAHAAGARL VVDNVFATPI WQSPLQLGAD VVVYSATKHI DGQGRCLGGV VLSSQAFIEE
HIQMYLRQTG PSLSPFNAWV LLKGLETLAV RVEKQTSNAA AIADALAGHP KVPRLVYPGR
ADHPQAATVK KQMGAGSTLV GFEVKGGKAE AFRFLNALKL VKISNNLGDA KSLVTHPATT
THQRLKPEAR AELGISEGFI RLSAGLEHRD DLIEDLLAAL DKV