Gene RPD_4194 details


Gene Information

Locus tag: RPD_4194
Symbol:
ID: 4024715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4661081
End bp: 4662103
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 61%
IMG OID: 637964400
Product: cysteine synthase
Protein accession: YP_571312
Protein GI: 91978653
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID: [TIGR01137] cystathionine beta-synthase
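
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene is a single unspliced open reading frame whose final codon is a stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4661081
end_bp = 4662103
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: one unspliced ORF; the last codon is a stop and codes for no residue.
codon_count = span_bp // 3
coded_residues = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("coded residues match protein length:", coded_residues == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1023 bp corresponds to 341 codons, one of which is the stop codon.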


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.360882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTGC ACGTCGGTAA CGCCGTCGGC GCGACCCCGC TGGTCGAGAT CACCAAGTTG 
GATATCCCGG ATGGCATCCG GGTATTCGCG AAGCTGGAGT TCCTCAATCC TGGCGGCAGC
ATCAAGGACC GCATGGTCAA ATACATTCTC GACCATGCCG AGCGTGTCGG GGCGCTGCAG
CCCGGTGCGA CGATTGTGGA GAATACGTCC GGAAACACCG GCGCGGCGAT CGCCATGTTC
GCGGCCGAAC GCGGATATCG GGCGATCCTG ACGATGCCGG ACAAGGTGAG CCAGGAGAAA
CAAAACGTCC TGCGCGCGAT GGGCGCACAG ACAATCGTTT GCCCGACGGC GGTTCGACCG
GATTCACCGG AGCACTACGT CGAGACGGCT CGACGGCTTC ACCGCGAGAT ACCCGGCTCG
TTCATGCTGA ACCAGTACGA CAATCCGCTG AACGCCGAGG CGCATTTCCA CACCACCGGG
CCGGAAATAT GGGAAGCGCT CGGCGGCCGG ATCACGGCCT TCGTTTCGTC GGGCAGCACC
GGGGGAACGA TTTCCGGTAT CGGCGGCTAT CTACGCTCGA AGAACCCCGA CATCCACGTC
GTTCTGCTCG ACCCCGTCGG CTCGATCTAT CACAAATACT TTCACGAAGG CGTCGTGGAT
CCGCGCGAAA TCGCGGCCTA CCACGTCGAG GGTGTCGGTG AGGACCATCT CGCCAAGTGC
ATGGATTTCT CGGTTCTGAC CAATGTAATT CGCTTCAACG ATCGCAACGC CATTCAAATG
TGCCACGAGC TTGCGCGGAA AGAAGGCTTG CTGTGCGGCG GCAGTTCAGG AGCCAATATC
TGGGGCTGCA TCGAAGTTGC GAAGGCGCTG AAGCCGCCGG CGGTCATTGT GACCGTTCTT
CCGGACAGCG GAGCGAAGTA CGTTTCCAAG ATTTACAACG CGGATTGGCT TGCGGAACAA
CGGTTCGCGG ACGGTAGCAG CGCAACGGTG TCGATGCGCA GTCCGGCGCC GAGCGACGCC
TGA
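
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, not the database's own method. The function name `gc_content` is hypothetical, and for brevity only the first row of the gene sequence is used as input, so the result describes that row and not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First row of the gene sequence from this record (60 of the 1023 bases).
FIRST_ROW = (
    "ATGAGATTGC ACGTCGGTAA CGCCGTCGGC GCGACCCCGC TGGTCGAGAT CACCAAGTTG"
)

print(f"GC fraction of first row: {gc_content(FIRST_ROW):.3f}")
```

To check the whole gene, the complete sequence block can be pasted into the string in place of the first row; the function already strips the spaces and line breaks used in the 10-base grouping.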
 
Protein sequence
MRLHVGNAVG ATPLVEITKL DIPDGIRVFA KLEFLNPGGS IKDRMVKYIL DHAERVGALQ 
PGATIVENTS GNTGAAIAMF AAERGYRAIL TMPDKVSQEK QNVLRAMGAQ TIVCPTAVRP
DSPEHYVETA RRLHREIPGS FMLNQYDNPL NAEAHFHTTG PEIWEALGGR ITAFVSSGST
GGTISGIGGY LRSKNPDIHV VLLDPVGSIY HKYFHEGVVD PREIAAYHVE GVGEDHLAKC
MDFSVLTNVI RFNDRNAIQM CHELARKEGL LCGGSSGANI WGCIEVAKAL KPPAVIVTVL
PDSGAKYVSK IYNADWLAEQ RFADGSSATV SMRSPAPSDA
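
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand and translates the first row of the gene sequence. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard genetic code and differs only in which start codons it allows; since this gene begins with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence and the first 20 residues of the protein,
# both copied from this record.
GENE_FIRST_ROW = (
    "ATGAGATTGC ACGTCGGTAA CGCCGTCGGC GCGACCCCGC TGGTCGAGAT CACCAAGTTG"
)
PROTEIN_START = "MRLHVGNAVGATPLVEITKL"

print(translate(GENE_FIRST_ROW) == PROTEIN_START)
```

The same function can be applied to the full 1023-base sequence; the trailing TGA is a stop codon and ends the translation.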