Gene RPC_2837 details


Gene Information

Locus tag: RPC_2837
Symbol:
ID: 3970065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3078486
End bp: 3079733
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 63%
IMG OID: 637925949
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_532704
Protein GI: 90424334
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
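
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3078486
end_bp = 3079733

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codons = gene_length // 3

# Assumption: the final codon is a stop codon and yields no amino acid.
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1248 bp) and Protein Length (415 aa).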


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00234776
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACAC ATCCCGCGGT TTCCAATGGA AGCTATGACG TCGCGCGTGT GCGCGAGGAT 
TTCCCCGCGC TGGCGCTCAA GGTCTATGGC AAACCGCTGA TCTATCTCGA CAACGCCGCC
TCGGCGCAGA AGCCGCGGGT GGTGCTGGAT CGCATGACGC AGGCCTATGA GAACGAATAC
GCCAACGTGC ATCGCGGGCT GCATTATCTC GCCAACGCGG CGACCGAGGC CTATGAGGGC
GGCCGCGCGC GGGTCACCAG CTTCCTCAAC GCCAGGCGGC CGGAAGAGAT CATCTTCACC
CGCAACGCCA CCGAGGCGAT CAACCTCGTG GCGTCGTCGT GGGGCGCGCC CAATATCGGC
GAGGGCGACG AGATCGTGCT CTCGATCATG GAGCACCATT CCAACATCGT GCCGTGGCAT
TTCCTGCGCG AACGCCAGGG CGCGGTGATC AAATGGGCGG AGGTCGACGA CGACGGCAAC
TTCCTGCTGG AAGAATTCGA AAAGCAGCTG ACGGCGAAGA CCAAGCTGGT CGCGATCACC
CAGATGTCGA ACGCGCTCGG CACCGTGGTG CCGGTCAAGG AGGTGGTGCG GATCGCGCAC
GCCCGCGGCA TTCCGGTCTT GGTCGACGGC AGCCAGGCCG CGGTGCATAT GGCGATCGAC
GTCCAGGACA TCGACTGCGA CTTCTACGTC ATGACCGGCC ACAAGGTTTA TGGCCCGACC
GGGATCGGCG TATTGTACGC CAAATACGAT CACCTGGTGG CGATGCGGCC GTATTGCGGC
GGCGGCGAAA TGATCCGCGA AGTGTCGCGC GACGTCGTGA CTTATGGCGA TCCGCCGCAC
AAATTCGAGG CCGGCACGCC GGCGATCGTC GAGGCGGTGG GATTGGGCGC CGCGATCGAC
TACGTCAATT CGATCGGCAA AGCGCGGATC GCGGCGCACG AGCACGATCT GTTGGACTAC
GCCCAGGCGC AGCTGCGCGA GATCAATTCG GTGCGCATCA TCGGCACCGC CCGCGACAAG
GGGCCGGTGA TCTCGTTCGA GATGAAGGGC GCGCATCCGC ACGACATCGC CACGGTGATC
GACCGCTCCG GCATCGCGGT GCGCGCGGGA ACTCACTGCG TGATGCCGCT TTTGGAGCGA
TTCCAAGTCA CCGCGACGTG TCGGGCCTCG TTTGGGATGT ATAACACCCG CGAGGAAGTC
GACCAATTCG TACAGGCGCT GATCAAGGCG CGGGATTTGT TCTCATGA
 
Protein sequence
MTTHPAVSNG SYDVARVRED FPALALKVYG KPLIYLDNAA SAQKPRVVLD RMTQAYENEY 
ANVHRGLHYL ANAATEAYEG GRARVTSFLN ARRPEEIIFT RNATEAINLV ASSWGAPNIG
EGDEIVLSIM EHHSNIVPWH FLRERQGAVI KWAEVDDDGN FLLEEFEKQL TAKTKLVAIT
QMSNALGTVV PVKEVVRIAH ARGIPVLVDG SQAAVHMAID VQDIDCDFYV MTGHKVYGPT
GIGVLYAKYD HLVAMRPYCG GGEMIREVSR DVVTYGDPPH KFEAGTPAIV EAVGLGAAID
YVNSIGKARI AAHEHDLLDY AQAQLREINS VRIIGTARDK GPVISFEMKG AHPHDIATVI
DRSGIAVRAG THCVMPLLER FQVTATCRAS FGMYNTREEV DQFVQALIKA RDLFS
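
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate codons. It uses the first three blocks of the gene sequence as sample input. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which is assumed here to give the same amino acid assignments as translation table 11 (the two differ in the start codons they permit).

```python
# Build the standard codon table in TCAG order; '*' marks a stop codon.
# Assumption: translation table 11 assigns the same amino acids per codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from this record.
sample = "ATGACGACAC ATCCCGCGGT TTCCAATGGA"
print(translate(sample))
print(round(gc_fraction(sample), 3))
```

Passing the full gene sequence to the same functions should allow the result to be compared against the listed protein sequence and GC content.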