Gene Rpal_2745 details


Gene Information

Locus tag: Rpal_2745
Symbol:
ID: 6410409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2986701
End bp: 2987948
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 63%
IMG OID: 642712621
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_001991729
Protein GI: 192291124
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
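
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2986701
END_BP = 2987948
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed values agree: 2987948 - 2986701 + 1 = 1248 bp, and 1248 / 3 - 1 = 415 aa.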


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACGC ATCCTGCAGT GAGAAACGGC AGTTACGACG TCGACCTCGT CCGTGAAGAC 
TTTCCGGCGT TGGCGCTGGA GGTCTATGGC AAGAAGCTGG TGTATCTCGA CAACGCCGCC
TCGGCGCAGA AGCCGCGGCA GGTGCTGACG CGGATGACGC AGGCGTATGA GAGTGAATAC
GCCAACGTGC ATCGCGGCCT GCATTATCTC GCCAATGCCG CGACAGAAGC CTATGAGGGC
GGCCGCACTC GGGTGCAACA TCTGCTCAAC GCCAAGCGGC CGGAAGAGAT CATCTTCACC
CGCAATGCCA CCGAGGCGAT CAACCTCGTG GCATCGTCGT GGGGTGCGAC GAACATCGGC
GAGGGCGACG AGATCGTGCT CTCGATCATG GAGCACCATT CGAACATCGT GCCGTGGCAC
TTCCTGCGCG AGCGCCAGGG CGCCGTGCTG AAATGGGCGC CGGTCGACGA CGAAGGCAAC
TTCCTGATCG ACGAGTTCGA GAAGCTGCTG ACCGCCAAGA CCAAGCTGGT CGCGATCACG
CAGATGTCGA ACGCGCTCGG CACCGTCGTC CCGGTCAAGG AGGTGGTGAA GATCGCCCAT
GCCCGCGGCA TTCCGGTGTT GGTCGACGGC AGCCAGGCAG CGGTGCATCT CGCCATCGAC
GTCCAGGACA TCGATTGCGA TTTCTATGTG ATGACCGGGC ACAAGATCTA CGGCCCGACC
GGGATCGGCG CGCTGTACGG CAAGTACGAC GTCCTCGCCA AGATGCGGCC CTACAACGGC
GGCGGCGAGA TGATCCGTGA GGTCGCCCAG GACTGGGTGA CCTACGGCGA CCCGCCGCAT
CGATTCGAGG CCGGCACGCC GGCGATCGTC GAGGCGGTCG GGCTCGGCGC CGCGATCGAC
TACGTCAATT CGATTGGCAA GGAACGGATC GCCGCCCACG AACACGATCT TTTGACCTAT
GCGGAGGAGC GGCTGCGGGA GATCAACGCG CTGCGCATCA TCGGCAGCGC AAAGGGCAAG
GGACCGGTGA TTTCCTTCGA AATGAAGGGG GCTCACCCGC ACGACGTCGC CACCGTGATC
GATCGGCAGG GCATCGCGGT CCGTGCCGGC ACCCATTGCG TGATGCCGCT GCTGGAGCGG
TTCCAAGTCA CTGCGACGTG CCGTGCGTCG TTCGGCATGT ATAATACCCG TGAGGAAGTG
GACCAACTCG TCAGTGCGCT GATCAAGGCG CGGGATCTGT TCGCATGA
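
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC fraction calculation. The helper names are hypothetical, and the record does not state how its own GC content figure was computed or rounded, so this is an illustration only.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, copied from the record above.
    first_row = (
        "ATGAGTACGC ATCCTGCAGT GAGAAACGGC "
        "AGTTACGACG TCGACCTCGT CCGTGAAGAC"
    )
    seq = clean_sequence(first_row)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block instead of the first row gives a value that can be compared with the GC content field, and the cleaned length can be compared with the Gene Length field.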
 
Protein sequence
MSTHPAVRNG SYDVDLVRED FPALALEVYG KKLVYLDNAA SAQKPRQVLT RMTQAYESEY 
ANVHRGLHYL ANAATEAYEG GRTRVQHLLN AKRPEEIIFT RNATEAINLV ASSWGATNIG
EGDEIVLSIM EHHSNIVPWH FLRERQGAVL KWAPVDDEGN FLIDEFEKLL TAKTKLVAIT
QMSNALGTVV PVKEVVKIAH ARGIPVLVDG SQAAVHLAID VQDIDCDFYV MTGHKIYGPT
GIGALYGKYD VLAKMRPYNG GGEMIREVAQ DWVTYGDPPH RFEAGTPAIV EAVGLGAAID
YVNSIGKERI AAHEHDLLTY AEERLREINA LRIIGSAKGK GPVISFEMKG AHPHDVATVI
DRQGIAVRAG THCVMPLLER FQVTATCRAS FGMYNTREEV DQLVSALIKA RDLFA