Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2459 |
Symbol | |
ID | 4022950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2747089 |
End bp | 2748336 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962652 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_569590 |
Protein GI | 91976931 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.117109 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC ACCCGGCGGT CAGCAACGGC AGCTACGATG TCGCGCGGGT GCGCGAGGAT TTCCCCGCGC TGGCGCTCAA GGTCTACGGC AAGGATCTGG TCTATCTCGA CAACGCCGCC TCGGCGCAGA AGCCGCGCGT CGTGCTGGAG CGGATGACCA AGGCGTATGA GAGCGAATAC GCCAACGTGC ATCGCGGGCT GCATTATCTC GCCAACGCCG CGACCGAAGC CTATGAGGGC GGCCGCCACC GGGTGCAGAA GTTTCTCAAC GCCAAGCGGC CGGAAGAGAT CATCTTCACC CGCAACGCCA CCGAGGCGAT CAATCTGGTC GCGTCGTCGT TCGGCGCGCC GAATATCGGC GAGGGCGACG AGATCGTGCT CTCGATCATG GAGCACCATT CCAACATCGT GCCATGGCAC TTCCTGCGCG AGCGGCAGGG CGCGGTGATC AAATGGGCGC CGGTCGACGA CGACGGCAAT TTCCTGATCG ACGAATTCGA GAAGCTGCTG TCGTCGAAGA CCAAGCTGGT CGCGATCACC CAGATGTCGA ATGCGCTCGG CACCATCGTG CCGGTCAAGG ACGTGGTGAA GCTCGCGCAC GACCGCGGCA TTCCGGTGCT GGTCGACGGC AGCCAGGGCG CGGTGCATCT CACGATCGAC GTCCAGGACA TCGATTGCGA CTTCTACATC ATGACCGGGC ACAAGCTGTA CGGCCCGACC GGAATCGGCG TGCTGTACGG CAAGTACGAT GTCCTCGCCA AGATGCGGCC GTTCAACGGC GGCGGCGAGA TGATCCGCGA AGTCGCGCAG GACTGGGTGA CCTATGGCGA CCCGCCGCAC CGGTTCGAGG CCGGCACGCC GGCGATCGTC GAGGCGGTCG GGCTCGGTGC TGCGATCGAC TACGTCAATT CGATCGGCAA GGAGCGGATC GCGGCGCACG AACACGATCT TTTGACCTAC GCCCAGGATC GGCTGCGCGA GATCAATTCG CTGCGGCTGA TCGGCACCGC CAAGGGCAAG GGACCGGTGA TTTCCTTCGA AATGAAGGGC GCTCACCCGC ATGACATCGC CACCGTGATC GACCGCCAGG GGATTGCGGT GCGCGCCGGA ACCCATTGCG TGATGCCGTT GCTGGAGCGG TTCCAGGTCA CGGCGACGTG CCGCGCATCG TTCGGCATGT ATAATACCCG TGAGGAAGTC GACCAACTCG CACAGGCGCT GATCAAGGCG CGGGATCTGT TCGCATGA
|
Protein sequence | MTTHPAVSNG SYDVARVRED FPALALKVYG KDLVYLDNAA SAQKPRVVLE RMTKAYESEY ANVHRGLHYL ANAATEAYEG GRHRVQKFLN AKRPEEIIFT RNATEAINLV ASSFGAPNIG EGDEIVLSIM EHHSNIVPWH FLRERQGAVI KWAPVDDDGN FLIDEFEKLL SSKTKLVAIT QMSNALGTIV PVKDVVKLAH DRGIPVLVDG SQGAVHLTID VQDIDCDFYI MTGHKLYGPT GIGVLYGKYD VLAKMRPFNG GGEMIREVAQ DWVTYGDPPH RFEAGTPAIV EAVGLGAAID YVNSIGKERI AAHEHDLLTY AQDRLREINS LRLIGTAKGK GPVISFEMKG AHPHDIATVI DRQGIAVRAG THCVMPLLER FQVTATCRAS FGMYNTREEV DQLAQALIKA RDLFA
|
| |