Gene Information |
Locus tag | RPC_2837 |
Symbol | |
ID | 3970065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3078486 |
End bp | 3079733 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637925949 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_532704 |
Protein GI | 90424334 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
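The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (the usual convention for such records, not stated on this page) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3078486, End bp 3079733.
length = gene_length(3078486, 3079733)
print(length, protein_length(length))  # prints: 1248 415
```

Both results match the Gene Length (1248 bp) and Protein Length (415 aa) rows.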
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00234776 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGACAC ATCCCGCGGT TTCCAATGGA AGCTATGACG TCGCGCGTGT GCGCGAGGAT TTCCCCGCGC TGGCGCTCAA GGTCTATGGC AAACCGCTGA TCTATCTCGA CAACGCCGCC TCGGCGCAGA AGCCGCGGGT GGTGCTGGAT CGCATGACGC AGGCCTATGA GAACGAATAC GCCAACGTGC ATCGCGGGCT GCATTATCTC GCCAACGCGG CGACCGAGGC CTATGAGGGC GGCCGCGCGC GGGTCACCAG CTTCCTCAAC GCCAGGCGGC CGGAAGAGAT CATCTTCACC CGCAACGCCA CCGAGGCGAT CAACCTCGTG GCGTCGTCGT GGGGCGCGCC CAATATCGGC GAGGGCGACG AGATCGTGCT CTCGATCATG GAGCACCATT CCAACATCGT GCCGTGGCAT TTCCTGCGCG AACGCCAGGG CGCGGTGATC AAATGGGCGG AGGTCGACGA CGACGGCAAC TTCCTGCTGG AAGAATTCGA AAAGCAGCTG ACGGCGAAGA CCAAGCTGGT CGCGATCACC CAGATGTCGA ACGCGCTCGG CACCGTGGTG CCGGTCAAGG AGGTGGTGCG GATCGCGCAC GCCCGCGGCA TTCCGGTCTT GGTCGACGGC AGCCAGGCCG CGGTGCATAT GGCGATCGAC GTCCAGGACA TCGACTGCGA CTTCTACGTC ATGACCGGCC ACAAGGTTTA TGGCCCGACC GGGATCGGCG TATTGTACGC CAAATACGAT CACCTGGTGG CGATGCGGCC GTATTGCGGC GGCGGCGAAA TGATCCGCGA AGTGTCGCGC GACGTCGTGA CTTATGGCGA TCCGCCGCAC AAATTCGAGG CCGGCACGCC GGCGATCGTC GAGGCGGTGG GATTGGGCGC CGCGATCGAC TACGTCAATT CGATCGGCAA AGCGCGGATC GCGGCGCACG AGCACGATCT GTTGGACTAC GCCCAGGCGC AGCTGCGCGA GATCAATTCG GTGCGCATCA TCGGCACCGC CCGCGACAAG GGGCCGGTGA TCTCGTTCGA GATGAAGGGC GCGCATCCGC ACGACATCGC CACGGTGATC GACCGCTCCG GCATCGCGGT GCGCGCGGGA ACTCACTGCG TGATGCCGCT TTTGGAGCGA TTCCAAGTCA CCGCGACGTG TCGGGCCTCG TTTGGGATGT ATAACACCCG CGAGGAAGTC GACCAATTCG TACAGGCGCT GATCAAGGCG CGGGATTTGT TCTCATGA
Protein sequence | MTTHPAVSNG SYDVARVRED FPALALKVYG KPLIYLDNAA SAQKPRVVLD RMTQAYENEY ANVHRGLHYL ANAATEAYEG GRARVTSFLN ARRPEEIIFT RNATEAINLV ASSWGAPNIG EGDEIVLSIM EHHSNIVPWH FLRERQGAVI KWAEVDDDGN FLLEEFEKQL TAKTKLVAIT QMSNALGTVV PVKEVVRIAH ARGIPVLVDG SQAAVHMAID VQDIDCDFYV MTGHKVYGPT GIGVLYAKYD HLVAMRPYCG GGEMIREVSR DVVTYGDPPH KFEAGTPAIV EAVGLGAAID YVNSIGKARI AAHEHDLLDY AQAQLREINS VRIIGTARDK GPVISFEMKG AHPHDIATVI DRSGIAVRAG THCVMPLLER FQVTATCRAS FGMYNTREEV DQFVQALIKA RDLFS
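The sequences are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The sketch below, using only the Python standard library, strips the spacing, computes a GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on two assumptions: translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG), and the gene sequence is shown in coding orientation even though the gene lies on the minus strand (it begins with ATG and ends with TGA).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGACGACAC ATCCCGCGGT TTCCAATGGA"
print(translate(first_blocks))  # prints: MTTHPAVSNG
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the same comparison against the GC content and protein sequence rows.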