Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0077 |
Symbol | |
ID | 4895779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 85763 |
End bp | 86668 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640110654 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001041969 |
Protein GI | 126460855 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGGC TCGGATGGTC GGATGTGACG CCCAGGGCGG ATTGGCTGAA CCGGCGGCAG ATCCTTGCAG GGGCGGGGGC GCTGGGGCTG GCGGGGCCCG CCTTCGCCCG GATCGAGGCG AAGGCCAGCC GCTTCTCGAC GGATGAGAAG CCCAACAGCT TCGAGGAGAT CTCGAACTAC AACAACTTCT ATGAGTTCGG CCTCGACAAG GGCGATCCGG CGCAGAATGC CGGCGCCCTG ACCGTGGATC CCTGGTCGGT CGAGATCGGG GGCCTCGTCG AGCGTCCGGG CGCCTATCCG CTCGATGACA TCCTGAAGGG GGTGACGCTC GAGGAGCGGA TCTACCGCCT GCGCTGCGTC GAGGGTTGGT CGATGGTGGT GCCCTGGATC GGGTTCGAGC TGCGGACCCT CCTCGAGCGG GCAGGCGTGC AGCCAGGCGC CCGGTTCGTG GCCTTCGAGA CGCTCGTCCG CCCGGAAGAG ATGCCGGGCG TCCGCTCGCG CATCCTCGAC TGGCCCTATC GCGAGGGGCT GCGGATCGAC GAGGCAATGC ATCCGCTGAC GATCCTCGCC ACCGGCCTCT ATGGCGAGGA GATGCCGAAG CAGAATGGCG CGCCGATCCG GCTGGTCGTG CCGTGGAAAT ACGGCTTCAA ATCGATCAAG AGCATCGTGC GGATCTCGCT GGTCGAGAAG ATGCCCGCCA CCTCCTGGAA CATGCAGAAT GCACGCGAAT ACGGCTTCTA CTCCAATGTG AACCCGGCGG TGGATCATCC GCGCTGGAGC CAGGCCTCCG AGCGGCGGAT CGGCTCGGGG TTCTTCGCGC CGCGGATCGA GACGCAGCTC TTCAACGGCT ATGGCGATCA GGTGGCGCAG CTCTACGCCG GGCAGGACCT GTCGGTGGAT TTCTGA
|
Protein sequence | MRRLGWSDVT PRADWLNRRQ ILAGAGALGL AGPAFARIEA KASRFSTDEK PNSFEEISNY NNFYEFGLDK GDPAQNAGAL TVDPWSVEIG GLVERPGAYP LDDILKGVTL EERIYRLRCV EGWSMVVPWI GFELRTLLER AGVQPGARFV AFETLVRPEE MPGVRSRILD WPYREGLRID EAMHPLTILA TGLYGEEMPK QNGAPIRLVV PWKYGFKSIK SIVRISLVEK MPATSWNMQN AREYGFYSNV NPAVDHPRWS QASERRIGSG FFAPRIETQL FNGYGDQVAQ LYAGQDLSVD F
|
| |