Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0541 |
Symbol | |
ID | 3909580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 605872 |
End bp | 606867 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882429 |
Product | cobalt chelatase, pCobS small subunit |
Protein accession | YP_484163 |
Protein GI | 86747667 |
COG category | [R] General function prediction only |
COG ID | [COG0714] MoxR-like ATPases |
TIGRFAM ID | [TIGR01650] cobaltochelatase, CobS subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.33721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCG CCACGACCAA AGCCCAGGAG ATCACCGGCC TGCCCGACAT GAAGGTGTCG GTCCGCCAGG TCTTCGGCAT CGACAGCGAC CTCGAAGTTC CCGCCTTTTC CGAGGCCGAT CCGCATGTGC CCGATGTCGA CAGCGACTAT CGCTTCGACC GCGCCACCAC GCTCGCGATT CTCGCCGGCT TCGCCCGCAA TCGCCGCGTC ATGGTGACGG GCTTCCACGG CACCGGCAAA TCGACCCATA TCGAGCAGGT TGCGGCCCGG CTGAACTGGC CCTGCGTCCG CGTCAATCTC GACAGCCACA TCAGCCGCAT CGATCTGGTC GGCAAGGACT CGATCGTGGT CCGCGACGGC AAGCAGGTCA CCGAATTCCG CGACGGCATC CTGCCCTGGG CGTTGCAGCA CAATGTGGCA TTGGTGTTCG ACGAATACGA TGCCGGCCGC CCCGACGTGA TGTTCGTGAT CCAGCGCGTG CTGGAAGTCT CCGGCCGGCT GACGCTGCTC GACCAGAACA AGGTGATCAA GCCGCATCCG GCGTTCCGGC TGTTCGCCAC CGCCAACACC ATCGGCCTGG GTGATACCTC GGGCCTGTAT CACGGCACGC AGCAGATCAA CCAGGGCCAG ATGGACCGCT GGTCGATCGT CACCACGCTG AACTATCTGC CGCACGACGA GGAAGTCGAG ATCGTGCTCG CCAAGGCCAA GCACTATCGC AACCCGGAAG GGCGCGACAT CGTCAACAAG ATGGTGCGGC TCGCCGATCT CACCCGCAAC GCCTTCGCCA ACGGCGACCT GTCGACGGTG ATGAGCCCGC GCACGGTGAT CACCTGGTCG GAGAACGCCG AGATCTTCGG CGACATCGGC TTCGCGTTCC GCGTCACCTT CCTCAACAAA TGCGACGAGC TGGAGCGCCC GCTGGTGGCC GAGTTCTATC AGCGCTGCTT CAACGCCGAG CTGCCGGAAA GCTCGGTGAA TGTTGCGATG ACGTAA
|
Protein sequence | MTAATTKAQE ITGLPDMKVS VRQVFGIDSD LEVPAFSEAD PHVPDVDSDY RFDRATTLAI LAGFARNRRV MVTGFHGTGK STHIEQVAAR LNWPCVRVNL DSHISRIDLV GKDSIVVRDG KQVTEFRDGI LPWALQHNVA LVFDEYDAGR PDVMFVIQRV LEVSGRLTLL DQNKVIKPHP AFRLFATANT IGLGDTSGLY HGTQQINQGQ MDRWSIVTTL NYLPHDEEVE IVLAKAKHYR NPEGRDIVNK MVRLADLTRN AFANGDLSTV MSPRTVITWS ENAEIFGDIG FAFRVTFLNK CDELERPLVA EFYQRCFNAE LPESSVNVAM T
|
| |