Gene Information |
Locus tag | RPC_4452 |
Symbol | |
ID | 3973078 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4958915 |
End bp | 4959931 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927563 |
Product | nitrogen-fixing NifU-like |
Protein accession | YP_534294 |
Protein GI | 90425924 |
COG category | [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0694] Thioredoxin-like proteins and domains; [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
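The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record for RPC_4452.
length = gene_length_bp(4958915, 4959931)
print(length)                     # 1017
print(protein_length_aa(length))  # 338
```

The strand field does not enter this calculation: the length is the same on either strand, and only the reading direction differs.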
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACGA TGCAGCACGA CAGGTTCGGC GAACATTTTG CCAACCCGCG CAATGTCGGC GTGCTGGCGC AGGCCAATGC GGTCGGCTCG GTCGGCACCA TGGGGTGGGG CGACGCAGTC AAGCTGATGC TGCAGATCGA TCCGGTCACC GACCGGATCG AGCAGGCCCG GTTCCAGACC TTCGGCTGCT CATCGGCGAT CGCCTCGTCC TCGGCGATCA CCGAGCTGAT CACCGGCAAG ACCACCGACG AGGCGCTGGA GATCTCCGCC GCTGACGTCG TTGAATTCCT CGGCGGGCTA CCGGCGGAGC GGATGTACTG TTCGGTGATG ACGTACGAAG CCGTACAAAA TGCCATCGCC GACTATCGCC GGCACGGCGC GCCGGCCCCC GCCACGGACT CCGCGGTGAT CTGCAAATGC TTCGGCGTCA CCCAGGCGAT GGCCGAGCGC ACCATCCGCA TCAATCATCT CACCGATCCG CACCAGGTGA CGTTTCATAC CAAGGCCGGC GGCGGCTGTT TCAGTTGCTA CAAGCAGATC GAAACCGTGC TGGCGCGGGT CAATGCCGAC ATGGTCGCCG AAGGCCATCT TTCGCCGCGG CAGGCCTACC GGCTGGGCTC GGTGCCGCCG TCGAGCGAAG CATTGAAGCC GCGCGGCGAC GCGCCGCCTC CCTTGGGCTC CGGCGCCCGC GCCAATATTC CCGGCCATAT CGCGATTCCG CCGCGCCCGG CGCCGACTTC GGCAGCACCC CGCCCGGTGC TGCCGCCGTC GGAAAATCGC GACCGCGGCA CGCCCGAACA ATTCGAACTG ATCAAGCAGG CGGTCGAAGC GTTGCGCCCG CATCTGCAGC GCGACGGCGG CGATTGCGAG CTGGTCGATG TCGACGGCAA CACCATCTAT CTGCGGCTGT CGGGCAATTG CGTCGATTGT CAGTTGGCGT CGGTGACGCT GTCCGGCGTG CAGGCGCAGC TCGCCGAGAA GCTGCAGCGC CCGGTGCGCG TGGTGCCGGT GTCATGA
|
Protein sequence | MPTMQHDRFG EHFANPRNVG VLAQANAVGS VGTMGWGDAV KLMLQIDPVT DRIEQARFQT FGCSSAIASS SAITELITGK TTDEALEISA ADVVEFLGGL PAERMYCSVM TYEAVQNAIA DYRRHGAPAP ATDSAVICKC FGVTQAMAER TIRINHLTDP HQVTFHTKAG GGCFSCYKQI ETVLARVNAD MVAEGHLSPR QAYRLGSVPP SSEALKPRGD APPPLGSGAR ANIPGHIAIP PRPAPTSAAP RPVLPPSENR DRGTPEQFEL IKQAVEALRP HLQRDGGDCE LVDVDGNTIY LRLSGNCVDC QLASVTLSGV QAQLAEKLQR PVRVVPVS
|
| |
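The sequences above are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting and recompute simple statistics such as length and GC content from the gene sequence. It is an illustration only: the helper names are hypothetical, and the stop codon set is the one shared by translation tables 1 and 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Usage on the first two and last display blocks of the gene sequence.
head = clean_sequence("ATGCCGACGA TGCAGCACGA")
print(len(head))                     # 20
print(gc_fraction("ATGCCGACGA"))     # 0.6
print(ends_with_stop("GTCATGA"))     # True
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` allows the reported GC content to be checked against the sequence itself.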