Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0980 |
Symbol | |
ID | 3909335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1125207 |
End bp | 1126208 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882873 |
Product | nitrogen-fixing NifU-like protein |
Protein accession | YP_484601 |
Protein GI | 86748105 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0694] Thioredoxin-like proteins and domains [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.191022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGCA ATCTCGACAG GCTCGATGAA CTCGTCTCCA GTCCGCGCAA TGCCGGCGTG CTGCTGCAGG CCAACGCGAT CGGCGCCTTC GGCGGCATCC GCTGGGGCGA TGCGGTCAAG CTGATGCTGC GGGTCGATCC GGCGACCGAC CGGATCGAGC AGGCGCGGTT TCAGGCGTTC GGCTGCAGCT CGTCGATCGC CGCGGCGTCC GCGGTCACCG AACTGATCAC CGGCAAGACG CTGGACGACG CCGCGGCGCT CGGCGCGGCC GACATCGCCG ACGATCTCGG CGGCTTGCCG GCGGCGCGGA TGTATTGCGC GGTGATGGCC TACGAGGCGC TACAGACGGC GATCACGTCC TATCGGGGCA TTGCGGCGCT GCGCGAGGCC GACGCGGCGC CGTCGTGCAA GTGCCTCGGC GTCAGCCAGA TGATGATCGA GCGCACCATC CGCTTCAATC GCCTGACCAG CCTGGAACAG GTGACCCACT ACACCAAGGC GGCCGGGAGC TGCAGCTCCT GCTTCAAGCA GGTCGAAGGC CTGCTGGCGC GGGTTAATGC CGAGATGGTC GAGGACGGGC TGATCGACGC TGGCGCGGCG TATCAGCTCG GCTCGACGCA GCAGCGGGCC GTCGACCTGA AGCCGCACGG CGCGCCGCAG CCGGCGGCCA ACATCTTCTC CGGCAAAGCG GCGCCGGCGC ATCTGCGCGC GATGCCGAAG AGCCCGCCGC CGCGTCCGGC GACGGCGCAG GCGCCGGTCG CCGGGACCAT CGACGCGCTG CCGCTGACTT CTCTGGTGGC CGAAGCGCTG GAGGATTTGC GGCCGCATCT GCAGCGCGAC GGCGGCGATT GCGAACTCGT CAGCGTCGAG GGCAATGTCG TCTATGTCCG GCTGTCGGGC AATTGCGTCG GCTGCCAATT GTCATCGGTG ACGCTGTCCG GCGTCCAGGC CAGACTCGCC GACAAGCTCG GCCGGCCGCT GCGCGTGGTG CCGGTGTCAT GA
|
Protein sequence | MLGNLDRLDE LVSSPRNAGV LLQANAIGAF GGIRWGDAVK LMLRVDPATD RIEQARFQAF GCSSSIAAAS AVTELITGKT LDDAAALGAA DIADDLGGLP AARMYCAVMA YEALQTAITS YRGIAALREA DAAPSCKCLG VSQMMIERTI RFNRLTSLEQ VTHYTKAAGS CSSCFKQVEG LLARVNAEMV EDGLIDAGAA YQLGSTQQRA VDLKPHGAPQ PAANIFSGKA APAHLRAMPK SPPPRPATAQ APVAGTIDAL PLTSLVAEAL EDLRPHLQRD GGDCELVSVE GNVVYVRLSG NCVGCQLSSV TLSGVQARLA DKLGRPLRVV PVS
|
| |