Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1084 |
Symbol | |
ID | 4021560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1234037 |
End bp | 1235032 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961276 |
Product | nitrogen-fixing NifU-like |
Protein accession | YP_568223 |
Protein GI | 91975564 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0694] Thioredoxin-like proteins and domains [COG0822] NifU homolog involved in Fe-S cluster formation |
TIGRFAM ID | [TIGR02000] Fe-S cluster assembly protein NifU |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACA ATCTCGATAG GCTCGACGAA CACATCTCCA GCCCGCGCAA TGCCGGTGTG CTGCCGCACG CCAATGCGGT CGGCTCGTTC GGCGCCATCC GCTGGGGTGA CGCGGTCAAA CTGATGCTGC AAGTCGATCC GCGAACCGAT CGGATCGAAC AGGCGCGGTT TCAGGCCTTC GGGTGCAGCT CGTCGATCGC ATCGTCCTCG GCGGTTACCG AGATGATCAC CGGCAGAACA CTCGACGAAG CCGTCGGGAT CAGCGCTGCC GATATCGCGG ATTATCTCGG CGGCCTGCCG CCGGAACGCA TGTACTGCGC GGTGATGACC TATGAGGCCC TGCAGAAGGC GATCGGGTCG TATCGCGGCG AGGCCGAGCT GAGCGAAGCC GATGCGGCGC CGTCCTGCAA ATGCCTCGGC GTCAGCCAGA TGATGATCGA GCGCACCATC CGTTTCAATC GGCTGACCAG CGTCGAGCAG GTGACCCACC ACACCAAGGC GGCCGGCAGT TGCAGCGCCT GTTTCAAGCA GGTCGAAGGC CTGCTGGCGC GGGTCAATGC CGAGATGGCG GAGGATGGGC TGATCGGGCC CGGCGACGCC TATCAGCTCG GTTCGACGTC GCCGCGCGCG ATCGATCTGA AGCCGCACGG CGCGCCGCAG CCGGCGACCA ATATCTTTGC CGCCAAGGCC GCGCCGGCGC ATCTGCGCGC CGCGCCGAAG AGCGCGCCGT CGCGTCCGGC ACCTGCGCCT GCCGCGGTCG GGGTCGATGC GCCGTCGCAG ACGACGCTGA TTGCCGAAGC GCTCGACGAG CTGCGGCCGC ATTTGAAGCG CGATGGCGGC GACTGCGAAC TCGTCAATGT CGAGGGCAAT GTCGTTTACG TCAGGCTGTC GGGCAATTGC GTCGGCTGCC AATTGTCATC GCTGACGCTG TCCGGCGTTC AGGCCAGGCT CGCCGACAGG CTCGGCCGGC CGCTGCGTGT GGTGCCTGTG CCATGA
|
Protein sequence | MLDNLDRLDE HISSPRNAGV LPHANAVGSF GAIRWGDAVK LMLQVDPRTD RIEQARFQAF GCSSSIASSS AVTEMITGRT LDEAVGISAA DIADYLGGLP PERMYCAVMT YEALQKAIGS YRGEAELSEA DAAPSCKCLG VSQMMIERTI RFNRLTSVEQ VTHHTKAAGS CSACFKQVEG LLARVNAEMA EDGLIGPGDA YQLGSTSPRA IDLKPHGAPQ PATNIFAAKA APAHLRAAPK SAPSRPAPAP AAVGVDAPSQ TTLIAEALDE LRPHLKRDGG DCELVNVEGN VVYVRLSGNC VGCQLSSLTL SGVQARLADR LGRPLRVVPV P
|
| |