Gene RPB_0980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0980 
Symbol 
ID3909335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1125207 
End bp1126208 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID637882873 
Productnitrogen-fixing NifU-like protein 
Protein accessionYP_484601 
Protein GI86748105 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0694] Thioredoxin-like proteins and domains
[COG0822] NifU homolog involved in Fe-S cluster formation 
TIGRFAM ID[TIGR02000] Fe-S cluster assembly protein NifU 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.191022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCA ATCTCGACAG GCTCGATGAA CTCGTCTCCA GTCCGCGCAA TGCCGGCGTG 
CTGCTGCAGG CCAACGCGAT CGGCGCCTTC GGCGGCATCC GCTGGGGCGA TGCGGTCAAG
CTGATGCTGC GGGTCGATCC GGCGACCGAC CGGATCGAGC AGGCGCGGTT TCAGGCGTTC
GGCTGCAGCT CGTCGATCGC CGCGGCGTCC GCGGTCACCG AACTGATCAC CGGCAAGACG
CTGGACGACG CCGCGGCGCT CGGCGCGGCC GACATCGCCG ACGATCTCGG CGGCTTGCCG
GCGGCGCGGA TGTATTGCGC GGTGATGGCC TACGAGGCGC TACAGACGGC GATCACGTCC
TATCGGGGCA TTGCGGCGCT GCGCGAGGCC GACGCGGCGC CGTCGTGCAA GTGCCTCGGC
GTCAGCCAGA TGATGATCGA GCGCACCATC CGCTTCAATC GCCTGACCAG CCTGGAACAG
GTGACCCACT ACACCAAGGC GGCCGGGAGC TGCAGCTCCT GCTTCAAGCA GGTCGAAGGC
CTGCTGGCGC GGGTTAATGC CGAGATGGTC GAGGACGGGC TGATCGACGC TGGCGCGGCG
TATCAGCTCG GCTCGACGCA GCAGCGGGCC GTCGACCTGA AGCCGCACGG CGCGCCGCAG
CCGGCGGCCA ACATCTTCTC CGGCAAAGCG GCGCCGGCGC ATCTGCGCGC GATGCCGAAG
AGCCCGCCGC CGCGTCCGGC GACGGCGCAG GCGCCGGTCG CCGGGACCAT CGACGCGCTG
CCGCTGACTT CTCTGGTGGC CGAAGCGCTG GAGGATTTGC GGCCGCATCT GCAGCGCGAC
GGCGGCGATT GCGAACTCGT CAGCGTCGAG GGCAATGTCG TCTATGTCCG GCTGTCGGGC
AATTGCGTCG GCTGCCAATT GTCATCGGTG ACGCTGTCCG GCGTCCAGGC CAGACTCGCC
GACAAGCTCG GCCGGCCGCT GCGCGTGGTG CCGGTGTCAT GA
 
Protein sequence
MLGNLDRLDE LVSSPRNAGV LLQANAIGAF GGIRWGDAVK LMLRVDPATD RIEQARFQAF 
GCSSSIAAAS AVTELITGKT LDDAAALGAA DIADDLGGLP AARMYCAVMA YEALQTAITS
YRGIAALREA DAAPSCKCLG VSQMMIERTI RFNRLTSLEQ VTHYTKAAGS CSSCFKQVEG
LLARVNAEMV EDGLIDAGAA YQLGSTQQRA VDLKPHGAPQ PAANIFSGKA APAHLRAMPK
SPPPRPATAQ APVAGTIDAL PLTSLVAEAL EDLRPHLQRD GGDCELVSVE GNVVYVRLSG
NCVGCQLSSV TLSGVQARLA DKLGRPLRVV PVS