Gene RPD_1084 details

Gene Information

Locus tag: RPD_1084
Symbol:
ID: 4021560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1234037
End bp: 1235032
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 66%
IMG OID: 637961276
Product: nitrogen-fixing NifU-like
Protein accession: YP_568223
Protein GI: 91975564
COG category: [C] Energy production and conversion
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0694] Thioredoxin-like proteins and domains
        [COG0822] NifU homolog involved in Fe-S cluster formation
TIGRFAM ID: [TIGR02000] Fe-S cluster assembly protein NifU
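
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the 1-based, inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 1234037
END_BP = 1235032


def gene_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "996 331"
```

Under those assumptions the result matches the listed gene length of 996 bp and protein length of 331 aa.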


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA ATCTCGATAG GCTCGACGAA CACATCTCCA GCCCGCGCAA TGCCGGTGTG 
CTGCCGCACG CCAATGCGGT CGGCTCGTTC GGCGCCATCC GCTGGGGTGA CGCGGTCAAA
CTGATGCTGC AAGTCGATCC GCGAACCGAT CGGATCGAAC AGGCGCGGTT TCAGGCCTTC
GGGTGCAGCT CGTCGATCGC ATCGTCCTCG GCGGTTACCG AGATGATCAC CGGCAGAACA
CTCGACGAAG CCGTCGGGAT CAGCGCTGCC GATATCGCGG ATTATCTCGG CGGCCTGCCG
CCGGAACGCA TGTACTGCGC GGTGATGACC TATGAGGCCC TGCAGAAGGC GATCGGGTCG
TATCGCGGCG AGGCCGAGCT GAGCGAAGCC GATGCGGCGC CGTCCTGCAA ATGCCTCGGC
GTCAGCCAGA TGATGATCGA GCGCACCATC CGTTTCAATC GGCTGACCAG CGTCGAGCAG
GTGACCCACC ACACCAAGGC GGCCGGCAGT TGCAGCGCCT GTTTCAAGCA GGTCGAAGGC
CTGCTGGCGC GGGTCAATGC CGAGATGGCG GAGGATGGGC TGATCGGGCC CGGCGACGCC
TATCAGCTCG GTTCGACGTC GCCGCGCGCG ATCGATCTGA AGCCGCACGG CGCGCCGCAG
CCGGCGACCA ATATCTTTGC CGCCAAGGCC GCGCCGGCGC ATCTGCGCGC CGCGCCGAAG
AGCGCGCCGT CGCGTCCGGC ACCTGCGCCT GCCGCGGTCG GGGTCGATGC GCCGTCGCAG
ACGACGCTGA TTGCCGAAGC GCTCGACGAG CTGCGGCCGC ATTTGAAGCG CGATGGCGGC
GACTGCGAAC TCGTCAATGT CGAGGGCAAT GTCGTTTACG TCAGGCTGTC GGGCAATTGC
GTCGGCTGCC AATTGTCATC GCTGACGCTG TCCGGCGTTC AGGCCAGGCT CGCCGACAGG
CTCGGCCGGC CGCTGCGTGT GGTGCCTGTG CCATGA
 
Protein sequence
MLDNLDRLDE HISSPRNAGV LPHANAVGSF GAIRWGDAVK LMLQVDPRTD RIEQARFQAF 
GCSSSIASSS AVTEMITGRT LDEAVGISAA DIADYLGGLP PERMYCAVMT YEALQKAIGS
YRGEAELSEA DAAPSCKCLG VSQMMIERTI RFNRLTSVEQ VTHHTKAAGS CSACFKQVEG
LLARVNAEMA EDGLIGPGDA YQLGSTSPRA IDLKPHGAPQ PATNIFAAKA APAHLRAAPK
SAPSRPAPAP AAVGVDAPSQ TTLIAEALDE LRPHLKRDGG DCELVNVEGN VVYVRLSGNC
VGCQLSSLTL SGVQARLADR LGRPLRVVPV P
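
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, of how the blocks could be cleaned and a GC percentage computed. The helper names are hypothetical, and only the first row of the gene sequence is used here, so the figure it prints is not the whole-gene GC content listed in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCTCGACA ATCTCGATAG GCTCGACGAA CACATCTCCA GCCCGCGCAA TGCCGGTGTG"

seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions would give a value to compare against the listed length of 996 bp and GC content of 66%.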