Gene RPB_0973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0973 
Symbol 
ID3909328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1120829 
End bp1122205 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content69% 
IMG OID637882866 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_484594 
Protein GI86748098 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.41646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGGA TCGTGACATC GACCAAGTCC TGCACCGTCA ATCCGCTGCG GATGAGTCGG 
CCGCTCGGCG CGGCGCTGGC CTTGATGGGG CTGCGCAATG CGATGCCGCT GCTGCACGGC
TCGCAGGGCT GCACCTCGTT CGGCCTGGTG CTGTTCGTGC GGCATTTCCG CGAACAGATC
CCGATGCAGA CCACCGCGAT GAGCGAAGTC GCCACCGTGC TCGGCGGCTT CGAGAATGTC
GAGCAGGCGA TCGTCAACAT CGTCGGCCGC ACCCAGCCGG ACGTGATCGG GATCTGCACC
ACGGGCGTCA CCGAGATCAA GGGCGACGAC CTCGACGGCT TCATCAAGGA CGTCCGCCGC
AAGCATCCCG AACTCGCGCA TGTCGCGCTG GTGCCGGTGT CGACGCCGGA CTTCAAGGGC
GCGTTCGAGG ACGGCTTCGC CAGCACCGTG GCGAAGATCG TCGAGCTGCT GGTCGAGGCG
CCAGCGCCGG GCGCCGCGCG CGATCCGGCG CGGCTCAACG TGCTGGCCGG CAGCCATCTG
ACGCCGGGCG ATATCGACGA GCTGCGCGAC GTCATCGAGG CGTTCGGCCT GGTGCCGACC
TTCCTGCCGG ACATTTCCGG CTCGCTCGAC GGCCATCTGC CGGACGACTT CACCCCGACC
ACCCATGGCG GCGTCTCGGT CCCCGAGGTC GCGGCGATGG GCGGCGCGGC GCATACGCTG
GCGCTCGGCG AGCAGATGCG CAAGGCCGCG GCCGCGCTCG AGGCCAAGGC CGGCGTGCCG
TTCACGCTGC TGCGGCGGCT CACCGGGCTC GCGGCCGGCG ACGAACTGAT GGCGACGCTG
GCCAAGATCA GCGGCCGGCC GGTGCCGCCG AAATATCGCC GGCAGCGCAG CCAGCTGGTC
GACGCCATGC TCGACGGCCA CTTCTATTTC GGCGGCAAGC AGGTCGCGAT CGGCGCCGAG
CCGGACATGC TGCTGAATAT CGGCGGCTGG CTCGCCGACA TGGGCTGCAC GATCGAGGCT
GCGGTAACGA CGACCAACTC GCAGGCGCTT TCGCAGGTGC CGGCCGACGA GGTGCTGATC
GGCGATCTGG AAGATCTGGA GAGCCGCGCC GAGGAGTGCG ATCTGCTGCT GACGCATTCG
CACGGCAGGC AAGCCGCCGA GCGGCTCGGC GTGCCGCTGT TCCGCGTCGG CATTCCGATG
TTCGATCGGC TCGGCGCCGC GCATCAGGTC GTGGTCGGCT ATCGCGGCAG CCGCGATCTG
ATCTTTGCGA TCGGCAATCT GTTCATCGCC GCCATCAAGG AACCGCATGT CGACGACTGG
CGCAACGCCG CGATCGGCGA TCGGGATCAG GTCGATGCGG CGGCTACGGC TCATTAG
 
Protein sequence
MARIVTSTKS CTVNPLRMSR PLGAALALMG LRNAMPLLHG SQGCTSFGLV LFVRHFREQI 
PMQTTAMSEV ATVLGGFENV EQAIVNIVGR TQPDVIGICT TGVTEIKGDD LDGFIKDVRR
KHPELAHVAL VPVSTPDFKG AFEDGFASTV AKIVELLVEA PAPGAARDPA RLNVLAGSHL
TPGDIDELRD VIEAFGLVPT FLPDISGSLD GHLPDDFTPT THGGVSVPEV AAMGGAAHTL
ALGEQMRKAA AALEAKAGVP FTLLRRLTGL AAGDELMATL AKISGRPVPP KYRRQRSQLV
DAMLDGHFYF GGKQVAIGAE PDMLLNIGGW LADMGCTIEA AVTTTNSQAL SQVPADEVLI
GDLEDLESRA EECDLLLTHS HGRQAAERLG VPLFRVGIPM FDRLGAAHQV VVGYRGSRDL
IFAIGNLFIA AIKEPHVDDW RNAAIGDRDQ VDAAATAH