Gene RPD_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1077 
Symbol 
ID4021553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1229641 
End bp1231017 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content66% 
IMG OID637961269 
Productnitrogenase molybdenum-cofactor biosynthesis protein NifN 
Protein accessionYP_568216 
Protein GI91975557 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGG TGGTGACATC GACAAAATCC TGCACCGTCA ATCCGTTGCG GATGAGCCAG 
CCGCTCGGCG CGGCGCTGGC CTTCATGGGG CTGCGCAATT CGATGCCGCT GCTGCACGGC
TCGCAGGGCT GCACCTCGTT CGGCCTGGTG CTGTTCGTGC GCCATTTCCG CGAACAGATT
CCGCTGCAGA CCACGGCGAT GAGCGAAGTC GCGACCGTGC TCGGCGGCTT CGAGAATGTC
GAACAGGCGA TCGTCAACAT CGTCGGCCGC ACCAAGCCTG ACGTGATCGG CATCTGCACC
ACCGGCGTGA CAGAGATCAA GGGCGACGAC CTCGACGGCT TCATCAAGCT GGTGCGCGGC
AAGCATCCCG AACTTGCGAA TGTGGCGCTG GTGCCGGTGT CGACGCCCGA CTTCAAGGGC
GCATTCGAGG ACGGCTTCGC GACGACGGTC GCGAAGATCG TCGAGACACT GGTCGAGGCG
CCAGCGGCGG GCGTTGGGCG CGATCCGGCG AAGCTCAACG TGCTGGCGGG CAGCCATCTG
ACGCCCGGCG ATATCGACGA ACTTCGTGAC ATCATCGAGG CGTTCGGCCT CGTGCCGACG
TTTCTTCCCG ACATTTCCGG CTCGCTCGAT GGCCATCTGC CGGAGGACTT CACCCCGACC
ACCCATGGCG GCGTGTCGGT GGCCGAGGTC GCGGCGATGG GGCGCGCGGC GCACACGCTC
GCGCTCGGCG AACAGATGCG CAAGGCGGCG GCCGCGCTCG AGGCCAAAGT CGGCGTGCCG
TTCACGCTGC TGCAGCGCCT CACCGGGCTT GCGCCGAGCG ACGAATTGAT GGCGACGCTG
GCGCGGATCA GCGGCCGCCC GGTGCCGCCG AAGTATCGCC GCCAACGCAG CCAGCTCGTC
GACGCCATGC TCGACGGCCA CTTCTATTTC GGCGGCAAGA GAATCGCGAT CGGCGCCGAG
CCCGACATGC TGCTCAATAT CGGCGGCTGG CTCGCCGACA TGGGGTGCAC CATTGCTGCT
GCGGTGACCA CGACGCACTC GCCGGCGCTC GCGCAGGCGC CGTCTGACGA CGTGCTGATT
GGTGATCTGG AGGATCTGGA GCAACGCGCT GAGGATTGCG ATCTGCTGGT GACGCATTCG
CATGGTCGTC AGGCAGCGGA ACGGCTCGGC GTTCCGCTGT TCCGCGTCGG GCTGCCGATG
TTCGACCGGC TCGGCGCCGC GCATCAGGTC GCGGTCGGCT ATCGCGGCAC CCGCGATCTG
ATCTTCGCGA TCGGAAATCT GTTCATTTCC AACATCAAGG AACCGGACGT CGACACTTGG
CGCAGCACCG CTGCTGGCGG TCCGGATCAA GTCGATGCGT CGGTTACGAC TCATTAA
 
Protein sequence
MAKVVTSTKS CTVNPLRMSQ PLGAALAFMG LRNSMPLLHG SQGCTSFGLV LFVRHFREQI 
PLQTTAMSEV ATVLGGFENV EQAIVNIVGR TKPDVIGICT TGVTEIKGDD LDGFIKLVRG
KHPELANVAL VPVSTPDFKG AFEDGFATTV AKIVETLVEA PAAGVGRDPA KLNVLAGSHL
TPGDIDELRD IIEAFGLVPT FLPDISGSLD GHLPEDFTPT THGGVSVAEV AAMGRAAHTL
ALGEQMRKAA AALEAKVGVP FTLLQRLTGL APSDELMATL ARISGRPVPP KYRRQRSQLV
DAMLDGHFYF GGKRIAIGAE PDMLLNIGGW LADMGCTIAA AVTTTHSPAL AQAPSDDVLI
GDLEDLEQRA EDCDLLVTHS HGRQAAERLG VPLFRVGLPM FDRLGAAHQV AVGYRGTRDL
IFAIGNLFIS NIKEPDVDTW RSTAAGGPDQ VDASVTTH