Gene Rpal_5097 details


Gene Information

Locus tag: Rpal_5097
Symbol:
ID: 6412791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5480683
End bp: 5482056
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 66%
IMG OID: 642714982
Product: nitrogenase molybdenum-cofactor biosynthesis protein NifN
Protein accession: YP_001994061
Protein GI: 192293456
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN
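
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions agree with the values listed in this record). The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 5480683
END_BP = 5482056


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1374
    print(protein_length(length))  # 457
```

Both results match the Gene Length and Protein Length fields.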


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACAGA TCGTCACCTC GACCAAGTCC TGCACCGTCA ACCCGCTGCG GATGAGTCAG 
CCGCTCGGCG CCGCGCTGGC CTTCATGGGG CTGCGCAACT GCATGCCGCT GCTGCACGGC
TCCCAGGGCT GCACCTCGTT CGGCCTCGTG CTGTTCGTTC GCCACTTCCG CGAGTCGATC
CCGCTGCAGA CCACCGCGAT GAGCGAAGTC GCCACCGTGC TCGGCGGCTT TGAGAACGTC
GAGCAGGCTA TCGTCAACAT CGTCGGCCGC ACCAAGCCCG ACGTGATCGG GATCTGCACC
ACCGGCGTCA CCGAGATCAA AGGCGACGAT CTCGACGGCT ACATCAAGAT GGTGCGGGCC
AATCATCCGG AACTGGCGAA CGTCGCGCTG GTGCCGGTGT CGACGCCCGA CTTCAAAGGT
GCGTTCGAAG ATGGTTTCGC CGCCACGGTG ACGCGGATCG TCGAGACCCT GGTTGAGACA
CCGGCTGAAG GCGCCGCGCC GGATACCGAC AGGATCAACG TGTTGGCCGG CAGCCATCTG
ACGCCGGGCG ACATTGATGA GCTGCGCGAC ATCATCGAGG CGTTCGGCCT GGTGCCGACC
TTCCTGCCGG ATATCTCCGG CTCGCTCGAC GGGCATGTGC CGGATGATTT CACCCCGACG
ACGCATGGCG GCGTCTCGGT GGCCGAAGTC GTCGCGATGG GCGGCGCGGG CCACACGCTG
GCGTTCGGCG AGCAGATGCG CAAAGCCGCA GCCGCGCTCG AAGCCAAGGC CGGTGTGCCG
TTCACGCTGC TGTCGCGGGT CACCGGGCTT GCGGCGGCTG ATGAGCTGAT GGCGACGTTG
GCCAAGATCA GCGGCCGGCC GGTGCCGCCG AAATATCGCC GGCAGCGCAG CCAGCTGGTC
GACGCCATGC TCGACGGCCA CTTCTATTTC GGTGGCAAGA GCGTTGCAAT CGGCGCCGAG
CCGGACATGC TGCTGAATAT CGGCGGCTGG CTCGCCGATA TGGGCTGTAC CGTCAGCGCC
GCGGTGACGA CCACTACGTC GCCGAGCCTG GCGCAGGTGC CAAGCGACGA GGTGCTGATC
GGCGATCTCG AAGATCTCGA ACGCCGTGCC GAAGATTGCG ATCTATTGGT GACGCATTCG
CACGGCCGTC AGGCCGCGGA GCGCCTGAGC GTGCCGCTGT TCCGGATGGG CCTGCCGATG
TTCGACCGGC TTGGTGCCGC GCATCAGGTC GCAGTCGGCT ATCGCGGCAC CCGCGATCTG
ATCTTCGCGA TCGGCAATTT GTTCATCGCC AACATCAAGG AGCCGGACGT GAACAGCTGG
CGTAGTGCCT CTGCTTGCCC GGACCAGACC GATGCGCCGG CTAAGGCTCA TTAG
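
A GC percentage like the one reported in the GC content field can be recomputed from a sequence laid out in 10-base groups as above. This is a small sketch; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T once whitespace is ignored.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence; paste the full sequence to check the whole gene.
    first_row = "ATGGCACAGA TCGTCACCTC GACCAAGTCC TGCACCGTCA ACCCGCTGCG GATGAGTCAG"
    print(round(gc_percent(first_row)))
```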
 
Protein sequence
MAQIVTSTKS CTVNPLRMSQ PLGAALAFMG LRNCMPLLHG SQGCTSFGLV LFVRHFRESI 
PLQTTAMSEV ATVLGGFENV EQAIVNIVGR TKPDVIGICT TGVTEIKGDD LDGYIKMVRA
NHPELANVAL VPVSTPDFKG AFEDGFAATV TRIVETLVET PAEGAAPDTD RINVLAGSHL
TPGDIDELRD IIEAFGLVPT FLPDISGSLD GHVPDDFTPT THGGVSVAEV VAMGGAGHTL
AFGEQMRKAA AALEAKAGVP FTLLSRVTGL AAADELMATL AKISGRPVPP KYRRQRSQLV
DAMLDGHFYF GGKSVAIGAE PDMLLNIGGW LADMGCTVSA AVTTTTSPSL AQVPSDEVLI
GDLEDLERRA EDCDLLVTHS HGRQAAERLS VPLFRMGLPM FDRLGAAHQV AVGYRGTRDL
IFAIGNLFIA NIKEPDVNSW RSASACPDQT DAPAKAH
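
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below assumes the listed gene sequence is the coding strand read in frame from its first base. It uses the standard codon assignments, which translation table 11 shares for elongation; table 11 differs in the alternative start codons it allows, which does not matter here because this gene starts with ATG. The `translate` function is illustrative only.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_row = "ATGGCACAGA TCGTCACCTC GACCAAGTCC TGCACCGTCA ACCCGCTGCG GATGAGTCAG"
    last_row = "CGTAGTGCCT CTGCTTGCCC GGACCAGACC GATGCGCCGG CTAAGGCTCA TTAG"
    print(translate(first_row))  # MAQIVTSTKSCTVNPLRMSQ
    print(translate(last_row))   # RSASACPDQTDAPAKAH
```

The two outputs match the first 20 and last 17 residues of the protein sequence; the last row can be translated on its own because the 1320 bases before it are a multiple of three.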