Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1077 |
Symbol | |
ID | 4021553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1229641 |
End bp | 1231017 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961269 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifN |
Protein accession | YP_568216 |
Protein GI | 91975557 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGG TGGTGACATC GACAAAATCC TGCACCGTCA ATCCGTTGCG GATGAGCCAG CCGCTCGGCG CGGCGCTGGC CTTCATGGGG CTGCGCAATT CGATGCCGCT GCTGCACGGC TCGCAGGGCT GCACCTCGTT CGGCCTGGTG CTGTTCGTGC GCCATTTCCG CGAACAGATT CCGCTGCAGA CCACGGCGAT GAGCGAAGTC GCGACCGTGC TCGGCGGCTT CGAGAATGTC GAACAGGCGA TCGTCAACAT CGTCGGCCGC ACCAAGCCTG ACGTGATCGG CATCTGCACC ACCGGCGTGA CAGAGATCAA GGGCGACGAC CTCGACGGCT TCATCAAGCT GGTGCGCGGC AAGCATCCCG AACTTGCGAA TGTGGCGCTG GTGCCGGTGT CGACGCCCGA CTTCAAGGGC GCATTCGAGG ACGGCTTCGC GACGACGGTC GCGAAGATCG TCGAGACACT GGTCGAGGCG CCAGCGGCGG GCGTTGGGCG CGATCCGGCG AAGCTCAACG TGCTGGCGGG CAGCCATCTG ACGCCCGGCG ATATCGACGA ACTTCGTGAC ATCATCGAGG CGTTCGGCCT CGTGCCGACG TTTCTTCCCG ACATTTCCGG CTCGCTCGAT GGCCATCTGC CGGAGGACTT CACCCCGACC ACCCATGGCG GCGTGTCGGT GGCCGAGGTC GCGGCGATGG GGCGCGCGGC GCACACGCTC GCGCTCGGCG AACAGATGCG CAAGGCGGCG GCCGCGCTCG AGGCCAAAGT CGGCGTGCCG TTCACGCTGC TGCAGCGCCT CACCGGGCTT GCGCCGAGCG ACGAATTGAT GGCGACGCTG GCGCGGATCA GCGGCCGCCC GGTGCCGCCG AAGTATCGCC GCCAACGCAG CCAGCTCGTC GACGCCATGC TCGACGGCCA CTTCTATTTC GGCGGCAAGA GAATCGCGAT CGGCGCCGAG CCCGACATGC TGCTCAATAT CGGCGGCTGG CTCGCCGACA TGGGGTGCAC CATTGCTGCT GCGGTGACCA CGACGCACTC GCCGGCGCTC GCGCAGGCGC CGTCTGACGA CGTGCTGATT GGTGATCTGG AGGATCTGGA GCAACGCGCT GAGGATTGCG ATCTGCTGGT GACGCATTCG CATGGTCGTC AGGCAGCGGA ACGGCTCGGC GTTCCGCTGT TCCGCGTCGG GCTGCCGATG TTCGACCGGC TCGGCGCCGC GCATCAGGTC GCGGTCGGCT ATCGCGGCAC CCGCGATCTG ATCTTCGCGA TCGGAAATCT GTTCATTTCC AACATCAAGG AACCGGACGT CGACACTTGG CGCAGCACCG CTGCTGGCGG TCCGGATCAA GTCGATGCGT CGGTTACGAC TCATTAA
|
Protein sequence | MAKVVTSTKS CTVNPLRMSQ PLGAALAFMG LRNSMPLLHG SQGCTSFGLV LFVRHFREQI PLQTTAMSEV ATVLGGFENV EQAIVNIVGR TKPDVIGICT TGVTEIKGDD LDGFIKLVRG KHPELANVAL VPVSTPDFKG AFEDGFATTV AKIVETLVEA PAAGVGRDPA KLNVLAGSHL TPGDIDELRD IIEAFGLVPT FLPDISGSLD GHLPEDFTPT THGGVSVAEV AAMGRAAHTL ALGEQMRKAA AALEAKVGVP FTLLQRLTGL APSDELMATL ARISGRPVPP KYRRQRSQLV DAMLDGHFYF GGKRIAIGAE PDMLLNIGGW LADMGCTIAA AVTTTHSPAL AQAPSDDVLI GDLEDLEQRA EDCDLLVTHS HGRQAAERLG VPLFRVGLPM FDRLGAAHQV AVGYRGTRDL IFAIGNLFIS NIKEPDVDTW RSTAAGGPDQ VDASVTTH
|
| |