Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0973 |
Symbol | |
ID | 3909328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1120829 |
End bp | 1122205 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882866 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifN |
Protein accession | YP_484594 |
Protein GI | 86748098 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.41646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGGA TCGTGACATC GACCAAGTCC TGCACCGTCA ATCCGCTGCG GATGAGTCGG CCGCTCGGCG CGGCGCTGGC CTTGATGGGG CTGCGCAATG CGATGCCGCT GCTGCACGGC TCGCAGGGCT GCACCTCGTT CGGCCTGGTG CTGTTCGTGC GGCATTTCCG CGAACAGATC CCGATGCAGA CCACCGCGAT GAGCGAAGTC GCCACCGTGC TCGGCGGCTT CGAGAATGTC GAGCAGGCGA TCGTCAACAT CGTCGGCCGC ACCCAGCCGG ACGTGATCGG GATCTGCACC ACGGGCGTCA CCGAGATCAA GGGCGACGAC CTCGACGGCT TCATCAAGGA CGTCCGCCGC AAGCATCCCG AACTCGCGCA TGTCGCGCTG GTGCCGGTGT CGACGCCGGA CTTCAAGGGC GCGTTCGAGG ACGGCTTCGC CAGCACCGTG GCGAAGATCG TCGAGCTGCT GGTCGAGGCG CCAGCGCCGG GCGCCGCGCG CGATCCGGCG CGGCTCAACG TGCTGGCCGG CAGCCATCTG ACGCCGGGCG ATATCGACGA GCTGCGCGAC GTCATCGAGG CGTTCGGCCT GGTGCCGACC TTCCTGCCGG ACATTTCCGG CTCGCTCGAC GGCCATCTGC CGGACGACTT CACCCCGACC ACCCATGGCG GCGTCTCGGT CCCCGAGGTC GCGGCGATGG GCGGCGCGGC GCATACGCTG GCGCTCGGCG AGCAGATGCG CAAGGCCGCG GCCGCGCTCG AGGCCAAGGC CGGCGTGCCG TTCACGCTGC TGCGGCGGCT CACCGGGCTC GCGGCCGGCG ACGAACTGAT GGCGACGCTG GCCAAGATCA GCGGCCGGCC GGTGCCGCCG AAATATCGCC GGCAGCGCAG CCAGCTGGTC GACGCCATGC TCGACGGCCA CTTCTATTTC GGCGGCAAGC AGGTCGCGAT CGGCGCCGAG CCGGACATGC TGCTGAATAT CGGCGGCTGG CTCGCCGACA TGGGCTGCAC GATCGAGGCT GCGGTAACGA CGACCAACTC GCAGGCGCTT TCGCAGGTGC CGGCCGACGA GGTGCTGATC GGCGATCTGG AAGATCTGGA GAGCCGCGCC GAGGAGTGCG ATCTGCTGCT GACGCATTCG CACGGCAGGC AAGCCGCCGA GCGGCTCGGC GTGCCGCTGT TCCGCGTCGG CATTCCGATG TTCGATCGGC TCGGCGCCGC GCATCAGGTC GTGGTCGGCT ATCGCGGCAG CCGCGATCTG ATCTTTGCGA TCGGCAATCT GTTCATCGCC GCCATCAAGG AACCGCATGT CGACGACTGG CGCAACGCCG CGATCGGCGA TCGGGATCAG GTCGATGCGG CGGCTACGGC TCATTAG
|
Protein sequence | MARIVTSTKS CTVNPLRMSR PLGAALALMG LRNAMPLLHG SQGCTSFGLV LFVRHFREQI PMQTTAMSEV ATVLGGFENV EQAIVNIVGR TQPDVIGICT TGVTEIKGDD LDGFIKDVRR KHPELAHVAL VPVSTPDFKG AFEDGFASTV AKIVELLVEA PAPGAARDPA RLNVLAGSHL TPGDIDELRD VIEAFGLVPT FLPDISGSLD GHLPDDFTPT THGGVSVPEV AAMGGAAHTL ALGEQMRKAA AALEAKAGVP FTLLRRLTGL AAGDELMATL AKISGRPVPP KYRRQRSQLV DAMLDGHFYF GGKQVAIGAE PDMLLNIGGW LADMGCTIEA AVTTTNSQAL SQVPADEVLI GDLEDLESRA EECDLLLTHS HGRQAAERLG VPLFRVGIPM FDRLGAAHQV VVGYRGSRDL IFAIGNLFIA AIKEPHVDDW RNAAIGDRDQ VDAAATAH
|
| |