Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5097 |
Symbol | |
ID | 6412791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5480683 |
End bp | 5482056 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714982 |
Product | nitrogenase molybdenum-cofactor biosynthesis protein NifN |
Protein accession | YP_001994061 |
Protein GI | 192293456 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01285] nitrogenase molybdenum-iron cofactor biosynthesis protein NifN |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACAGA TCGTCACCTC GACCAAGTCC TGCACCGTCA ACCCGCTGCG GATGAGTCAG CCGCTCGGCG CCGCGCTGGC CTTCATGGGG CTGCGCAACT GCATGCCGCT GCTGCACGGC TCCCAGGGCT GCACCTCGTT CGGCCTCGTG CTGTTCGTTC GCCACTTCCG CGAGTCGATC CCGCTGCAGA CCACCGCGAT GAGCGAAGTC GCCACCGTGC TCGGCGGCTT TGAGAACGTC GAGCAGGCTA TCGTCAACAT CGTCGGCCGC ACCAAGCCCG ACGTGATCGG GATCTGCACC ACCGGCGTCA CCGAGATCAA AGGCGACGAT CTCGACGGCT ACATCAAGAT GGTGCGGGCC AATCATCCGG AACTGGCGAA CGTCGCGCTG GTGCCGGTGT CGACGCCCGA CTTCAAAGGT GCGTTCGAAG ATGGTTTCGC CGCCACGGTG ACGCGGATCG TCGAGACCCT GGTTGAGACA CCGGCTGAAG GCGCCGCGCC GGATACCGAC AGGATCAACG TGTTGGCCGG CAGCCATCTG ACGCCGGGCG ACATTGATGA GCTGCGCGAC ATCATCGAGG CGTTCGGCCT GGTGCCGACC TTCCTGCCGG ATATCTCCGG CTCGCTCGAC GGGCATGTGC CGGATGATTT CACCCCGACG ACGCATGGCG GCGTCTCGGT GGCCGAAGTC GTCGCGATGG GCGGCGCGGG CCACACGCTG GCGTTCGGCG AGCAGATGCG CAAAGCCGCA GCCGCGCTCG AAGCCAAGGC CGGTGTGCCG TTCACGCTGC TGTCGCGGGT CACCGGGCTT GCGGCGGCTG ATGAGCTGAT GGCGACGTTG GCCAAGATCA GCGGCCGGCC GGTGCCGCCG AAATATCGCC GGCAGCGCAG CCAGCTGGTC GACGCCATGC TCGACGGCCA CTTCTATTTC GGTGGCAAGA GCGTTGCAAT CGGCGCCGAG CCGGACATGC TGCTGAATAT CGGCGGCTGG CTCGCCGATA TGGGCTGTAC CGTCAGCGCC GCGGTGACGA CCACTACGTC GCCGAGCCTG GCGCAGGTGC CAAGCGACGA GGTGCTGATC GGCGATCTCG AAGATCTCGA ACGCCGTGCC GAAGATTGCG ATCTATTGGT GACGCATTCG CACGGCCGTC AGGCCGCGGA GCGCCTGAGC GTGCCGCTGT TCCGGATGGG CCTGCCGATG TTCGACCGGC TTGGTGCCGC GCATCAGGTC GCAGTCGGCT ATCGCGGCAC CCGCGATCTG ATCTTCGCGA TCGGCAATTT GTTCATCGCC AACATCAAGG AGCCGGACGT GAACAGCTGG CGTAGTGCCT CTGCTTGCCC GGACCAGACC GATGCGCCGG CTAAGGCTCA TTAG
|
Protein sequence | MAQIVTSTKS CTVNPLRMSQ PLGAALAFMG LRNCMPLLHG SQGCTSFGLV LFVRHFRESI PLQTTAMSEV ATVLGGFENV EQAIVNIVGR TKPDVIGICT TGVTEIKGDD LDGYIKMVRA NHPELANVAL VPVSTPDFKG AFEDGFAATV TRIVETLVET PAEGAAPDTD RINVLAGSHL TPGDIDELRD IIEAFGLVPT FLPDISGSLD GHVPDDFTPT THGGVSVAEV VAMGGAGHTL AFGEQMRKAA AALEAKAGVP FTLLSRVTGL AAADELMATL AKISGRPVPP KYRRQRSQLV DAMLDGHFYF GGKSVAIGAE PDMLLNIGGW LADMGCTVSA AVTTTTSPSL AQVPSDEVLI GDLEDLERRA EDCDLLVTHS HGRQAAERLS VPLFRMGLPM FDRLGAAHQV AVGYRGTRDL IFAIGNLFIA NIKEPDVNSW RSASACPDQT DAPAKAH
|
| |