Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5101 |
Symbol | nifH |
ID | 6412795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5486753 |
End bp | 5487649 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714986 |
Product | nitrogenase reductase |
Protein accession | YP_001994065 |
Protein GI | 192293460 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1348] Nitrogenase subunit NifH (ATPase) |
TIGRFAM ID | [TIGR01287] nitrogenase iron protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACTTC GGCAAATCGC ATTCTACGGC AAGGGCGGCA TCGGCAAGTC GACCACCTCG CAGAACACGC TCGCGGCGCT GGTCGAGATG GGTCAGAAGA TCCTGATCGT CGGCTGCGAC CCCAAGGCGG ACTCCACCCG TCTGATCCTC AACACCAAGA TGCAGGACAC GGTGCTGAGC CTCGCCGCGG AAGCGGGTTC GGTGGAAGAC CTCGAACTCG AAGACGTGAT GAAGGTCGGC TACAAGGGCA TCAAGTGCAC CGAAGCCGGT GGTCCCGAGC CGGGCGTCGG TTGCGCCGGC CGCGGCGTCA TCACCGCGAT CAACTTCCTC GAAGAGAACG GCGCCTATGA GGACGTCGAC TACGTCTCCT ATGACGTGCT CGGCGACGTG GTCTGCGGCG GCTTCGCGAT GCCGATCCGC GAGAACAAGG CCCAGGAAAT CTACATCGTC ATGTCCGGCG AGATGATGGC GCTGTATGCC GCCAACAACA TCGCCAAGGG CATTCTGAAG TACGCCTCGT CGGGCGGCGT CCGCCTCGGC GGCCTGGTCT GCAACGAGCG CCAGACCGAT CGCGAGCTCG ATCTCGCCGA AGCGCTGGCC AAGCGGTTGA ACTCGCAGCT GATCCACTTC GTGCCGCGCG ACAATATCGT GCAACACGCC GAGCTGCGCC GCCAGACCGT GATCCAGTAC GCGCCCGACA GCCAGCAGGC CAAGGAGTAT CGGACGCTGG CCGAGAAGGT GCATGCCAAC GGCGGCAAGG GCACCATCCC GACCCCGATC ACCATGGAAG AGCTCGAACA GATGCTGCTC GACTTCGGCA TCATGAAGAC CGACGAGCAG GCGCTCGCCG AACTGCAGGC CAAGGAAGCC GCCAAGGCGG CCGCCGCGTC CGCCTGA
|
Protein sequence | MALRQIAFYG KGGIGKSTTS QNTLAALVEM GQKILIVGCD PKADSTRLIL NTKMQDTVLS LAAEAGSVED LELEDVMKVG YKGIKCTEAG GPEPGVGCAG RGVITAINFL EENGAYEDVD YVSYDVLGDV VCGGFAMPIR ENKAQEIYIV MSGEMMALYA ANNIAKGILK YASSGGVRLG GLVCNERQTD RELDLAEALA KRLNSQLIHF VPRDNIVQHA ELRRQTVIQY APDSQQAKEY RTLAEKVHAN GGKGTIPTPI TMEELEQMLL DFGIMKTDEQ ALAELQAKEA AKAAAASA
|
| |