Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0970 |
Symbol | |
ID | 3909325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1116194 |
End bp | 1117657 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637882863 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_484591 |
Protein GI | 86748095 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.659095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG CAGTCGCAGA ATCCCCCGCG GACATCAAGG AACGCAACAA GCAGCTGATC GGCGAAGTCC TGGAAGCCTA TCCGGACAAG TCCGCCAAGC GCCGCGCCAA GCACCTCAAC ACCTACGAGG CCGAAAAGGC CGAGTGTTCG GTCAAGTCCA ACATCAAGTC GATCCCCGGC GTGATGACGA TCCGCGGCTG CGCCTACGCC GGCTCCAAGG GCGTGGTGTG GGGTCCGATC AAGGACATGG TCCACATCAG CCATGGTCCG GTCGGCTGCG GCCAGTATTC CTGGGGTTCG CGCCGCAACT ACTACAAGGG CAACACCGGC ATCGACACCT TCGGCACGAT GCAGTTCACC TCCGACTTCC AGGAGAAGGA CATCGTCTTC GGCGGTGACA AGAAGCTCGG CAAGATCATC GACGAAATTC AGGACCTGTT CCCGCTGAAC CGCGGCATCT CGGTGCAGTC GGAATGCCCG ATCGGCCTGA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGACCAA GCAATACGAC GGCAAGCCGA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC CATCACATCG CCAACGACGT GATCCGCGAC TGGGTGTTCG ACAAGGCCGG CGACAAGGTC GCCACCTTCG AATCGACCCC CTACGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC GACGCCTGGG CCTCGCGCAT CCTGCTCGAG GAAATGGGTC TGCGCGTGAT CGCGCAGTGG TCCGGCGACG GCACCATCGC CGAGCTGGAG AACACCCCGA AGGCGAAGCT GAACATCCTG CATTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATCCCG TGGGTCGAGT ACAATTTCTT CGGCCCGACC AAGATCGAAG CCTCGCTGCG AGAGATCGCC GCGAAATTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCCAA ATACAAGCCG CGGATGCAGG CGATCGTCGA TCGTTATCGC CCGCGCCTCG AAGGCAAGAA GGTCATGCTC TATGTCGGCG GCCTGCGTCC GCGGCACGTC ATCGGCGCCT ACGAAGACCT CGGCATGGAA GTGGTCGGCA CCGGCTATGA ATTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAC GTGAAAGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAGTTCGTC GAGAAGGTCC GGCCCGATCT GGTCGGCTCC GGCATCAAGG AAAAGTACAT CTTCCAGAAG ATGGGTGTGC CGTTCCGCCA GATGCATTCG TGGGACTATT CCGGCCCGTA TCACGGCTAT GACGGCTTCG CCATCTTCGC CCGCGACATG GACATCGCCA TCAACGCTCC GATCTGGAAG CTGACCAAGG CACCTTGGAG CTGA
|
Protein sequence | MSTAVAESPA DIKERNKQLI GEVLEAYPDK SAKRRAKHLN TYEAEKAECS VKSNIKSIPG VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGNTG IDTFGTMQFT SDFQEKDIVF GGDKKLGKII DEIQDLFPLN RGISVQSECP IGLIGDDIEA VSKAKTKQYD GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGDKV ATFESTPYDV AIIGDYNIGG DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP WVEYNFFGPT KIEASLREIA AKFDDKIKEG AERVIAKYKP RMQAIVDRYR PRLEGKKVML YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV EKVRPDLVGS GIKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DIAINAPIWK LTKAPWS
|
| |