Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5100 |
Symbol | |
ID | 6412794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5485218 |
End bp | 5486681 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642714985 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001994064 |
Protein GI | 192293459 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CAGTCGCAGA ATCCCCCGCG GACATCAAGG AACGTAACAA GAAGCTGATC GGCGAAGTCC TGGAGGCCTA TCCGGACAAG TCGGCCAAGC GTCGCGCCAA GCATCTCAAT ACGTACGACG CCGAAAAGGC GGAGTGCTCG GTCAAGTCCA ACATCAAGTC GATCCCGGGC GTGATGACGA TCCGCGGTTG CGCCTACGCC GGCTCGAAGG GCGTGGTATG GGGCCCGATC AAGGACATGG TCCACATCAG CCACGGTCCG GTCGGCTGCG GCCAGTATTC CTGGGGCTCG CGCCGCAACT ACTACAAGGG CACCACCGGC GTCGACACCT TCGGCACGAT GCAGTTCACC TCCGACTTCC AGGAGAAGGA CATCGTTTTC GGCGGTGACA AGAAGCTCGG CAAGATCATC GACGAGATCC AGGAGCTGTT CCCGCTCTCC AAGGGCATCT CGGTGCAGTC GGAATGCCCG ATCGGTCTGA TCGGTGACGA CATCGAGGCG GTCTCGAAGG CCAAGTCGAA GCAGTACGAC GGCAAGCCGA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC CACCACATTG CCAACGACGT GATCCGTGAC TGGGTGTTCG ACAAGGCTGG CGAGAAGAAT GCCGGCTTCC AGTCGACCCC CTACGACGTC GCGATCATCG GCGACTACAA TATCGGCGGC GACGCCTGGG CCTCGCGCAT CCTGCTCGAG GAGATGGGCC TCCGCGTGAT CGCGCAGTGG TCCGGCGACG GCACCATTGC CGAGCTGGAG AATACCCCGA AGGCGAAGCT GAACATCCTG CACTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATTCCG TGGGTGGAAT ACAACTTCTT CGGTCCCACC AAGATCGAAG CGAGCCTGCG CGAGATCGCG TCGAAGTTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCTAA GTACAAGCCG CAGATGGAAG CGGTGATCGC CAAATATCGC CCGCGCCTCG AAGGCAAGAA GGTGATGCTG TATGTCGGCG GTCTGCGTCC GCGCCACGTC ATCGGCGCCT ACGAAGATCT CGGCATGGAA GTGGTCGGAA CCGGCTACGA ATTTGGCCAT AACGACGATT ATCAGCGCAC CACCCATTAC GTGAAGGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAGTTCGTC GAGAAGGTGC GTCCCGACCT GGTCGGCTCG GGCGTCAAGG AAAAGTACAT CTTCCAGAAG ATGGGCGTTC CGTTCCGCCA GATGCACTCC TGGGACTACT CGGGCCCGTA TCATGGCTAC GACGGGTTCG GCATCTTCGC CCGCGACATG GACATCGCGA TCAATGCCCC GGTCTGGAAA CTGACCAAGG CGCCTTGGAG CTGA
|
Protein sequence | MSTAVAESPA DIKERNKKLI GEVLEAYPDK SAKRRAKHLN TYDAEKAECS VKSNIKSIPG VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGTTG VDTFGTMQFT SDFQEKDIVF GGDKKLGKII DEIQELFPLS KGISVQSECP IGLIGDDIEA VSKAKSKQYD GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGEKN AGFQSTPYDV AIIGDYNIGG DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP WVEYNFFGPT KIEASLREIA SKFDDKIKEG AERVIAKYKP QMEAVIAKYR PRLEGKKVML YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV EKVRPDLVGS GVKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFGIFARDM DIAINAPVWK LTKAPWS
|
| |