Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1074 |
Symbol | |
ID | 4021550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1225007 |
End bp | 1226470 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637961266 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_568213 |
Protein GI | 91975554 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.811234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG CAGTCGCAGA ATCTCCCGCG GACCTCAAGG AACGCAACAA GCAGCTGATC GGCGAAGTCC TGGAAGCCTA TCCGGACAAG TCCGCCAAGC GCCGCGCCAA GCACCTCAAC ACCTACGAGG CCGAAAAGGC CGAGTGCTCG GTCAAGTCCA ACATCAAGTC GATCCCGGGC GTGATGACGA TCCGCGGCTG CGCCTACGCC GGCTCGAAGG GCGTGGTGTG GGGTCCGATC AAGGACATGG TCCACATCAG CCACGGCCCG GTCGGCTGCG GCCAGTATTC CTGGGGCTCG CGCCGCAACT ATTACAAGGG CGTCACCGGC ATCGACACCT TCGGCACGAT GCAGTTCACC TCCGATTTCC AGGAGAAGGA CATCGTCTTC GGTGGCGACA AGAAGCTCGG CAAGATCATC GACGAAATTC AGGACCTGTT CCCGCTGAAC CGCGGCATTT CGGTGCAATC GGAATGCCCG ATCGGTCTGA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGACGAA ACAGTATGAC GGCAAGCCTA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC CATCACATCG CCAACGACGT GATCCGCGAC TGGGTGTTCG ACAAGGCCGG CGACAAGGTC GCCACGTTCG AATCGACGCC CTACGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC GATGCCTGGG CCTCGCGCAT CCTGCTTGAG GAGATGGGGC TCCGCGTGAT CGCGCAGTGG TCCGGCGACG GCACCATCGC CGAGCTCGAG AACACTCCAA AAGCGAAGCT GAACATCCTG CACTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATCCCG TGGGTCGAGT ACAATTTCTT CGGTCCGACC AAGATCGAAG CCTCGTTGCG CGAGATCGCT GCGAAGTTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCCAA GTACAAGCCG CGGATGCAGG CGATCGTCGA TCGCTATCGT CCGCGCCTGG AAGGCAAGAA GGTGATGCTC TATGTCGGCG GCCTGCGTCC GCGCCACGTG ATCGGCGCCT ATGAAGACCT CGGCATGGAA GTGGTCGGCA CCGGCTATGA ATTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAC GTGAAGGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAATTCGTG GAGAAGGTGC GGCCCGATCT GGTCGGCTCC GGCATCAAGG AAAAGTACAT CTTCCAGAAG ATGGGTGTGC CGTTCCGCCA GATGCATTCG TGGGACTATT CCGGCCCGTA TCACGGCTAT GACGGCTTCG CCATCTTCGC CCGTGACATG GACATCGCCA TCAACGCTCC GATCTGGAAG CTGACCAAGG CACCTTGGAG CTGA
|
Protein sequence | MSTAVAESPA DLKERNKQLI GEVLEAYPDK SAKRRAKHLN TYEAEKAECS VKSNIKSIPG VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGVTG IDTFGTMQFT SDFQEKDIVF GGDKKLGKII DEIQDLFPLN RGISVQSECP IGLIGDDIEA VSKAKTKQYD GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGDKV ATFESTPYDV AIIGDYNIGG DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP WVEYNFFGPT KIEASLREIA AKFDDKIKEG AERVIAKYKP RMQAIVDRYR PRLEGKKVML YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV EKVRPDLVGS GIKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DIAINAPIWK LTKAPWS
|
| |