Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0971 |
Symbol | |
ID | 3909326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1117750 |
End bp | 1119309 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637882864 |
Product | nitrogenase molybdenum-iron protein beta chain |
Protein accession | YP_484592 |
Protein GI | 86748096 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01286] nitrogenase molybdenum-iron protein beta chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGA CCGCAGAAAA GATCCGGGAT CATTTCGATC TCTTCCATCA GCCCGAATAC GCGGACATGA TGGACAACAA GCGCAAGCAG TTCGAGAACG CCGTCGGCGA AGCCGAAGTC GCGCGCGTGT CGGATTGGAC CAAGACCAAG GAATATCAGG AGAAGAACTT CGCTCGTGAA GCCCTGGTCA TCAACCCGGC CAAGGCCTGC CAGCCGCTCG GCGCGGTGTT CGCCGCGGTG GGCTTCGAGA AGACGCTGCC GTTCGTGCAC GGCTCGCAGG GCTGCGTCGC GTATTATCGC AGCCACTTCT CGCGGCACTT CAAGGAGCCG ACCTCCTGCG TCTCGTCGTC GATGACCGAG GACGCCGCGG TGTTCGGCGG CCTCAACAAC ATGATCGACG GCCTGGCCAA TTCCTACGCG CTGTACAAGC CGAAGATGAT CGCGGTGTCG ACCACCTGCA TGGCCGAAGT GATCGGCGAC GACCTCAACG CCTTCATCAA GAACGCGAAG GAAAAGGGCT CGGTTCCGCA GGACTTCGAC GTCACCTACG CCCACACCCC GGCGTTCGTC GGCAGCCACA TCACCGGCTA CGACAACACC ATGAAGGGCG TGGTCGAGCA CTTCTGGGAC GGCAAGTCCG GCACCACGCC GAAGCTCGAG CGCCAGCCCA ACGAGTCGGT CAACTTCCTC GGCGGCTTCG ACGGCAACAC CGTCGGCAAC ATCCGCGAGG TCAAGCGCAT CTTCGAACTG ATGGGCGTCG ACTACACCAT CTTCGGCGAC AATAGCGACG TCTGGGACAC CCCGGCCGAC GGCGAATTCC GGATGTATGA CGGCGGCACC ACGCTGGAGC AGGCCGCCAA CGCCATCCAC GCCAAGGGCA CGATCTCGAT GCAGGAATTC TGCACCGAAA AGACGCTGGC GACGATCGCG GCGCACGGCC AGGAAGTGGT CGCGCTCAAC AGCCCGATCG GCATCACCGG CACCGACCGC TTTCTGCAGG CGGTGTCGCG GATCACCGGC AAGGCGATCC CCGAAGCGCT GACCAAGGAG CGCGGCCGGC TGGTCGACGC CATCGGCGAC TCCTCGGCGC ACATCCACGG CAAGAAGTTC GCGATCTTCG GCGATCCGGA CCTGTGCTAC GGCCTGGCCG AATTCATCCT CGAACTCGGC GGCGAACCGA CCCACATCCT CGCTACCAAC GGCAACAAGA ACTGGGAAGT GAAGGTCAAC GAGCTCCTGG CGTCCTCGCC GTTCGGCACG AACTGCAAGG TCTATCCCGG CAAGGATCTC TGGCACCTGC GCTCGCTGCT GTTCACCGAG CCGGTCGACT TCATGATCGG CAACACCTAC GGCAAGTATC TCGAGCGCGA CACCGGCACG CCGCTGATCC GCATGGGCTT CCCGGTGTTC GATCGCCACC ACCATCACCG CTCGCCGATC TGGGGCTACC AGGGCACGAT GAACGTGCTG GTCAAGATCC TCGACAAGAT CTTCGACGAA ATGGACAAGG CCACCAACAT CGCCGGCAAG ACCGACCTGT CCTTCGACAT CATCCGCTGA
|
Protein sequence | MTETAEKIRD HFDLFHQPEY ADMMDNKRKQ FENAVGEAEV ARVSDWTKTK EYQEKNFARE ALVINPAKAC QPLGAVFAAV GFEKTLPFVH GSQGCVAYYR SHFSRHFKEP TSCVSSSMTE DAAVFGGLNN MIDGLANSYA LYKPKMIAVS TTCMAEVIGD DLNAFIKNAK EKGSVPQDFD VTYAHTPAFV GSHITGYDNT MKGVVEHFWD GKSGTTPKLE RQPNESVNFL GGFDGNTVGN IREVKRIFEL MGVDYTIFGD NSDVWDTPAD GEFRMYDGGT TLEQAANAIH AKGTISMQEF CTEKTLATIA AHGQEVVALN SPIGITGTDR FLQAVSRITG KAIPEALTKE RGRLVDAIGD SSAHIHGKKF AIFGDPDLCY GLAEFILELG GEPTHILATN GNKNWEVKVN ELLASSPFGT NCKVYPGKDL WHLRSLLFTE PVDFMIGNTY GKYLERDTGT PLIRMGFPVF DRHHHHRSPI WGYQGTMNVL VKILDKIFDE MDKATNIAGK TDLSFDIIR
|
| |