Gene Information |
Locus tag | RPD_1075 |
Symbol | |
ID | 4021551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1226508 |
End bp | 1228121 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637961267 |
Product | nitrogenase molybdenum-iron protein beta chain |
Protein accession | YP_568214 |
Protein GI | 91975555 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01286] nitrogenase molybdenum-iron protein beta chain |
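The length fields in the record above agree with the coordinates, which can be checked with a little arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree under that reading) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1226508, 1228121)
print(length)                  # 1614
print(protein_length(length))  # 537
```

Both values match the Gene Length (1614 bp) and Protein Length (537 aa) fields of RPD_1075.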
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.902597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGATCCA CCGGAAGCGC CGTGTCCCCG AATTCCTTGC GAAAGAACAT CAACATGACC GAGACCGTAG AAAAGATCCG GGATCATTTC GATCTCTTCC ATCAGCCCGA ATACGCGGAC ATGATGGACA ACAAGCGCAA GCAGTTCGAA AACGCCGTCG GCGAAGCCGA AGTCGAGCGC GTGGCGGACT GGACCAAGAC CAAGGAATAC CAGGACAAGA ACTTTGCGCG TGAAGCCCTG GTCATCAATC CGGCCAAGGC CTGCCAGCCG CTCGGCGCGG TGTTCGCCGC GGTCGGATTC GAAAAGACGC TGCCGTTCGT GCACGGCTCG CAGGGCTGCG TCGCCTATTA TCGTAGCCAC TTCTCGCGGC ACTTCAAGGA GCCGACCTCC TGCGTCTCCT CGTCGATGAC CGAAGACGCC GCGGTGTTCG GCGGCCTCAA CAACATGATC GACGGCCTGG CGAATTCCTA CGCGCTGTAC AAGCCGAAGA TGATCGCGGT CTCGACCACC TGCATGGCCG AGGTGATCGG CGACGACCTC AACGCGTTCA TCAAGAATGC CAAGGAGAAG GGCTCCGTTC CGCAGGAGTT CGACGTCACC TACGCCCACA CCCCGGCCTT CGTCGGCAGC CACATCACCG GCTACGACAA CACGATGAAG GGCGTGGTGG AGCACTTCTG GGACGGCAAG TCCGGCACCG TGCCGAAGCT CGAGCGCCAG CCCAACGGCT CGATCAACTT CCTCGGCGGC TTCGACGGCA ACACCGTCGG CAACATCCGC GAGGTCAAGC GGATCTTCGA ACTGATGGGC GTCGACTACA CCATCTTCGG CGACAACAGC GACGTTTGGG ACACCCCGGC CGACGGCGAG TTCCGGATGT ATGACGGCGG CACCACGCTG GAGCAGGCCG CCAACGCCAT CCACGCCAAG GGCACGATCT CGATGCAGGA ATTCTGCACG GAAAAGACGC TGGCGACGAT CGCGGCGCAC GGCCAGGAAG TGGTCGCGCT CAACTCACCG ATCGGCATCA CCGGCACCGA TCGCTTCCTG CAGGCGGTGT CGCGGATCAC CGGCAAAGCG ATCCCCGAAG CGCTGACCAA GGAACGCGGC CGGCTGGTCG ACGCCATCGG CGACTCGAGC GCGCACATCC ACGGCAAGAA GTTCGCGATC TTCGGCGATC CGGACCTGTG CTACGGCGTG GCCGAATTCA TCCTCGAGCT CGGTGGCGAA CCGACCCACA TTCTCGCCAC CAACGGCAAC AAGAACTGGG AGGAGAAGGT CAACCAGCTC TTGGCGTCCT CGCCGTTCGG CACAAACTGC AAGGTCTACG CCGGCAAGGA TCTCTGGCAC CTGCGCTCGC TGCTGTTCAC TGAGCCGGTC GACCTCATGA TCGGCAACAC CTACGGCAAG TATCTCGAGC GCGACACCGG AACGCCGTTG ATCCGCATGG GCTTCCCGGT GTTCGATCGC CACCATCACC ACCGCTCGCC GATCTGGGGA TATCAGGGCA CGATGAACGT CCTGGTGAAG ATCCTCGACA AGATCTTCGA CGAGATGGAC AAGGCCACCA ACACCGCCGG CAAGACCGAC CTCTCCTTCG ACATCATCCG CTGA
Protein sequence | MRSTGSAVSP NSLRKNINMT ETVEKIRDHF DLFHQPEYAD MMDNKRKQFE NAVGEAEVER VADWTKTKEY QDKNFAREAL VINPAKACQP LGAVFAAVGF EKTLPFVHGS QGCVAYYRSH FSRHFKEPTS CVSSSMTEDA AVFGGLNNMI DGLANSYALY KPKMIAVSTT CMAEVIGDDL NAFIKNAKEK GSVPQEFDVT YAHTPAFVGS HITGYDNTMK GVVEHFWDGK SGTVPKLERQ PNGSINFLGG FDGNTVGNIR EVKRIFELMG VDYTIFGDNS DVWDTPADGE FRMYDGGTTL EQAANAIHAK GTISMQEFCT EKTLATIAAH GQEVVALNSP IGITGTDRFL QAVSRITGKA IPEALTKERG RLVDAIGDSS AHIHGKKFAI FGDPDLCYGV AEFILELGGE PTHILATNGN KNWEEKVNQL LASSPFGTNC KVYAGKDLWH LRSLLFTEPV DLMIGNTYGK YLERDTGTPL IRMGFPVFDR HHHHRSPIWG YQGTMNVLVK ILDKIFDEMD KATNTAGKTD LSFDIIR
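The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) produces when GTG is used as an initiation codon. The sketch below shows how the first ten residues and a GC fraction could be derived from the nucleotide string. It makes several assumptions: the codon assignments are those of the standard code, which table 11 shares; the set of alternative start codons is listed from memory of the NCBI table and should be checked against the official definition; and all function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Assumed initiation codons for table 11; verify against the NCBI definition.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiation codon is read as methionine
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the RPD_1075 gene sequence as listed in the record.
prefix = "GTGCGATCCA CCGGAAGCGC CGTGTCCCCG"
print(translate_cds(prefix))         # MRSTGSAVSP
print(round(gc_content(prefix), 2))  # 0.73
```

Passing the full gene sequence to the same functions should give values comparable to the Protein sequence and GC content fields. The 0.73 printed here applies only to the 30-base prefix, not to the whole gene.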