Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0959 |
Symbol | |
ID | 3909314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1109005 |
End bp | 1110561 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637882852 |
Product | nitrogenase cofactor biosynthesis protein NifB |
Protein accession | YP_484580 |
Protein GI | 86748084 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01290] nitrogenase cofactor biosynthesis protein NifB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.438509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGC TGCTGCAACT CCACGATTTC GGGGCCCCCG GGGCGAAGTC CTTCGAGGCG CTGCGCGCCG GCGCGGCGCA ATCCGGCTGC AGCACCACGG GCGGCAACGG CAAGTCCGGC TGCGGCTCGG CGGCTGGACA GGGCGATCTG CCGCCGGAGA TCTGGGAGAA GGTGAAGAAC CATCCCTGCT ACAGCGAGCA GGCGCATCAT CACTTCGCCC GCATGCATGT CGCGGTCGCG CCCGCATGCA ACATCCAGTG CAATTACTGC AATCGCAAAT ATGACTGCGC CAATGAATCG CGCCCCGGCG TCGTCAGCGA GAAGCTGACG CCCGAGCAGG CGGCCCGCAA GGTGATCGCG GTGGCCTCGA CGATTCCGCA GATGACCGTG CTCGGCATCG CCGGTCCGGG CGACGCGCTC GCGAACCCTG CCAAGACCTT CAAGACCTTC GAGCTGGTGA CGGCCACGGC GCCCGACATC AAGTTGTGCC TGTCGACCAA CGGGCTGATG CTGCCCGACT ACGTCGAGCA GATCGCCGCC ATGAATGTCG ATCACGTCAC CATCACCATC AACATGATCG ATCCGGAGAT CGGCGCGCAG ATCTATCCGT GGATCTTCTA CAACCACCGC CGCTTCACCG GCGTCGAGGC GTCGAAAATC CTCAGCGAGC GGCAATTGCT CGGGCTCGAG ATGCTCACCG CGCGCGGCAT CCTGGTGAAG GTCAATTCGG TGATGATCCC GGGGATCAAC GACCGGCACC TGATCGAGGT CAACAAGGCG GTGAAGTCGC GCGGCGCCTT CCTGCACAAC ATCATGCCGC TGATCTCCGA GCCGGAGCAC GGCACGGTGT TCGGTCTCGA AGGCCGTCGC GGCCCGTCGG CGCAGGAGCT CAAGGCGCTG CAGGACGATT GCGAAGGCGA GATGAACATG ATGCGGCACT GCCGGCAGTG CCGCGCCGAC GCGGTCGGCC TGCTCGGCGA GGATCGCAGC GCCGAATTCA CCACCGACAA GGTGATGGAG ATGGAGGTCG AATACGATCT CGCCGCGCGG CAGGCCTATC AGGCCAAGGT CGAGGCCGAG CGCGACGCCA TCGCCCTCGC CAAGCAGCGC GAACTGGCGA CGCTCGCCGA CGAGACCGCC ACGATCAAGA TCCAGGTCGC GATCGCCACC AAGGGCGGCG GCGTGATCAA CGAGCACTTC GGACATGCTC ACGAATTCCA GATCTACGAG GTTTCGACCG CCGGCGCCAA GTTCATCGGT CATCGCCGCG TCGATCTGTA TTGCGAGGGC GGCTACGCCA GCGAGACCGG CATCGAGCCG ATCCTGAAGG CGCTGAACGA CTGCACCGCG GTGCTGGTCG CCAAGATCGG TCTGTGCCCG AAGGATTCTC TGGCCGGCGC CGGGATCGAG GCGGTCGAGG ACTACGCCTT CGAATATATC GAGCAGTCGG TGATCGCTTA TTTCAAGGAC TACCTCGCGC GCGTCGGGAA GTCGGAAATC CGCCACGTCG CTCGCGGCGA CGCCGATATT CGCCAGGGCG CGTTCGTCGC GAGTTGA
|
Protein sequence | MSKLLQLHDF GAPGAKSFEA LRAGAAQSGC STTGGNGKSG CGSAAGQGDL PPEIWEKVKN HPCYSEQAHH HFARMHVAVA PACNIQCNYC NRKYDCANES RPGVVSEKLT PEQAARKVIA VASTIPQMTV LGIAGPGDAL ANPAKTFKTF ELVTATAPDI KLCLSTNGLM LPDYVEQIAA MNVDHVTITI NMIDPEIGAQ IYPWIFYNHR RFTGVEASKI LSERQLLGLE MLTARGILVK VNSVMIPGIN DRHLIEVNKA VKSRGAFLHN IMPLISEPEH GTVFGLEGRR GPSAQELKAL QDDCEGEMNM MRHCRQCRAD AVGLLGEDRS AEFTTDKVME MEVEYDLAAR QAYQAKVEAE RDAIALAKQR ELATLADETA TIKIQVAIAT KGGGVINEHF GHAHEFQIYE VSTAGAKFIG HRRVDLYCEG GYASETGIEP ILKALNDCTA VLVAKIGLCP KDSLAGAGIE AVEDYAFEYI EQSVIAYFKD YLARVGKSEI RHVARGDADI RQGAFVAS
|
| |