Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4462 |
Symbol | |
ID | 3973088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4967641 |
End bp | 4969152 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637927573 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_534304 |
Protein GI | 90425934 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain [TIGR01862] nitrogenase component I, alpha chain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.559091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCG GCGCCCAGAT CAGCGGTCCC GCGAAACGAG GTATGACCAT GAGCTCTGCG ATCGCAGCCA CCACCGAAGA CACCAAGGCG CGCAACAAGG AATTGATCGG GGAGGTCCTC AAGGCCTATC CGGAAAAGTT CGCCAAGCGT CGCGCCAAGC ACCTCGGCAC CTACGAGTCC GAGAAATCCG AATGCGGCGT CAAGTCCAAC ATCAAATCGA TCCCGGGCGT GATGACGATC CGTGGTTGCG CCTACGCCGG CTCGAAGGGC GTGGTGTGGG GCCCGATCAA GGACATGGTC CACATCAGCC ACGGCCCGGT CGGCTGCGGC CAGTATTCCT GGGGGTCGCG CCGTAACTAT TACGTCGGCA CCACCGGCGT CGACAGCTTC GTCACCATGC AGTTCACCTC CGATTTCCAG GAAAAGGACA TCGTGTTCGG TGGCGACAAG AAGCTCGGCA AGCTGATCGA CGAAATCGAA GTGCTGTTCC CGCTCAACAA GGGCATCTCG ATCCAGTCGG AATGCCCGAT CGGTCTGATC GGTGACGACA TCGAGGCGGT CTCCAAGGCC AAGTCGAAGC AGTATGACGG CAAAACCATC GTGCCGGTGC GTTGCGAAGG CTTCCGCGGC GTCTCGCAGT CGCTCGGCCA CCACATCGCC AACGACGCGG TGCGGGACTG GGTGTTCGAC AAGTCGGCCG ACAAGCCGGC GCGGTTCGAG CAGACCCCGT ACGACGTCGC GATCATCGGC GACTACAACA TCGGCGGCGA CGCCTGGTCG TCGCGCATCC TGCTCGAAGA AATGGGTCTG CGGGTCATCG CGCAGTGGTC CGGCGACGGC TCGGTGGCCG AACTCGAAGC CACCCCGAAG GCCAAGCTGA ACATCCTGCA CTGCTACCGC TCGATGAACT ACATCACCCG CCACATGGAA GAGAAGTTCG GCATTCCGTG GGTCGAATAC AACTTCTTCG GTCCGTCGAA GATCTCGTCG AGTCTGCGCG AAATCGCCAG CCATTTCGAC GACAAGATCA AGGAAGGCGC CGAGCGCGTC ATCGCGAAAT ATACCCCGCG GATGGAAGCG GTCATCGCCC AGTACAAGCC TCGCCTGCAG GGCAAGAAGG TGATGCTCTA TGTCGGCGGC TTGCGTCCGC GCCACGTCAT CGGCGCCTAC GAAGATCTCG GCATGGAAGT GGTCGGCACC GGCTATGAAT TCGGCCACAA CGACGACTAT CAGCGCACCA CGCATTACGT GAAGGACTCC ACGCTGATTT ACGACGACGT CACCGGCTAC GAGTTCGAGA AGTTCGTCGA GAAGGTGCAG CCCGACCTGG TCGGTTCCGG CATCAAGGAG AAGTACATCT TCCAGAAGAT GGGTGTGCCG TTCCGGCAGA TGCATTCCTG GGATTACTCC GGTCCGTACC ACGGCTACGA CGGTTTTGCG ATCTTCGCCC GCGACATGGA CATCGCGATC AACGCACCGG TCTGGAAACT GAACAAGGCT CCCTGGAGCT AA
|
Protein sequence | MAVGAQISGP AKRGMTMSSA IAATTEDTKA RNKELIGEVL KAYPEKFAKR RAKHLGTYES EKSECGVKSN IKSIPGVMTI RGCAYAGSKG VVWGPIKDMV HISHGPVGCG QYSWGSRRNY YVGTTGVDSF VTMQFTSDFQ EKDIVFGGDK KLGKLIDEIE VLFPLNKGIS IQSECPIGLI GDDIEAVSKA KSKQYDGKTI VPVRCEGFRG VSQSLGHHIA NDAVRDWVFD KSADKPARFE QTPYDVAIIG DYNIGGDAWS SRILLEEMGL RVIAQWSGDG SVAELEATPK AKLNILHCYR SMNYITRHME EKFGIPWVEY NFFGPSKISS SLREIASHFD DKIKEGAERV IAKYTPRMEA VIAQYKPRLQ GKKVMLYVGG LRPRHVIGAY EDLGMEVVGT GYEFGHNDDY QRTTHYVKDS TLIYDDVTGY EFEKFVEKVQ PDLVGSGIKE KYIFQKMGVP FRQMHSWDYS GPYHGYDGFA IFARDMDIAI NAPVWKLNKA PWS
|
| |