Gene RPC_4462 details


Gene Information

Locus tag: RPC_4462
Symbol:
ID: 3973088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4967641
End bp: 4969152
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 61%
IMG OID: 637927573
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_534304
Protein GI: 90425934
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
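
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4967641
end_bp = 4969152
listed_gene_length = 1512      # bp
listed_protein_length = 503    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should consist of whole codons.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1512 bp) and protein length (503 aa).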


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.559091
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCG GCGCCCAGAT CAGCGGTCCC GCGAAACGAG GTATGACCAT GAGCTCTGCG 
ATCGCAGCCA CCACCGAAGA CACCAAGGCG CGCAACAAGG AATTGATCGG GGAGGTCCTC
AAGGCCTATC CGGAAAAGTT CGCCAAGCGT CGCGCCAAGC ACCTCGGCAC CTACGAGTCC
GAGAAATCCG AATGCGGCGT CAAGTCCAAC ATCAAATCGA TCCCGGGCGT GATGACGATC
CGTGGTTGCG CCTACGCCGG CTCGAAGGGC GTGGTGTGGG GCCCGATCAA GGACATGGTC
CACATCAGCC ACGGCCCGGT CGGCTGCGGC CAGTATTCCT GGGGGTCGCG CCGTAACTAT
TACGTCGGCA CCACCGGCGT CGACAGCTTC GTCACCATGC AGTTCACCTC CGATTTCCAG
GAAAAGGACA TCGTGTTCGG TGGCGACAAG AAGCTCGGCA AGCTGATCGA CGAAATCGAA
GTGCTGTTCC CGCTCAACAA GGGCATCTCG ATCCAGTCGG AATGCCCGAT CGGTCTGATC
GGTGACGACA TCGAGGCGGT CTCCAAGGCC AAGTCGAAGC AGTATGACGG CAAAACCATC
GTGCCGGTGC GTTGCGAAGG CTTCCGCGGC GTCTCGCAGT CGCTCGGCCA CCACATCGCC
AACGACGCGG TGCGGGACTG GGTGTTCGAC AAGTCGGCCG ACAAGCCGGC GCGGTTCGAG
CAGACCCCGT ACGACGTCGC GATCATCGGC GACTACAACA TCGGCGGCGA CGCCTGGTCG
TCGCGCATCC TGCTCGAAGA AATGGGTCTG CGGGTCATCG CGCAGTGGTC CGGCGACGGC
TCGGTGGCCG AACTCGAAGC CACCCCGAAG GCCAAGCTGA ACATCCTGCA CTGCTACCGC
TCGATGAACT ACATCACCCG CCACATGGAA GAGAAGTTCG GCATTCCGTG GGTCGAATAC
AACTTCTTCG GTCCGTCGAA GATCTCGTCG AGTCTGCGCG AAATCGCCAG CCATTTCGAC
GACAAGATCA AGGAAGGCGC CGAGCGCGTC ATCGCGAAAT ATACCCCGCG GATGGAAGCG
GTCATCGCCC AGTACAAGCC TCGCCTGCAG GGCAAGAAGG TGATGCTCTA TGTCGGCGGC
TTGCGTCCGC GCCACGTCAT CGGCGCCTAC GAAGATCTCG GCATGGAAGT GGTCGGCACC
GGCTATGAAT TCGGCCACAA CGACGACTAT CAGCGCACCA CGCATTACGT GAAGGACTCC
ACGCTGATTT ACGACGACGT CACCGGCTAC GAGTTCGAGA AGTTCGTCGA GAAGGTGCAG
CCCGACCTGG TCGGTTCCGG CATCAAGGAG AAGTACATCT TCCAGAAGAT GGGTGTGCCG
TTCCGGCAGA TGCATTCCTG GGATTACTCC GGTCCGTACC ACGGCTACGA CGGTTTTGCG
ATCTTCGCCC GCGACATGGA CATCGCGATC AACGCACCGG TCTGGAAACT GAACAAGGCT
CCCTGGAGCT AA
 
Protein sequence
MAVGAQISGP AKRGMTMSSA IAATTEDTKA RNKELIGEVL KAYPEKFAKR RAKHLGTYES 
EKSECGVKSN IKSIPGVMTI RGCAYAGSKG VVWGPIKDMV HISHGPVGCG QYSWGSRRNY
YVGTTGVDSF VTMQFTSDFQ EKDIVFGGDK KLGKLIDEIE VLFPLNKGIS IQSECPIGLI
GDDIEAVSKA KSKQYDGKTI VPVRCEGFRG VSQSLGHHIA NDAVRDWVFD KSADKPARFE
QTPYDVAIIG DYNIGGDAWS SRILLEEMGL RVIAQWSGDG SVAELEATPK AKLNILHCYR
SMNYITRHME EKFGIPWVEY NFFGPSKISS SLREIASHFD DKIKEGAERV IAKYTPRMEA
VIAQYKPRLQ GKKVMLYVGG LRPRHVIGAY EDLGMEVVGT GYEFGHNDDY QRTTHYVKDS
TLIYDDVTGY EFEKFVEKVQ PDLVGSGIKE KYIFQKMGVP FRQMHSWDYS GPYHGYDGFA
IFARDMDIAI NAPVWKLNKA PWS
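
The sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup, followed by a translation of the first row of the gene sequence for comparison with the start of the protein sequence. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling. The codon table is built from the standard genetic code; translation table 11 assigns the same amino acids to codons and differs only in which codons may act as alternative starts, which does not matter here because this gene begins with ATG.

```python
# First row of the gene sequence as listed (60 nt); the remaining rows are omitted here.
first_row = "ATGGCAGTCG GCGCCCAGAT CAGCGGTCCC GCGAAACGAG GTATGACCAT GAGCTCTGCG"

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate whole codons from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(first_row))  # MAVGAQISGPAKRGMTMSSA
```

The printed prefix matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a similar comparison against the listed GC content.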