Gene RPC_4683 details


Gene Information

Locus tag: RPC_4683
Symbol:
ID: 3972389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5241486
End bp: 5243060
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 61%
IMG OID: 637927795
Product: nitrogenase alpha chain
Protein accession: YP_534524
Protein GI: 90426154
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01284] nitrogenase alpha chain
    [TIGR01861] nitrogenase iron-iron protein, alpha chain
    [TIGR01862] nitrogenase component I, alpha chain
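
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following Python sketch shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5241486, 5243060)
print(length)                  # 1575
print(protein_length(length))  # 524
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.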


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTATC ATGAGTTCGA AGTCAGCAAA TGCATCCCCG AGCGCAAGCA GCACGCGGTC 
GTCAAGGGCC CGGGGGAGGA TCTGACCTCC TGCCTGCCGA AGGGATATCT CAACACCATC
CCTGGCTCGA TCTCCGAACG CGGCTGCGCC TATTGCGGCG CCAAGCACGT CATCGGCACG
CCGATGAAGG ACGTGATCCA TCTCAGTCAC GGCCCGGTCG GCTGCACCTA CGACACCTGG
CAGACCAAAC GCTATATCAG CGATAATAAC GACTATCAGC TGAAATACAC CTTCGCTTCC
GACGTGAAGG AGAAGCACAT CGTATTCGGC GCCGAGAAGC TGTTGAAGCA GAACATCCTC
GAGGCGTTCA AAGCGTTTCC GACTATGAAG CGCATGACCA TCTACCAGAC CTGCGCCACC
GCCTTGATCG GCGACGACGT CAACGCCATC GCCGCCGAGG TGATGGAGGA ACTGCCCGAC
GTCGATATCT TCGTCTGCAA CTCGCCGGGT TTCGCAGGCC CCAGCCAATC CGGCGGCCAT
CACAAGATCA ACATCGCTTG GCTGAACCAA AAGGTCGGCA CCGTCGAGCC GAAGATCACC
GGCGACTACG TCATCAATTA CGTCGGCGAG TACAACATCC AAGGTGACCA GGAAGTCATG
ATCGACTTCT TCAAGCGTAT GGGCATCCAG GTGCTGTCGA CCTTCACCGG CAACGGCTCC
TACGACGATC TGCGCAGCAT GCACGGCGCC CATCTCAACG TGCTGGAATG CGCCCGCTCG
GCCGAATACA TCTGCGACGA ATTGCGGATG CGCTACGGCA TTCCGCGGCT CGACATCGAC
GGCTTCGGCC ACAAGGCGCT CGGCGACAGC TTGCGCAAGG TCGGCCTGTT CTTCGGGATC
GAAGACCGCG CCGAGGCGAT CATCGCCGAG GAGACCGCCA AATGGGGTCC CGAACTCGCC
TGGTACAAGG AGCGCCTGCA AGGCAAGAAG GTGTGCCTGT GGCCGGGCGG CTCCAAGCTC
TGGCACTGGG CCCATGCCAT CCAGGAAGAA ATGGGCGTCC AGGTGGTCTC GGTCTACACC
AAGTTCGGCC ATCAGGGCGA CATGGAAAAG GGCGTCTCGC GTTGCGGCGA AGGCGCGCTG
GCGATCGATG ACCCCAACGA ACTCGAAAAT CAGGAAGCGC TGAAGACCCT GAAGCCGGAC
GTGATCTTCA CCGGCAAGCG GCCGGGCGAA GTCGCCAAGA AGATGCGGGT GCCCTATCTC
AACGCCCATG CTTACCACAA CGGCCCCTAC AAGGGCTGGG AGGGCTGGGT GCGCTTTGCC
CGCGACATCT ACAATGCGAT CTATTCGCCG ATGCATCAGC TGTCGGCGAT CGACATCTCC
AAGGACGACT ACGCGACCGA CAAGGGCTTC ACCACCCGGC GCATGCTGTC CGATGCCAAT
CTCTCCGACG AGTCAAAGGC GTCGCCGATG ACCGGCTATT CCGGCAAGTT CGACCCGATC
GCCGCCATCC GCGCCAAGAC CGCCGCCGAC TATCCGGTGT TTCCGCGTCG TAGCGTCACC
GAAGCCGCCG AGTAG
 
Protein sequence
MPYHEFEVSK CIPERKQHAV VKGPGEDLTS CLPKGYLNTI PGSISERGCA YCGAKHVIGT 
PMKDVIHLSH GPVGCTYDTW QTKRYISDNN DYQLKYTFAS DVKEKHIVFG AEKLLKQNIL
EAFKAFPTMK RMTIYQTCAT ALIGDDVNAI AAEVMEELPD VDIFVCNSPG FAGPSQSGGH
HKINIAWLNQ KVGTVEPKIT GDYVINYVGE YNIQGDQEVM IDFFKRMGIQ VLSTFTGNGS
YDDLRSMHGA HLNVLECARS AEYICDELRM RYGIPRLDID GFGHKALGDS LRKVGLFFGI
EDRAEAIIAE ETAKWGPELA WYKERLQGKK VCLWPGGSKL WHWAHAIQEE MGVQVVSVYT
KFGHQGDMEK GVSRCGEGAL AIDDPNELEN QEALKTLKPD VIFTGKRPGE VAKKMRVPYL
NAHAYHNGPY KGWEGWVRFA RDIYNAIYSP MHQLSAIDIS KDDYATDKGF TTRRMLSDAN
LSDESKASPM TGYSGKFDPI AAIRAKTAAD YPVFPRRSVT EAAE
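
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The Python sketch below is one minimal way to do that and to translate the cleaned nucleotide string. It uses the standard codon assignments, which translation table 11 shares for all codons (table 11 differs only in the alternative start codons it permits; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final row of the gene sequence in this record.
first_blocks = clean("ATGCCCTATC ATGAGTTCGA AGTCAGCAAA")
last_row = clean("GAAGCCGCCG AGTAG")
print(translate(first_blocks))  # MPYHEFEVSK
print(translate(last_row))      # EAAE
```

The two printed fragments match the first block and the last four residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.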