Gene RPB_0970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0970 
Symbol 
ID3909325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1116194 
End bp1117657 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content61% 
IMG OID637882863 
Productnitrogenase molybdenum-iron protein alpha chain 
Protein accessionYP_484591 
Protein GI86748095 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01282] nitrogenase molybdenum-iron protein alpha chain
[TIGR01862] nitrogenase component I, alpha chain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.659095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CAGTCGCAGA ATCCCCCGCG GACATCAAGG AACGCAACAA GCAGCTGATC 
GGCGAAGTCC TGGAAGCCTA TCCGGACAAG TCCGCCAAGC GCCGCGCCAA GCACCTCAAC
ACCTACGAGG CCGAAAAGGC CGAGTGTTCG GTCAAGTCCA ACATCAAGTC GATCCCCGGC
GTGATGACGA TCCGCGGCTG CGCCTACGCC GGCTCCAAGG GCGTGGTGTG GGGTCCGATC
AAGGACATGG TCCACATCAG CCATGGTCCG GTCGGCTGCG GCCAGTATTC CTGGGGTTCG
CGCCGCAACT ACTACAAGGG CAACACCGGC ATCGACACCT TCGGCACGAT GCAGTTCACC
TCCGACTTCC AGGAGAAGGA CATCGTCTTC GGCGGTGACA AGAAGCTCGG CAAGATCATC
GACGAAATTC AGGACCTGTT CCCGCTGAAC CGCGGCATCT CGGTGCAGTC GGAATGCCCG
ATCGGCCTGA TCGGCGACGA CATCGAGGCG GTCTCCAAGG CCAAGACCAA GCAATACGAC
GGCAAGCCGA TCATCCCGGT GCGCTGCGAA GGCTTCCGCG GCGTGTCGCA GTCGCTCGGC
CATCACATCG CCAACGACGT GATCCGCGAC TGGGTGTTCG ACAAGGCCGG CGACAAGGTC
GCCACCTTCG AATCGACCCC CTACGACGTC GCGATCATCG GCGACTACAA CATCGGCGGC
GACGCCTGGG CCTCGCGCAT CCTGCTCGAG GAAATGGGTC TGCGCGTGAT CGCGCAGTGG
TCCGGCGACG GCACCATCGC CGAGCTGGAG AACACCCCGA AGGCGAAGCT GAACATCCTG
CATTGCTACC GCTCGATGAA CTACATCACG CGGCACATGG AAGAGAAGTT CGGGATCCCG
TGGGTCGAGT ACAATTTCTT CGGCCCGACC AAGATCGAAG CCTCGCTGCG AGAGATCGCC
GCGAAATTCG ACGACAAGAT CAAGGAAGGC GCCGAGCGCG TCATCGCCAA ATACAAGCCG
CGGATGCAGG CGATCGTCGA TCGTTATCGC CCGCGCCTCG AAGGCAAGAA GGTCATGCTC
TATGTCGGCG GCCTGCGTCC GCGGCACGTC ATCGGCGCCT ACGAAGACCT CGGCATGGAA
GTGGTCGGCA CCGGCTATGA ATTCGGCCAC AACGACGACT ATCAGCGCAC CACCCACTAC
GTGAAAGACG GCACGCTGAT CTACGACGAC GTCACCGGCT ACGAATTCGA GAAGTTCGTC
GAGAAGGTCC GGCCCGATCT GGTCGGCTCC GGCATCAAGG AAAAGTACAT CTTCCAGAAG
ATGGGTGTGC CGTTCCGCCA GATGCATTCG TGGGACTATT CCGGCCCGTA TCACGGCTAT
GACGGCTTCG CCATCTTCGC CCGCGACATG GACATCGCCA TCAACGCTCC GATCTGGAAG
CTGACCAAGG CACCTTGGAG CTGA
 
Protein sequence
MSTAVAESPA DIKERNKQLI GEVLEAYPDK SAKRRAKHLN TYEAEKAECS VKSNIKSIPG 
VMTIRGCAYA GSKGVVWGPI KDMVHISHGP VGCGQYSWGS RRNYYKGNTG IDTFGTMQFT
SDFQEKDIVF GGDKKLGKII DEIQDLFPLN RGISVQSECP IGLIGDDIEA VSKAKTKQYD
GKPIIPVRCE GFRGVSQSLG HHIANDVIRD WVFDKAGDKV ATFESTPYDV AIIGDYNIGG
DAWASRILLE EMGLRVIAQW SGDGTIAELE NTPKAKLNIL HCYRSMNYIT RHMEEKFGIP
WVEYNFFGPT KIEASLREIA AKFDDKIKEG AERVIAKYKP RMQAIVDRYR PRLEGKKVML
YVGGLRPRHV IGAYEDLGME VVGTGYEFGH NDDYQRTTHY VKDGTLIYDD VTGYEFEKFV
EKVRPDLVGS GIKEKYIFQK MGVPFRQMHS WDYSGPYHGY DGFAIFARDM DIAINAPIWK
LTKAPWS