Gene RPB_0971 details


Gene Information

Locus tag: RPB_0971
Symbol:
ID: 3909326
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1117750
End bp: 1119309
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 63%
IMG OID: 637882864
Product: nitrogenase molybdenum-iron protein beta chain
Protein accession: YP_484592
Protein GI: 86748096
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01286] nitrogenase molybdenum-iron protein beta chain
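
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive (consistent with the reported values, but not stated in the record) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record for RPB_0971.
start_bp, end_bp = 1117750, 1119309
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1560
print(protein_length(length_bp))  # 519
```

Both results agree with the Gene Length (1560 bp) and Protein Length (519 aa) reported in the record.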


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA CCGCAGAAAA GATCCGGGAT CATTTCGATC TCTTCCATCA GCCCGAATAC 
GCGGACATGA TGGACAACAA GCGCAAGCAG TTCGAGAACG CCGTCGGCGA AGCCGAAGTC
GCGCGCGTGT CGGATTGGAC CAAGACCAAG GAATATCAGG AGAAGAACTT CGCTCGTGAA
GCCCTGGTCA TCAACCCGGC CAAGGCCTGC CAGCCGCTCG GCGCGGTGTT CGCCGCGGTG
GGCTTCGAGA AGACGCTGCC GTTCGTGCAC GGCTCGCAGG GCTGCGTCGC GTATTATCGC
AGCCACTTCT CGCGGCACTT CAAGGAGCCG ACCTCCTGCG TCTCGTCGTC GATGACCGAG
GACGCCGCGG TGTTCGGCGG CCTCAACAAC ATGATCGACG GCCTGGCCAA TTCCTACGCG
CTGTACAAGC CGAAGATGAT CGCGGTGTCG ACCACCTGCA TGGCCGAAGT GATCGGCGAC
GACCTCAACG CCTTCATCAA GAACGCGAAG GAAAAGGGCT CGGTTCCGCA GGACTTCGAC
GTCACCTACG CCCACACCCC GGCGTTCGTC GGCAGCCACA TCACCGGCTA CGACAACACC
ATGAAGGGCG TGGTCGAGCA CTTCTGGGAC GGCAAGTCCG GCACCACGCC GAAGCTCGAG
CGCCAGCCCA ACGAGTCGGT CAACTTCCTC GGCGGCTTCG ACGGCAACAC CGTCGGCAAC
ATCCGCGAGG TCAAGCGCAT CTTCGAACTG ATGGGCGTCG ACTACACCAT CTTCGGCGAC
AATAGCGACG TCTGGGACAC CCCGGCCGAC GGCGAATTCC GGATGTATGA CGGCGGCACC
ACGCTGGAGC AGGCCGCCAA CGCCATCCAC GCCAAGGGCA CGATCTCGAT GCAGGAATTC
TGCACCGAAA AGACGCTGGC GACGATCGCG GCGCACGGCC AGGAAGTGGT CGCGCTCAAC
AGCCCGATCG GCATCACCGG CACCGACCGC TTTCTGCAGG CGGTGTCGCG GATCACCGGC
AAGGCGATCC CCGAAGCGCT GACCAAGGAG CGCGGCCGGC TGGTCGACGC CATCGGCGAC
TCCTCGGCGC ACATCCACGG CAAGAAGTTC GCGATCTTCG GCGATCCGGA CCTGTGCTAC
GGCCTGGCCG AATTCATCCT CGAACTCGGC GGCGAACCGA CCCACATCCT CGCTACCAAC
GGCAACAAGA ACTGGGAAGT GAAGGTCAAC GAGCTCCTGG CGTCCTCGCC GTTCGGCACG
AACTGCAAGG TCTATCCCGG CAAGGATCTC TGGCACCTGC GCTCGCTGCT GTTCACCGAG
CCGGTCGACT TCATGATCGG CAACACCTAC GGCAAGTATC TCGAGCGCGA CACCGGCACG
CCGCTGATCC GCATGGGCTT CCCGGTGTTC GATCGCCACC ACCATCACCG CTCGCCGATC
TGGGGCTACC AGGGCACGAT GAACGTGCTG GTCAAGATCC TCGACAAGAT CTTCGACGAA
ATGGACAAGG CCACCAACAT CGCCGGCAAG ACCGACCTGT CCTTCGACAT CATCCGCTGA
 
Protein sequence
MTETAEKIRD HFDLFHQPEY ADMMDNKRKQ FENAVGEAEV ARVSDWTKTK EYQEKNFARE 
ALVINPAKAC QPLGAVFAAV GFEKTLPFVH GSQGCVAYYR SHFSRHFKEP TSCVSSSMTE
DAAVFGGLNN MIDGLANSYA LYKPKMIAVS TTCMAEVIGD DLNAFIKNAK EKGSVPQDFD
VTYAHTPAFV GSHITGYDNT MKGVVEHFWD GKSGTTPKLE RQPNESVNFL GGFDGNTVGN
IREVKRIFEL MGVDYTIFGD NSDVWDTPAD GEFRMYDGGT TLEQAANAIH AKGTISMQEF
CTEKTLATIA AHGQEVVALN SPIGITGTDR FLQAVSRITG KAIPEALTKE RGRLVDAIGD
SSAHIHGKKF AIFGDPDLCY GLAEFILELG GEPTHILATN GNKNWEVKVN ELLASSPFGT
NCKVYPGKDL WHLRSLLFTE PVDFMIGNTY GKYLERDTGT PLIRMGFPVF DRHHHHRSPI
WGYQGTMNVL VKILDKIFDE MDKATNIAGK TDLSFDIIR
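
The blocked sequences above can be cross-checked against each other by removing the spacing, translating the gene sequence, and computing the GC fraction. The sketch below is one way to do this and is not the database's own pipeline. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), and it translates every codon literally, which is sufficient here because the gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGAGA CCGCAGAAAA GATCCGGGAT CATTTCGATC TCTTCCATCA GCCCGAATAC"
print(translate(first_line))  # MTETAEKIRDHFDLFHQPEY
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the reported GC content.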