Gene RPC_4461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4461 
Symbol 
ID3973087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4966004 
End bp4967560 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content62% 
IMG OID637927572 
Productnitrogenase molybdenum-iron protein beta chain 
Protein accessionYP_534303 
Protein GI90425933 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01286] nitrogenase molybdenum-iron protein beta chain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA ACGCAGAAAA GATCCTCGAC CACTTCGACC TGTTCGCCAA GCCAGACTAC 
AAGGAAATGC TGGAAGGCAA GCGCAAGAAC TTCGAAAATC CGGTCTCGGA CGCCGAACTC
GACCGCGTGC GCGAATGGAC CAAGACGCCG GAATATCAGG AAAAGAACTT CGCCCGTGAA
GCCCTGGTGA TCAACCCGGC CAAGGCCTGT CAGCCGCTCG GCGCCGTGTT CGCCGCGGTC
GGCTTCGAGA AGACCTTGCC GTTCGTGCAC GGCTCGCAGG GCTGCGTCGC TTACTATCGC
AGCCACTTCT CCCGCCACTT CAAGGAGCCG ACCTCGTGCG TCTCCTCGTC GATGACGGAA
GACGCCGCGG TGTTCGGTGG CCTCAACAAC ATGGTCGACG GCCTGGCCAA CGCGCACGCG
CTGTACAAGC CGAAGATGAT CGCGGTCTCC ACCACCTGCA TGGCCGAAGT GATCGGCGAC
GACCTCAACG CCTTCATCAA GGGCGCCAAG GAAAAGGGCT CGGTCCCGGC CGATTTCGAC
GTCACCTTCG CCCACACCCC GGCCTTCGTC GGCAGCCACA TCACCGGCTA CGACAACGCG
ATGAAGGGCA TCGTCGAGCA TTTCTGGGAC GGCAAGGCCG GCACCGCGCC GAAGCTCGAG
CGTCAGCCCA ACGGTTCGAT CAACTTCATC GGCGGCTTCG ACGGCTACAC CGTCGGCAAC
ATGCGCGAGA TCAAGCGTCT GTTCGACCTG ATGGGGATCA AGTACACCAT CTTCGGCGAC
AACAGCGACG TCTGGGATAC CCCTGCGGAC GGTGAATTCC GGATGTATGA CGGCGGCACC
ACGCTGGAAG ACGCCGCCAA CGCCATCCAC GCCGTCGGCA CCATCTCGAT GCAGGAATTC
TGCACCGAAA AGACCCTGGC GACCATCGCC GCCCACGGCC AGGAAGTCGC CGCCTTCAAC
CATCCGATCG GCCTGTCCGG CACCGACAAG TTCCTGCAGG AAGTCTCGCG GCTGACCGGC
GTGTCGATCC CCGACGCGAT CACCAAGGAG CGTGGCCGGC TGGTCGACGC CATCGGCGAC
TCCAGCGCCC ACATCCACGG CAAGAAGTTC GCGATCTTCG GCGATCCGGA TCTCTGCCTC
GGTCTGGCGG CGTTCCTGCT CGAACTCGGC GCCGAACCGA CCCACATCGT CGCCACCAAC
GGCAACAAGG CGTGGGAAGA AAAGGTCAAG CAGCTGCTCG CCTCCTCGCC GTTCGGCGCC
AACTGCAAGG CCTATCCGGG CAAGGATCTG TGGCACCTGC GCTCGCTGCT GTTCACCGAA
CCGGTCGACT TCATGATCGG CTCGACCTAC GGCAAGTATC TCGAGCGCGA CACCAATACC
CCGCTGATCC GCATCGGCTT CCCGATCGCC GATCGTCACC ATCACCACCG CTACCCGGTG
TGGGGTTATC AGGGCTCGCT CAACGTGCTG GTCAAGATCC TCGATAAGAT CTTCGACGAA
ATCGACAAGA ACGTCGTCGC CGGCAAGAAC GACATCAGCT TCGACATCAT CCGCTGA
 
Protein sequence
MSQNAEKILD HFDLFAKPDY KEMLEGKRKN FENPVSDAEL DRVREWTKTP EYQEKNFARE 
ALVINPAKAC QPLGAVFAAV GFEKTLPFVH GSQGCVAYYR SHFSRHFKEP TSCVSSSMTE
DAAVFGGLNN MVDGLANAHA LYKPKMIAVS TTCMAEVIGD DLNAFIKGAK EKGSVPADFD
VTFAHTPAFV GSHITGYDNA MKGIVEHFWD GKAGTAPKLE RQPNGSINFI GGFDGYTVGN
MREIKRLFDL MGIKYTIFGD NSDVWDTPAD GEFRMYDGGT TLEDAANAIH AVGTISMQEF
CTEKTLATIA AHGQEVAAFN HPIGLSGTDK FLQEVSRLTG VSIPDAITKE RGRLVDAIGD
SSAHIHGKKF AIFGDPDLCL GLAAFLLELG AEPTHIVATN GNKAWEEKVK QLLASSPFGA
NCKAYPGKDL WHLRSLLFTE PVDFMIGSTY GKYLERDTNT PLIRIGFPIA DRHHHHRYPV
WGYQGSLNVL VKILDKIFDE IDKNVVAGKN DISFDIIR