Gene RPC_4472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4472 
Symbol 
ID3972484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4973976 
End bp4975532 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content64% 
IMG OID637927583 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_534314 
Protein GI90425944 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAGC TGGTACAACT GCTGGAGTTT GGGGCGCCCG GCGCGAAGTC GCTCGAGGAA 
ATTCGGCAGC AGGTCGCCAG CGCAGGCTGC AGTTCGAAGG GCGGCAGCGG CAAGTCGAGC
TGCGGCTCGG CAGCCGGGCA GGGCGATCTC GCCCCCGAGG TCTGGGAGAA GGTCAAGAAC
CATCCCTGCT ACAGCGAACA GGCGCATCAT CACTTCGCCC GCATGCATGT CGCGGTGGCG
CCGGCCTGCA ACATCCAGTG CAACTACTGT AATCGCAAAT ACGATTGCGC CAACGAATCG
CGCCCCGGCG TGGTCAGCGA GAAGCTGACT CCCGAACAGG CGGCGAAGAA GGTGCTGGCG
GTCGCCTCCA GCGTGCCGCA GATGACCGTG CTCGGCGTCG CCGGCCCCGG CGATCCGTTG
GCCAATCCGC AAAAGACCTT CAAGACCTTC GAACTGGTGT CGGCGACTGC GCCGGACATC
AAGCTGTGCC TGTCCACCAA CGGCCTGATG CTGCCCGACC ACGTCGATCG CATCCAGGCG
ATGAACGTCG ACCACGTCAC CATCACCATC AACATGATCG ACCCGGAAAT CGGCGCCAAG
ATCTATCCGT GGATCTTCTA CAACCACCGC CGCTACGAGG GCGTCGAGGC GTCGAAGATC
CTCAGCGAGC GGCAATTGCT TGGGCTCGAA ATGCTCACCG CGCGGGGCAT CCTGGTCAAA
GTCAATTCGG TGATGATCCC CGGCGTTAAC GACAAGCACC TGGTCGAGGT CAACAAGGCG
GTGAAGTCGC GCGGCGCCTT CCTGCACAAC ATCATGCCGC TGATCTCCGA GCCGGAGCAT
GGCACGGTGT ACGGCCTGTC CGGGCTGCGT GGCCCGTCGG CGCAGGAATT GAAAGCGCTG
CAGGACGCCT GCGAAGGCGA GATGAACATG ATGCGGCATT GCCGGCAGTG CCGCGCCGAC
GCCGTCGGCC TGCTTGGCGA GGACCGCAGC GACGAATTCT CCAACGAGAA GGTCGCGGCG
ATGGAGGTCA CCTACGACCT CGAAGCCCGC AAGGCCTATC AGGCCAAGGT CGAGCTGGAG
CGCGAGGCGA TCGCCAAGGC CAAGACCGCC GAGCTGGTGA AACTCGCCGA CGAGACCAGC
GCCATCAAGA TCCAGGTGGC GATCGCCACC AAGGGCGGTG GCCGGGTCAA CGAGCATTTC
GGCCACGCCC ACGAATTCCA GATCTACGAG GTCTCCACCG CCGGCGCCAA ATTCGTCGGC
CACCGCCGCG TCGATCTGTA TTGCGAAGGC GGCTACGCCT CCGCCGACGG TGCCGCCACC
ATCATCCGCG CCTTGAACGA CTGCACCGCG GTGCTGGTCG CCAAGATCGG CATCTGCCCG
AAGGATTCGC TCGCCGCGGC TGGCATCGAG GCGGTGGAAA CCTACGCCTT CGAGTTCATC
GAGACCTCGG TGATCGCCTA CTTCAAGGAC TACCTGCAGC GCGTCGAGAA GAAGGAAATC
AGCCACGTCG CCAAGGGTGA CGCCGACATT CGTCAGGGGG CATTCACCGA GGCCTGA
 
Protein sequence
MQKLVQLLEF GAPGAKSLEE IRQQVASAGC SSKGGSGKSS CGSAAGQGDL APEVWEKVKN 
HPCYSEQAHH HFARMHVAVA PACNIQCNYC NRKYDCANES RPGVVSEKLT PEQAAKKVLA
VASSVPQMTV LGVAGPGDPL ANPQKTFKTF ELVSATAPDI KLCLSTNGLM LPDHVDRIQA
MNVDHVTITI NMIDPEIGAK IYPWIFYNHR RYEGVEASKI LSERQLLGLE MLTARGILVK
VNSVMIPGVN DKHLVEVNKA VKSRGAFLHN IMPLISEPEH GTVYGLSGLR GPSAQELKAL
QDACEGEMNM MRHCRQCRAD AVGLLGEDRS DEFSNEKVAA MEVTYDLEAR KAYQAKVELE
REAIAKAKTA ELVKLADETS AIKIQVAIAT KGGGRVNEHF GHAHEFQIYE VSTAGAKFVG
HRRVDLYCEG GYASADGAAT IIRALNDCTA VLVAKIGICP KDSLAAAGIE AVETYAFEFI
ETSVIAYFKD YLQRVEKKEI SHVAKGDADI RQGAFTEA