Gene RPD_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1063 
Symbol 
ID4021539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1218030 
End bp1219589 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content64% 
IMG OID637961255 
Productnitrogenase cofactor biosynthesis protein NifB 
Protein accessionYP_568202 
Protein GI91975543 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID[TIGR01290] nitrogenase cofactor biosynthesis protein NifB 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAGA AACTTCTCAC GCTGCACGAT TTCGGCGCCC CAGGTGCGAC GTCGTTCGAT 
GAATTGCGGC GTAGCGCCGC GCAGTCCGGC TGTAGCAGCA CGGGTGGTAG CGGAAAGTCC
GGCTGCGGCT CGGCGGCGGG CCAGGGCGAC CTGCCGCCGG ACGTCTGGGA GAAGGTGAAG
AACCATCCCT GCTACAGCGA GCAGGCGCAT CACCACTTCG CCCGCATGCA TGTCGCCGTC
GCGCCGGCCT GCAACATCCA GTGCAATTAC TGCAATCGCA AGTATGATTG CGCCAACGAA
TCGCGTCCGG GTGTCGTCAG CGAGAAGCTG TCGCCGGAGC AGGCGGCCCG CAAGGTGATC
GCCGTCGCCT CGACGATTCC GCAGATGACC GTGCTTGGCG TCGCCGGCCC GGGCGATCCG
CTCGCCAACC CGGCGAAGAC CTTCAAGACC TTCGAGCTGG TGTCCGAGAC CGCGCCCGAC
ATCAAGCTGT GCCTGTCGAC CAACGGCCTG ACGCTGCCCG ATCACGTCGA GCGCATCGTC
GCGATGAAAG TCGATCACGT CACCATCACG ATCAACATGA TCGACCCGGA GATCGGCGCG
CAGATCTATC CGTGGATCTT CTACGACCAC CGCCGCATCA CCGGCGTCGA GGCGTCAAAG
ATCCTCAGCG AGCGGCAATT GCTCGGGCTC GAGATGCTCA CCGCGCGCGG CATCCTGGTC
AAGGTCAACT CGGTGATGAT CCCGGGGATC AACGATCGGC ATCTGATCGA GGTCAACAAA
GCGGTGAAAT CGCGCGGCGC CTTCCTGCAC AACATCATGC CGCTGATCTC CGCGCCGGAG
CACGGCACGG TGTTCGGCCT CGAAGGCCGC CGCGGCCCGT CCGCGCAGGA GCTGAAGGCG
CTCCAGGATG ATTGCGAGGG CGAAATGAAC ATGATGCGGC ATTGTCGGCA ATGCCGCGCC
GACGCGGTCG GCCTGCTCGG TGAGGACCGC AGCGCCGAAT TCACCACCAA CAAGGTGATG
GAGATGGAGG TCACATACGA TCTCGCCGCG CGCCAAGCCT ATCAGGCCAA GGTCGAGGCC
GAGCGCGACG CCATCGCTGG CGCCAAGCAG CGCGAGATGG CGACGCTCGC CGATGAGACC
GCCTCGATCA AGATCCAGGT CGCGATCGCG ACCAAGGGCG GCGGCGTGAT CAACGAGCAC
TTCGGCCATG CTCATGAATT CCAGATCTAC GAGGTCTCGA CCGCCGGCGC CAAGTTCATC
GGCCACCGCC GCGTCGATCT GTATTGCGAG GGCGGCTACG CCAGCGACAC CGGGATCGAG
CCGATTCTGA AGGCGCTGAA CGACTGCGCC GCGGTGCTGG TCGCCAAGAT CGGCCTCTGT
CCGAAGGAGT CGCTGGCCGG CGCCGGCATC GAGGCGGTCG AGACCTATGC CTTCGACTAT
ATCGAGCAAT CGGCGATTGC CTATTTCAAG GACTACCTCG ATCGCGTCGG CAAGTCGGAG
ATCAGTCACG TCCAGCGTGG CGATGCCGAC ATTCGCCAGG GCGCATTCGT CACGAGCTGA
 
Protein sequence
MQKKLLTLHD FGAPGATSFD ELRRSAAQSG CSSTGGSGKS GCGSAAGQGD LPPDVWEKVK 
NHPCYSEQAH HHFARMHVAV APACNIQCNY CNRKYDCANE SRPGVVSEKL SPEQAARKVI
AVASTIPQMT VLGVAGPGDP LANPAKTFKT FELVSETAPD IKLCLSTNGL TLPDHVERIV
AMKVDHVTIT INMIDPEIGA QIYPWIFYDH RRITGVEASK ILSERQLLGL EMLTARGILV
KVNSVMIPGI NDRHLIEVNK AVKSRGAFLH NIMPLISAPE HGTVFGLEGR RGPSAQELKA
LQDDCEGEMN MMRHCRQCRA DAVGLLGEDR SAEFTTNKVM EMEVTYDLAA RQAYQAKVEA
ERDAIAGAKQ REMATLADET ASIKIQVAIA TKGGGVINEH FGHAHEFQIY EVSTAGAKFI
GHRRVDLYCE GGYASDTGIE PILKALNDCA AVLVAKIGLC PKESLAGAGI EAVETYAFDY
IEQSAIAYFK DYLDRVGKSE ISHVQRGDAD IRQGAFVTS