Gene RPD_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3736 
Symbol 
ID4024252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4173062 
End bp4174348 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content65% 
IMG OID637963940 
Productlight-independent protochlorophyllide reductase subunit N 
Protein accessionYP_570858 
Protein GI91978199 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01279] light-independent protochlorophyllide reductase, N subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.255529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00206765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGTTC ATGTCTCCAA TTGCGCCGCG ACAGCGGAAG ACCCCGTCTC GCGCGAAATC 
CGCACCGAGA GCGGCCAGCG CGAAGTGTTC TGCGGCCTCA CCGGAATCGT CTGGCTGCAT
CGCAAGATTC AGGACGCGTT CTTCCTCGTG GTCGGCTCGC GCACCTGCGC GCATCTGATC
CAGTCGGCTG CCGGCGTGAT GATTTTTGCC GAGCCGCGGT TCGGCACTGC GATCATGGAA
GAGAAGGATC TCGCCGGTCT CACCGACGCC AATATCGAGC TCGACCGGAT CGTCACACAA
TTGCTGACGC GGCGGCCCGA CATCAAGTTG CTGTTTCTGG TCGGCTCTTG CCCGTCCGAA
GTGATCAAGC TCGATTTGTC GCGGGCGGCG CTGCGGCTGT CACAGCGCTT CTCGCCCGGC
GTGCGTATCC TGAACTACTC GGGCAGTGGC ATCGAGACCA CCTTCACCCA GGGCGAGGAT
GCCTGCCTCG CGTCACTGGT GCCGGAGTTG CCCGCCGCGC AGGACGAGAA GTCGTCGCTC
CTGGTGGTCG GCTCGCTCGC CGACGTCGTC GAAGATCAGT TCATGCGGAT GTTCGATGCG
CTCGGCATCG GTCCCGTGCA GTTCTTCCCG CCGCGCAAAT CGACCGCGCT GCCGAGCGTC
GGTCCGAATA CCAAGATCCT GATGGCGCAG CCGTTCCTGC CGGATACGGT GCGTGCGCTG
CAGGAACGCG GCGCCAAGCG GCTGGCCGCG CCGTTCCCGC TCGGGGTTGA AGGCACCACC
GGCTGGCTGC GTGCCGCGGC CGACGCGTTC GGAGTCGATC CTGCGCATTT CGACAAGGTC
ACCGGTCCGA ACCGCGCTCG CGCCGAACGC GCGCTTGCGG CTTACCGGAC CGAACTCGCA
GATCGTCGTA TCTTCTTCTT CCCCGACTCC CAGCTCGAGA TTCCGCTGGC GCGTTTCCTG
TCGCGCGAGC TGTCGATGAA GCTGGTCGAA GTCGGCACGC CCTATCTGCA TCGCGAGCAT
CTCGCGGAAG AGTTGAAGCT GCTGCCCGCC GGCGTCGCGA TAACAGAAGG TCAGGACGTC
GACCTTCAGC TCGACCGCTG CCGGCTCGCG CGTCCCGACA TCGTGGTGTG CGGTCTCGGC
CTTGCCAATC CGCTCGAAGC CGAAGGCATC ACGACCAAAT GGTCGATCGA ACTCGTGTTC
ACCCCGATCC AGGGGTACGA GCAGGCGGCC GACCTCGCTG AATTGTTCGC GCGTCCGCTC
GTGCGCCGCG CCAAGCTGGT GGCCTGA
 
Protein sequence
MTVHVSNCAA TAEDPVSREI RTESGQREVF CGLTGIVWLH RKIQDAFFLV VGSRTCAHLI 
QSAAGVMIFA EPRFGTAIME EKDLAGLTDA NIELDRIVTQ LLTRRPDIKL LFLVGSCPSE
VIKLDLSRAA LRLSQRFSPG VRILNYSGSG IETTFTQGED ACLASLVPEL PAAQDEKSSL
LVVGSLADVV EDQFMRMFDA LGIGPVQFFP PRKSTALPSV GPNTKILMAQ PFLPDTVRAL
QERGAKRLAA PFPLGVEGTT GWLRAAADAF GVDPAHFDKV TGPNRARAER ALAAYRTELA
DRRIFFFPDS QLEIPLARFL SRELSMKLVE VGTPYLHREH LAEELKLLPA GVAITEGQDV
DLQLDRCRLA RPDIVVCGLG LANPLEAEGI TTKWSIELVF TPIQGYEQAA DLAELFARPL
VRRAKLVA