Gene RPD_3571 details


Gene Information

Locus tag: RPD_3571
Symbol:
ID: 4024085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3975928
End bp: 3977088
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 63%
IMG OID: 637963775
Product: radical SAM family protein
Protein accession: YP_570695
Protein GI: 91978036
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH
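
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3975928
end_bp = 3977088
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon that is not translated.
expected_protein_length = span // 3 - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(expected_protein_length == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions both comparisons hold for this record.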


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.22168
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATAC CGTTTCACAA GGAGATCCGG ATCGGCGGCT ATCTGCTCAA GCAGAAGCTG 
CTGGGCCGCA AGCACTACCC GCTGGTGCTG ATGCTCGAGC CGCTGTTCCG TTGCAATCTG
GCCTGCGTCG GCTGCGGCAA GATCGATTAT CCCGATGCGA TCCTGAACCG CCGGATGTCC
GCGCAGGAAT GCTGGGACGC CGCCGAGGAA TGCGGCGCGC CGATGGTTGC GATCCCGGGC
GGCGAGCCGC TGATCCACAA GGAGATCGGC GAGATCGTGC GCGGCCTGGT GGCGCGCAAG
AAGTTCGTGT CGCTGTGCAC CAACGCGCTG CTGCTCGAGA AGAAGCTGCA CCTGTTCGAG
CCGTCGCCGT TCCTGTTTTT CTCGGTGCAT CTCGACGGCC TGAAGGATCA TCACGACAAG
GCGGTGTCGC AGGCCGGCGT GTTCGATCGC GCGGTGTCGG CGATCAAGGC GGCGAAGGCC
AAGGGCTTCA CCGTCAACGT CAACGCGACG ATCTTCGACA ACCATCCGGC CGAGGAGATC
GCCAAGTTCC TCGACTTCAC GACCGAACTC GGCGTCGGCG TCTCGATGTC ACCGGGCTAC
GCTTATGAGC GTGCGCCCGA TCAGGAGCAC TTCCTGAACC GGACCAAGAC GAAGAAACTG
TTCCGCGACG TTTTCGCGCT CGGCAAGGGC AAGAAGTGGA ACTTCATGCA TTCCGGCCTG
TTTCTGGACT TCCTTGCCGG AAATCAGGAG TTCGAATGCA CCCCGTGGGG AATGCCCGCG
CGCAATATCT TCGGCTGGCA GAAGCCGTGC TACCTGCTCG GTGAAGGCTA CACCAAGACC
TTCAAGGAGC TGATGGAGAC CACCAACTGG GATTCCTACG GCACCGGCAA GTACGAGAAG
TGCGCGGACT GCATGGCGCA TTGCGGCTAC GAGCCGACCG CGGCGACGGC GTCGCTGAAC
AATCCGCTGA AGGCGGCCTG GGTCGCGTTG CGCGGGATCC GGACCTCGGG TCCGATGGCG
CCCGAGATCG ATCTGTCGAA CCAGCGTCCG GCTCAGTACA TCTTCGCCGA GCAGGTGCAG
AAGACGCTGT CAGAGATCCG CCGCGACGAG GCTGCCGCTG CCAATCACGG CGCCAAGCAC
GAAGCTTCGA CAGCCGCGTA G
 
Protein sequence
MAIPFHKEIR IGGYLLKQKL LGRKHYPLVL MLEPLFRCNL ACVGCGKIDY PDAILNRRMS 
AQECWDAAEE CGAPMVAIPG GEPLIHKEIG EIVRGLVARK KFVSLCTNAL LLEKKLHLFE
PSPFLFFSVH LDGLKDHHDK AVSQAGVFDR AVSAIKAAKA KGFTVNVNAT IFDNHPAEEI
AKFLDFTTEL GVGVSMSPGY AYERAPDQEH FLNRTKTKKL FRDVFALGKG KKWNFMHSGL
FLDFLAGNQE FECTPWGMPA RNIFGWQKPC YLLGEGYTKT FKELMETTNW DSYGTGKYEK
CADCMAHCGY EPTAATASLN NPLKAAWVAL RGIRTSGPMA PEIDLSNQRP AQYIFAEQVQ
KTLSEIRRDE AAAANHGAKH EASTAA
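
As a rough consistency check between the two sequences, the sketch below translates the first 30 bases and the last 21 bases of the gene sequence and compares them with the start and end of the protein sequence. It relies on translation table 11 sharing its amino-acid assignments with the standard genetic code (the two differ in their permitted start codons). The helper `translate` and the variable names are illustrative and are not part of the record.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces are ignored, '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups and the final line of the gene sequence above.
first_bases = "ATGGCAATAC CGTTTCACAA GGAGATCCGG"
last_bases = "GAAGCTTCGA CAGCCGCGTA G"

print(translate(first_bases))  # MAIPFHKEIR
print(translate(last_bases))   # EASTAA*
```

The two fragments match the first ten and last six residues of the protein sequence, with the trailing `*` corresponding to the TAG stop codon. This checks only the ends of the sequence, not the full translation.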