Gene RPD_2001 details

Gene Information

Locus tag: RPD_2001
Symbol:
ID: 4022483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2238428
End bp: 2239585
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 71%
IMG OID: 637962194
Product: cytochrome c-type biogenesis protein cycH
Protein accession: YP_569137
Protein GI: 91976478
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4235] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR03142] cytochrome c-type biogenesis protein CcmI
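
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2238428, 2239585)
print(length)                              # 1158
print(expected_protein_length_aa(length))  # 385
```

Under these assumptions the coordinates reproduce the listed 1158 bp, and 386 codons minus the stop codon give the listed 385 aa.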


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.224163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCGC CGCAGTCCCG CGTCGGCGCA TTATCACCCG GAACCCCCAA TCTTATGGCG 
CTCTGGTTTG TGTTCGCGAT CATGACGGCC ATCGCGGCGT TCGCGGTGTT GTGGCCGCTC
GGGCGCGGCG TCCGGACGCT GCGCGAAGGC ACCGAGGCCA ACGTCTACAA GGACCAGCTC
GCCGAGGTCG ATCGCGACGC CGAAGCCGGG CTGATCGGCG CCCCCGAAGC ATCCGCCGCA
CGCGTCGAGA TCGGACGCCG GCTGCTCGCC AGCGCCGAGG TCGAGCGCCC GGCTTTGTTC
ACAACGAGCC GCGGCCTGCG GCGGGGCGTC GCGGTGTTCG CGCTGATCGG CCTGCCGGCG
CTGGCGCTGG CGGTGTATCT GCCGCTCGGC TCGCCGATGC AGGGCGACGA TCCGCTGGCG
CAGCGCACCA AGACCGCGGC GGCGTCGCAG CCGCTCGAGA ATCTGGTGGC GCAGGTCGAG
GCGCATCTCG AGAAGAACCC GACCGACGGC CGCGGCTGGA CCGTGCTGGC GCCGGTGCTC
TTTAAGCTCA GCCGCTTCGA GGACGCCGCG CGCGCCTACC GCAATTCGAT CAACTACGCC
GGCGAGACAG CGGATCGGCG CGCCGATCTC GGCGAGGTGC TGTCGATGGC GGCGGGCGGC
GTGGTCACCG CCGAGGCCAA GTCGGAATTC GAGCGCGCGG TGAAGCTGAA CGCCGACGAC
CCGAAGGCGC GCTACTTCCT CGGCCTCGCC GCCGAGCAGG ACGGCCGTCC GAAGGATGCC
GCGGCGATCT GGCGGGCGAT GCTGGAGAAG GCGCCGGCCG AAGCGCCGTG GAAGCCGATG
CTGCAGGCGC AGCTCGCGCG CCTCGAGGGA ACGCCGTTGC CGGCGTTGCC GGACGACACG
ATCGCATCGG CGCAAGGCAT GAGCGAGGGC GACCGCAACG CGATGATCCG CGGCATGGTC
GACAAGCTCG CCGCCCGGTT GCAGCAGAAC GGCGACGACG TCGAGGGATG GCTGCGGCTG
GTGCGCGCCT ACATGGTGCT CGGCGACGCC GACAAGGCCA GGAGCGCGCA GGCGCAAGCG
CGGCAGGCGG TGGCCGGCAA CGCCGAGCGG CTCAAGCAGC TCAATGACGG GCTCAAGAAC
CTCGGGCTTG ACGGATGA
 
Protein sequence
MNAPQSRVGA LSPGTPNLMA LWFVFAIMTA IAAFAVLWPL GRGVRTLREG TEANVYKDQL 
AEVDRDAEAG LIGAPEASAA RVEIGRRLLA SAEVERPALF TTSRGLRRGV AVFALIGLPA
LALAVYLPLG SPMQGDDPLA QRTKTAAASQ PLENLVAQVE AHLEKNPTDG RGWTVLAPVL
FKLSRFEDAA RAYRNSINYA GETADRRADL GEVLSMAAGG VVTAEAKSEF ERAVKLNADD
PKARYFLGLA AEQDGRPKDA AAIWRAMLEK APAEAPWKPM LQAQLARLEG TPLPALPDDT
IASAQGMSEG DRNAMIRGMV DKLAARLQQN GDDVEGWLRL VRAYMVLGDA DKARSAQAQA
RQAVAGNAER LKQLNDGLKN LGLDG
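
The sequences above are laid out in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a GC-fraction calculation; the helper names are hypothetical, and the record does not say how its GC content figure was computed, so treat this as one plausible method.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAACGCGC CGCAGTCCCG CGTCGGCGCA TTATCACCCG GAACCCCCAA TCTTATGGCG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` and comparing `len(...)` with the listed gene length is a quick way to confirm that nothing was lost when copying the record.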