Gene RPD_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1303 
Symbol 
ID4021780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1467464 
End bp1469533 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content62% 
IMG OID637961496 
Productcytochrome c1 
Protein accessionYP_568442 
Protein GI91975783 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex
[COG2857] Cytochrome c1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAC CCTCGACCTA TCAGCCACAA AGTCCTTGGA TGAAGTGGCT TGAACAGCGC 
CTGCCCATCG GGAACTTTGT TCACTCCTCG TTCATTGCGT GGCCGACGCC GCGGAACCTG
AACTACTGGT GGACGTTCGG CGCCATTCTT TCGATGATGC TCGCGCTGCA GATCATCACC
GGCATTGTGT TGGCGATGCA CTACACGCCG CACGTCGACA TGGCCTTCGA CTCGATCGAG
CGCATCGTTC GCGACGTCAA CTACGGCTGG CTGCTGCGCA ACATGCACGC CGCCGGCGCG
TCGATGTTCT TCATCGCGGT CTACATTCAC ATGTTCCGCG GCCTGTATTA CGGGTCGTAC
AAGGCGCCGC GCGAAGTGTT GTGGATCCTC GGCGTGATCA TCTACCTGCT GATGATGGCG
ACCGGTTTCA TGGGCTACGT GCTGCCGTGG GGCCAGATGA GCTTCTGGGG CGCCACCGTG
ATCACCAACC TGTTCTCGGC GATCCCGTTC GTCGGCGACA GCATCGTGAC GCTGCTGTGG
GGCGGCTATT CGGTCGGCAA CCCGACGCTG AATCGCTTCT TCTCGCTGCA CTACCTGCTG
CCCTTCGTGA TCGTCGGCGT CGTCGTGCTG CACATCTGGG CGGTGCACGT CACCGGCCAG
AACAACCCGG CCGGCGTCGA ACCGAAGACC GAGAAGGACA CCGTCCCGTT CACGCCGTAC
GCGACGTTGA AGGACGTGTT CGGCATGTCC TGCTTCCTGA TCTTCTTCTC GTGGTTCATC
TTCTACATGC CGAACTATCT CGGCGAAGCC GACAACTACA TTCCGGCGAA CCCGGGTGTG
ACCCCGCCGC ACATCGTGCC TGAATGGTAC TACCTGCCGT TCTACGCGAT CCTGCGCTCG
ATCCCGGACA AGCTGATGGG CGTCGTTGCG ATGTTCGGCG CGATCATCGT CCTGCTGTTC
CTGCCCTGGC TCGACAGCTC GAAGGTGCGT TCGTCCCGCT ATCGTCCGCT GGCGAAGCGG
TTCTTCTGGG GCTTCGTGGT CTGCTGCCTG GTTCTGGGGT GGCTCGGCTC CAAGCCGGCG
GAAGGCATCT ACACTATGCT TGCCCGCGTC TTCACCTTCT TCTACTTCGC CTACTTCCTG
ATCGTGCTGC CGATGCTGTC ACGGGTCGAG AAGACCCTGC CGCTTCCGAA CTCGATCTCT
GAAGACGTGC TGAACAAGGG CAAGGTGATT GGCACGACGG CGGCGAGCCT GTTCGCCGTG
GTGTTGGCCG GATCCATGAT GTTCGGCGGC GTGCAGAGCG CCAAGGCGGC GGAAGGCGGC
GAGAGCCCGC CCTCGCTGAG CTGGAGTTTT GCTGGCCCGT TCGGCAAGTA CGATCGCGCC
CAATTGCAGC GTGGCTTCAA GGTCTACAAG GAAGTCTGCT CGGCCTGTCA CTCGCTGAAG
TTGCTGCAAT ATCGCAACCT CGCCGAGCCG GGCGGCCCCG GCTTCACCTT GGATCAGGCC
AAAGCGATCG CCGCCGAAGC CTCGATCAAG GACGGCCCCA ACGACGCCGG CGAAATGTTC
GAGCGCCCCG GCCGGCTCGC CGACACCTTC CACTCGCCGT TCCCGAACGA GCAGGCGGCG
CGCGCGGCCA ATGGCGGCGC GGTTCCTCCG GACATGTCGC TGCTCGCCAA GGCTCGCTCC
TATCCGCGTG GCTTCCCGCA GTTCGCGTTC GACCTGTTCA CCCAGTTCCA GGAGCAGGGC
CCGAACTACA TCGACGCGCT GCTGCAGGGC TATCTGGATA CGCCGCCGGA AGGGTTCACG
TTGCCGGACG GGTCGTACTA CAACAAGTGG TTCCCCGGCC ATTCGATCAA GATGCCGCCG
CCGATTTCGG ACGGGCAGGT GACCTATGAC GACGGCAGCC CGCAGACGGT TCAGCAATAC
GCCAAGGACA TCACGTCGTT CCTGATGTGG GCCGCCGAGC CGCATCTCGA AGCCCGCAAG
CGCCTTGGTC TGCAGGTCAT GATCTTCCTG ATCATCCTCA GCGGCCTGCT GTACTTCACC
AAGCGCAAGG TCTGGTCGAA CGCGCACTGA
 
Protein sequence
MSGPSTYQPQ SPWMKWLEQR LPIGNFVHSS FIAWPTPRNL NYWWTFGAIL SMMLALQIIT 
GIVLAMHYTP HVDMAFDSIE RIVRDVNYGW LLRNMHAAGA SMFFIAVYIH MFRGLYYGSY
KAPREVLWIL GVIIYLLMMA TGFMGYVLPW GQMSFWGATV ITNLFSAIPF VGDSIVTLLW
GGYSVGNPTL NRFFSLHYLL PFVIVGVVVL HIWAVHVTGQ NNPAGVEPKT EKDTVPFTPY
ATLKDVFGMS CFLIFFSWFI FYMPNYLGEA DNYIPANPGV TPPHIVPEWY YLPFYAILRS
IPDKLMGVVA MFGAIIVLLF LPWLDSSKVR SSRYRPLAKR FFWGFVVCCL VLGWLGSKPA
EGIYTMLARV FTFFYFAYFL IVLPMLSRVE KTLPLPNSIS EDVLNKGKVI GTTAASLFAV
VLAGSMMFGG VQSAKAAEGG ESPPSLSWSF AGPFGKYDRA QLQRGFKVYK EVCSACHSLK
LLQYRNLAEP GGPGFTLDQA KAIAAEASIK DGPNDAGEMF ERPGRLADTF HSPFPNEQAA
RAANGGAVPP DMSLLAKARS YPRGFPQFAF DLFTQFQEQG PNYIDALLQG YLDTPPEGFT
LPDGSYYNKW FPGHSIKMPP PISDGQVTYD DGSPQTVQQY AKDITSFLMW AAEPHLEARK
RLGLQVMIFL IILSGLLYFT KRKVWSNAH