Gene RPD_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2003 
Symbol 
ID4022485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2240201 
End bp2242183 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content67% 
IMG OID637962196 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_569139 
Protein GI91976480 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.151037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCCG AAGCGGGGCA CTACGCGCTG GTGCTGGCGC TTGCGCTGGC GTTGATCCAG 
TCCACCGTGC CGATCCTCGG CGCGCGGCTG AACGACGGCG GCCTGATGAA TGTCGCGCGC
TCCACGGCGC TCGCACAATT CGCCTTCGTC GCGGCGTCGT TCGGCGCGCT GGTTTATCTC
AACGTCACCT CGGACTTCTC GGTTCTCAAT GTCTACGAGA ACTCGCATTC GGCAAAGCCG
CTGCTCTACA AGATCACCGG CGTGTGGGGA AACCACGAAG GCTCGATGCT GCTGTGGGTC
GCGATCCTGG CGCTGTTCGG CGGTCTGGTC GCTGCATTCG GCAACAATCT GCCGCTTTCG
TTGCGCGCCC ATGTGCTGGC GGTGCAGGCC TGGGTCGCCA GCGCGTTCTA TCTGTTCATC
CTGATCACCT CGAACCCGTT CCTGCGGATT CCCAATCCGC CGATCGAGGG GCGTGACCTC
AATCCGGTAT TGCAGGACAT CGGCCTCGCG GTGCATCCGC CGCTGCTCTA TCTCGGCTAT
GTCGGCTTCT CGATCTCGTT CTCCTTCGCC GCCGCGGCGC TGCTCGAAGG CCGGCTCGAC
GCCGCCTGGG CGCGCTGGGT GCGGCCGTGG ACGCTGATGG CGTGGATCTT CCTGACGCTC
GGCATCGCGA TGGGCTCGTA TTGGGCTTAC TACGAACTCG GCTGGGGCGG CTGGTGGTTC
TGGGACCCGG TCGAGAACGC CTCACTGATG CCATGGCTCG CCGGCACCGC GCTGCTGCAT
TCGGCGCTGG TGATGGAGAA GCGCAACGCG CTGAAGGTCT GGACCATCCT CCTCGCGATC
CTGACGTTCT CGCTGTCGCT GCTCGGCACC TTCCTGGTGC GCTCCGGCGT CTTGACCTCG
GTGCACACCT TCGCCACCGA TCCGTCGCGT GGCGTGTTCA TCCTGGTAAT TCTGTGCGTC
TTCATCGGCG GCAGTCTCGC GCTGTTCGCC TGGCGCGCTT CGGCGTTGAA GCAGGGCGGG
CTGTTCGCAC CGATCTCCCG CGAGGGCGCG CTGGTGCTGA ACAATCTGTT CCTCACCACC
GCCTGCGCCA CGGTGTTCGT CGGTACGTTG TATCCCTTGG CGCTCGAAGT CGTCACCGGT
GACAAGATCT CGGTCGGCGC GCCGTTCTTC AATCTCACCT TCGGGCCGCT GTTCGTGCCG
CTGATGCTGG CGATGCCGTT CGGGCCGCTG CTGGCGTGGA AGCGCGGCGA CATCGTCGGC
GCGGCGCAGC GGCTGATCGC GGCCGGCGTC GTGGCGCTGC TGACGGTCGC CCTGGTCTGG
GCCTGGACCA CCGGCGGCCC GGTGCTCGCG CCGCTGGCGA TCGGGCTCGC GGTGTTCGTG
ATCGCCGGCG CGCTTGCCGA CATCGCCGAA CGCATCGGAC TGTTCCGCAA CCCGCTGTCG
ATCTCCACGC GCCGTGCGCG CGGGCTGCCG CGCTCGGCCT GGGGCACGAT GTTCGCCCAT
GCCGGCGTCG GCATGGCGCT GATCGGCATC GTCTGCGAGA CCACCTGGAA CAGCGAGCAC
ATCGCCGCGA TGAAGGAGGG CGATACAGCG CGGATCGCCG GCTACGAACT CAAATTCGAA
GGCGCGATGC AGCGCCAGGG GCCGAATTTT CGCGAATTGG AGACGCGCTT CTCGATCAGC
GAGGGCGGCC AGCCGGTCGG CGTCATGACG CCGTCGAAGC GCAGCTTCAC CACCCGCGGC
ACCTCGACCA CCGAGGCGGC ACTGCTGACC CGTGGCTTCA GCCAGCTTTA CGTCTCGCTC
GGCGAGATCG ACGCCGCGGG CGCGGTGACG GTGCGGATCT ATCACAAACC GATGGTGCTG
CTGATCTGGT TCGGCCCGCT GCTGATGGCG TTCGGCGGCC TGTTGTCGCT GTCCGACCGT
CGCCTGCGCG TCGGCGCACC GAAACCGGCC AAGCCGCAGC GTACGCTGCA GCCGGCGGAG
TAG
 
Protein sequence
MIAEAGHYAL VLALALALIQ STVPILGARL NDGGLMNVAR STALAQFAFV AASFGALVYL 
NVTSDFSVLN VYENSHSAKP LLYKITGVWG NHEGSMLLWV AILALFGGLV AAFGNNLPLS
LRAHVLAVQA WVASAFYLFI LITSNPFLRI PNPPIEGRDL NPVLQDIGLA VHPPLLYLGY
VGFSISFSFA AAALLEGRLD AAWARWVRPW TLMAWIFLTL GIAMGSYWAY YELGWGGWWF
WDPVENASLM PWLAGTALLH SALVMEKRNA LKVWTILLAI LTFSLSLLGT FLVRSGVLTS
VHTFATDPSR GVFILVILCV FIGGSLALFA WRASALKQGG LFAPISREGA LVLNNLFLTT
ACATVFVGTL YPLALEVVTG DKISVGAPFF NLTFGPLFVP LMLAMPFGPL LAWKRGDIVG
AAQRLIAAGV VALLTVALVW AWTTGGPVLA PLAIGLAVFV IAGALADIAE RIGLFRNPLS
ISTRRARGLP RSAWGTMFAH AGVGMALIGI VCETTWNSEH IAAMKEGDTA RIAGYELKFE
GAMQRQGPNF RELETRFSIS EGGQPVGVMT PSKRSFTTRG TSTTEAALLT RGFSQLYVSL
GEIDAAGAVT VRIYHKPMVL LIWFGPLLMA FGGLLSLSDR RLRVGAPKPA KPQRTLQPAE