Gene RPB_3448 details

Gene Information

Locus tag: RPB_3448
Symbol:
ID: 3911250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3953431
End bp: 3955413
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 68%
IMG OID: 637885351
Product: cytochrome c-type biogenesis protein CcmF
Protein accession: YP_487055
Protein GI: 86750559
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1138] Cytochrome c biogenesis factor
TIGRFAM ID: [TIGR00353] c-type cytochrome biogenesis protein CcmF
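
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3953431, 3955413)
print(length, protein_length_aa(length))  # 1983 660
```

Under these assumptions the result agrees with the listed Gene Length (1983 bp) and Protein Length (660 aa).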


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.186269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0456114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCCG AGGCCGGCCA CTACGCGCTG GTGCTGGCGC TGGCGCTGGC GCTGATCCAG 
TCGACGGTGC CGATCCTCGG CGCGCGGCTG GGCGACGGCG GGCTGATGAA CGTCGCGCGC
TCGGCGGCGC TGGCGCAATT CGTCTTCGTC GCCGTCTCGT TCGGCGCGCT GGTCTGGCTC
AACGTCACCT CCGACTTCTC GGTCGTCAAC GTCTACGAGA ATTCGCATTC GGCCAAGCCG
CTGCTCTACA AGATCACCGG CGTGTGGGGG AACCACGAAG GTTCGATGCT GCTGTGGGTG
GCGATCCTGG CGCTGTTCGG CGGCATGGTC GCAGCGTTCG GCAACAACCT GCCGCTGTCG
CTGCGCGCCC ACGTGCTAGC GGTGCAGGCC TGGGTGGCGA GCGCGTTCTA TCTGTTCATC
CTGATCACCT CGAACCCGTT CCTGCGGATT CCCAATCCGC CGCTCGAGGG TCGCGACCTC
AATCCGGTAC TGCAGGACAT CGGCCTCGCG GTGCATCCGC CGCTGCTGTA TCTCGGCTAT
GTCGGCTTCT CGATCTCGTT CTCCTTCGCC GCCGCGGCGC TGATCGAGGG GCGGCTCGAC
GCGGCCTGGG CGCGCTGGGT GCGGCCGTGG ACGCTGATGG CGTGGATCTT CCTGACGCTG
GGCATCGCGA TGGGCTCGTA CTGGGCCTAT TACGAACTCG GCTGGGGCGG CTGGTGGTTC
TGGGACCCGG TCGAGAACGC CTCGCTGATG CCGTGGCTCG CCGGCACCGC GCTGCTGCAT
TCGGCGCTGG TGATGGAGAA GCGCAACGCG CTGAAGGTCT GGACCATCCT GCTGTCGATC
CTGACCTTCT CGCTGTCGCT GCTCGGCACC TTCCTGGTGC GCTCAGGCGT CTTGACCTCG
GTGCACACCT TCGCCACCGA TCCGTCGCGC GGCGTGTTCA TCCTGATGAT CCTGTGCATC
TTCATCGGCG GCAGCCTGGC GCTGTTTGCC TGGCGCGCCT CGGCGCTGAA GCAGGGCGGG
CTGTTCGCGC CGATCTCGCG CGAGGGCGCG CTGGTGCTGA ACAATCTGTT TCTCACCACC
GCCTGCGCCA CGGTGTTCGT CGGCACGCTG TATCCGCTGG CGCTGGAAGT TCTCACCGGC
GACAAGATCT CGGTCGGCGC GCCGTTCTTC AATCTCACCT TCGGTCCGCT GATGGTGCCG
CTGATGCTGG CGATGCCGTT CGGCCCGCTG CTGGCGTGGA AGCGCGGCGA TCTGCTCGGT
GCCGCCCAGC GGCTGATCGC CGCCGGCGTC GTCGCGCTGC TCGCGGTCGC CCTGGTCTGG
GCCTGGACGT TCGGCGGCCC GGTGCTGGCG CCGCTGGCGA TCGGGCTCGC GGTGTTCGTG
ATTGCCGGCG CGCTCGCCGA CATCGTCGAG CGCATCGGCC TGCTGCGCAA TCCGTTGTCG
ATTGCGGCGC GCCGCGCCCG AGGGCTGCCG CGCTCGGCGT GGGGCACGCT GTTCGCTCAT
GCCGGCATCG GCGTGGCGCT GATCGGCATC GTCTGCGAGA CCACCTGGAA CAGCGAGCAC
ATCGCCGCGA TGAAGGAGGG CGAGTCCGCC AAGCTCGCCG GCTACGAACT CAAATTCGAC
GGCGCGATTC AGCGCCAGGG CCCGAACTAT CGCGAGTTGC AGACGCATTT CTCGGTCAGC
GAGAACGGCC GGCCGATCGG CGCGATGACG CCGTCGAAGC GCAGCTTCAC CACCCGCAAC
ACGTCGACCA CCGAGGCTGC GCTGCTGACC CGCGGCGTCA GCCAGCTCTA CATCTCGCTC
GGCGACATCG ACGCCGCCGG CGCGGTGACG GTGCGGATCT ATCACAAGCC CTTGGTGCTG
CTGATCTGGA TCGGTCCGCT GCTGATGGCG TTCGGCGGCC TGTTGTCACT GTCGGATCGG
CGGCTGCGCG TCGGCGCGCC GAAGCCGGCC AAGCCGCAGC GCTTGCTGCA GCCGGCGGAG
TAA
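
The sequence above is laid out in blocks of 10 bases, so the spacing has to be removed before the length or GC content can be recomputed. The sketch below shows one possible way to do that; the helper names are hypothetical, and the sample string is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned, non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a short sample.
first_line = "ATGATCGCCG AGGCCGGCCA CTACGCGCTG GTGCTGGCGC TGGCGCTGGC GCTGATCCAG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 3))
```

Passing the full gene sequence through the same two functions should give a length that can be compared with the Gene Length field and a fraction that can be compared with the listed GC content.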
 
Protein sequence
MIAEAGHYAL VLALALALIQ STVPILGARL GDGGLMNVAR SAALAQFVFV AVSFGALVWL 
NVTSDFSVVN VYENSHSAKP LLYKITGVWG NHEGSMLLWV AILALFGGMV AAFGNNLPLS
LRAHVLAVQA WVASAFYLFI LITSNPFLRI PNPPLEGRDL NPVLQDIGLA VHPPLLYLGY
VGFSISFSFA AAALIEGRLD AAWARWVRPW TLMAWIFLTL GIAMGSYWAY YELGWGGWWF
WDPVENASLM PWLAGTALLH SALVMEKRNA LKVWTILLSI LTFSLSLLGT FLVRSGVLTS
VHTFATDPSR GVFILMILCI FIGGSLALFA WRASALKQGG LFAPISREGA LVLNNLFLTT
ACATVFVGTL YPLALEVLTG DKISVGAPFF NLTFGPLMVP LMLAMPFGPL LAWKRGDLLG
AAQRLIAAGV VALLAVALVW AWTFGGPVLA PLAIGLAVFV IAGALADIVE RIGLLRNPLS
IAARRARGLP RSAWGTLFAH AGIGVALIGI VCETTWNSEH IAAMKEGESA KLAGYELKFD
GAIQRQGPNY RELQTHFSVS ENGRPIGAMT PSKRSFTTRN TSTTEAALLT RGVSQLYISL
GDIDAAGAVT VRIYHKPLVL LIWIGPLLMA FGGLLSLSDR RLRVGAPKPA KPQRLLQPAE
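
As a rough check that the protein sequence corresponds to the gene sequence, the DNA can be translated codon by codon. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two are understood to differ only in which codons may act as start codons); the `translate` helper is illustrative and does not handle ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGATCGCCG AGGCCGGCCA CTACGCGCTG GTGCTGGCGC TGGCGCTGGC GCTGATCCAG"
print(translate(first_line))  # MIAEAGHYALVLALALALIQ
```

The output matches the first 20 residues of the protein sequence listed above.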