Gene RPC_3974 details

Gene Information

Locus tag: RPC_3974
Symbol:
ID: 3969397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4425481
End bp: 4427361
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 65%
IMG OID: 637927078
Product: peptidase M3B, oligoendopeptidase-like clade 3
Protein accession: YP_533819
Protein GI: 90425449
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1164] Oligoendopeptidase F
TIGRFAM ID: [TIGR02290] oligoendopeptidase, pepF/M3 family
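
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 4425481
end_bp = 4427361
reported_gene_length_bp = 1881
reported_protein_length_aa = 626

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, leftover_bases = divmod(gene_length_bp, 3)
expected_protein_length_aa = codons - 1

print(gene_length_bp)              # 1881
print(leftover_bases)              # 0
print(expected_protein_length_aa)  # 626
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.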


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.221188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.896374
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAAG CAGCGACCTC CCGAGCCAGC AAATCCGTCC GCAAACCGGC CCCGCGCACC 
GGCGCCGCCA AGAAGACCCC GCCGAGCCGC AAGCCGCAAG CCAAGGCCGC GCAACTGCCG
GAATGGGATT TGGCCGATCT CTATACCAGC ATCGATTCGC CGGAGGTGGC GCGCGATCTC
GACAAGATCG ACGCCGACTG CATCGCCTTC GAGCACGACT ACAAGGGCAA GCTCGCCGAG
GAGACCGCCA AGGGCGGCGG CGGCGTCTGG CTCGCCGAGG CGGTGGCGCG CTACGAGGCG
ATCGACGATC TCGCCGGACG GCTGGCCTCC TATGCGGGGC TCATTCATGC CGGCGACAGC
GTCGATCCGA AGCTGTCGAA ATTCTACGGC GACGTCTCCG AGCGGCTGAC CGCGGCCTCG
GTGCATCTTT TGTTCTTCGC GCTCGAACTC AACCGGGTCG ACGACGCGGT GATGGAGATG
GCGATGCAGG CCCCCGAGCT CGGGCATTAC CGGCCGTGGA TCGAGGACAG CCGCAAGGAC
AAGCCGTATC AGCTCGAGGA CCGCATCGAG CAATTGTTTC ACGAAAAGTC GCAGACCGGC
TACGGCGCCT TCAACCGGTT GTTCGACCAG ACCATTTCGG CGCTGCGCTT CAAGCTCGGG
GCGAAGGAAT TGGCGATCGA GCCGACGCTG ACGCTGCTGC AGGACCGCGA TCCGCAAAAG
CGCAAGGCGG CGGGGCAGGC GCTGGCCAAG ACCTTCAAGG CCAATGAGCG CACCTTCGCG
TTGATCACCA ATACGCTGGC CAAGGACAAG GAAATCTCCG ACCGCTGGCG CGGTTTCGAG
GACGTCGCGG ATTCCCGGCA TCTGGCCAAC CGGGTCGAGC GCGAGGTGGT CGACGCGCTG
GTCGCCTCGG TGCGCGCCGC CTATCCGAAG CTGTCGCATC GCTATTATGC GTTGAAGGCG
CGCTGGTTCG GCAAGAAGCA ATTGGCGCAT TGGGACCGCA ACGCGCCGCT GCCGTTCGCC
GCCACCGGCA CGATCGGCTG GCCGGAAGCC AAGGATATGG TGCTGACCGC CTACACGGCG
TTCTCGCCGG AGATGGCGAA GATCGCCGAG CGCTTCTTCA CCGATCGCTG GATCGACGCG
CCGGTGCGGC CCGGCAAGGC GCCGGGCGCG TTCTCGCATC CGACCACGCC CTCGGCGCAT
CCTTATGTGC TGATGAACTA TCAGGGCAAG CCGCGCGACG TGATGACGCT CGCCCATGAA
CTCGGCCACG GCGTGCACCA GGTGCTGGCG GCGAAGAACG GCGCCTTGAT GGCGCCGACG
CCGCTGACGC TGGCGGAGAC CGCGAGCGTG TTCGGCGAGA TGCTGACCTT CAAGCGGCTG
CTCGGCCAGA CCAAGAGCTT AAAGCAGCGT CAGGCGCTGC TCGCCGGCAA GGTCGAGGAC
ATGATCAACA CCGTGGTGCG GCAGATCGCG TTCTATTCGT TCGAGCGCGC GATCCACACC
GAGCGCCGCA ACGGCGAATT GACCGCGCAG CGGATCGGCG AGATCTGGCT CAGCGTGCAG
GGCGAGAGCC TTGGCCCCGC GATCGACATC AAGCCGGGCT ACGAGAACTT CTGGATGTAC
ATCCCGCACT TCATCCATTC GCCGTTCTAC GTCTACGCCT ATGCGTTCGG GGATTGCCTG
GTGAACTCGC TCTACGCGGT TTACGAGAAC GCCCAGGAAG GCTTTGCCGA ACGCTATCTG
GCGATGCTGT CGGCCGGCGG CACCAAGCAT TACTCCGAAC TGCTGCAGCC GTTCGGGCTC
GACGCCCGCG ATCCGACCTT CTGGGACGGC GGGCTGTCGG TGATCGCCGG GATGATCGAT
GAATTGGAGG CAATGGGGTA G
 
Protein sequence
MAKAATSRAS KSVRKPAPRT GAAKKTPPSR KPQAKAAQLP EWDLADLYTS IDSPEVARDL 
DKIDADCIAF EHDYKGKLAE ETAKGGGGVW LAEAVARYEA IDDLAGRLAS YAGLIHAGDS
VDPKLSKFYG DVSERLTAAS VHLLFFALEL NRVDDAVMEM AMQAPELGHY RPWIEDSRKD
KPYQLEDRIE QLFHEKSQTG YGAFNRLFDQ TISALRFKLG AKELAIEPTL TLLQDRDPQK
RKAAGQALAK TFKANERTFA LITNTLAKDK EISDRWRGFE DVADSRHLAN RVEREVVDAL
VASVRAAYPK LSHRYYALKA RWFGKKQLAH WDRNAPLPFA ATGTIGWPEA KDMVLTAYTA
FSPEMAKIAE RFFTDRWIDA PVRPGKAPGA FSHPTTPSAH PYVLMNYQGK PRDVMTLAHE
LGHGVHQVLA AKNGALMAPT PLTLAETASV FGEMLTFKRL LGQTKSLKQR QALLAGKVED
MINTVVRQIA FYSFERAIHT ERRNGELTAQ RIGEIWLSVQ GESLGPAIDI KPGYENFWMY
IPHFIHSPFY VYAYAFGDCL VNSLYAVYEN AQEGFAERYL AMLSAGGTKH YSELLQPFGL
DARDPTFWDG GLSVIAGMID ELEAMG
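
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below joins the blocks of the first gene sequence line, translates it, and compares the result with the first two protein blocks. It assumes that translation table 11 uses the same codon assignments as the standard code (differing only in permitted start codons). The helper names `join_blocks` and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, and the first two protein blocks,
# copied from the record.
first_gene_line = "ATGGCCAAAG CAGCGACCTC CCGAGCCAGC AAATCCGTCC GCAAACCGGC CCCGCGCACC"
last_gene_line = "GAATTGGAGG CAATGGGGTA G"
first_protein_blocks = "MAKAATSRAS KSVRKPAPRT"

dna = join_blocks(first_gene_line)
peptide = translate(dna)

print(peptide)                                       # MAKAATSRASKSVRKPAPRT
print(peptide == join_blocks(first_protein_blocks))  # True
print(join_blocks(last_gene_line)[-3:])              # TAG
```

The final three bases, TAG, form a stop codon, which is why the protein has one residue fewer than the gene has codons.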