Gene Information |
Locus tag | RPC_3974 |
Symbol | |
ID | 3969397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4425481 |
End bp | 4427361 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637927078 |
Product | peptidase M3B, oligoendopeptidase-like clade 3 |
Protein accession | YP_533819 |
Protein GI | 90425449 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
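
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record, but both are consistent with the reported values. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 4425481
end_bp = 4427361
gene_length_bp = 1881
protein_length_aa = 626


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 4427361 - 4425481 + 1 = 1881,
# and 1881 / 3 = 627 codons, i.e. 626 residues plus one stop codon.
assert span_length(start_bp, end_bp) == gene_length_bp
assert protein_length(gene_length_bp) == protein_length_aa
```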
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.221188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.896374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAG CAGCGACCTC CCGAGCCAGC AAATCCGTCC GCAAACCGGC CCCGCGCACC GGCGCCGCCA AGAAGACCCC GCCGAGCCGC AAGCCGCAAG CCAAGGCCGC GCAACTGCCG GAATGGGATT TGGCCGATCT CTATACCAGC ATCGATTCGC CGGAGGTGGC GCGCGATCTC GACAAGATCG ACGCCGACTG CATCGCCTTC GAGCACGACT ACAAGGGCAA GCTCGCCGAG GAGACCGCCA AGGGCGGCGG CGGCGTCTGG CTCGCCGAGG CGGTGGCGCG CTACGAGGCG ATCGACGATC TCGCCGGACG GCTGGCCTCC TATGCGGGGC TCATTCATGC CGGCGACAGC GTCGATCCGA AGCTGTCGAA ATTCTACGGC GACGTCTCCG AGCGGCTGAC CGCGGCCTCG GTGCATCTTT TGTTCTTCGC GCTCGAACTC AACCGGGTCG ACGACGCGGT GATGGAGATG GCGATGCAGG CCCCCGAGCT CGGGCATTAC CGGCCGTGGA TCGAGGACAG CCGCAAGGAC AAGCCGTATC AGCTCGAGGA CCGCATCGAG CAATTGTTTC ACGAAAAGTC GCAGACCGGC TACGGCGCCT TCAACCGGTT GTTCGACCAG ACCATTTCGG CGCTGCGCTT CAAGCTCGGG GCGAAGGAAT TGGCGATCGA GCCGACGCTG ACGCTGCTGC AGGACCGCGA TCCGCAAAAG CGCAAGGCGG CGGGGCAGGC GCTGGCCAAG ACCTTCAAGG CCAATGAGCG CACCTTCGCG TTGATCACCA ATACGCTGGC CAAGGACAAG GAAATCTCCG ACCGCTGGCG CGGTTTCGAG GACGTCGCGG ATTCCCGGCA TCTGGCCAAC CGGGTCGAGC GCGAGGTGGT CGACGCGCTG GTCGCCTCGG TGCGCGCCGC CTATCCGAAG CTGTCGCATC GCTATTATGC GTTGAAGGCG CGCTGGTTCG GCAAGAAGCA ATTGGCGCAT TGGGACCGCA ACGCGCCGCT GCCGTTCGCC GCCACCGGCA CGATCGGCTG GCCGGAAGCC AAGGATATGG TGCTGACCGC CTACACGGCG TTCTCGCCGG AGATGGCGAA GATCGCCGAG CGCTTCTTCA CCGATCGCTG GATCGACGCG CCGGTGCGGC CCGGCAAGGC GCCGGGCGCG TTCTCGCATC CGACCACGCC CTCGGCGCAT CCTTATGTGC TGATGAACTA TCAGGGCAAG CCGCGCGACG TGATGACGCT CGCCCATGAA CTCGGCCACG GCGTGCACCA GGTGCTGGCG GCGAAGAACG GCGCCTTGAT GGCGCCGACG CCGCTGACGC TGGCGGAGAC CGCGAGCGTG TTCGGCGAGA TGCTGACCTT CAAGCGGCTG CTCGGCCAGA CCAAGAGCTT AAAGCAGCGT CAGGCGCTGC TCGCCGGCAA GGTCGAGGAC ATGATCAACA CCGTGGTGCG GCAGATCGCG TTCTATTCGT TCGAGCGCGC GATCCACACC GAGCGCCGCA ACGGCGAATT GACCGCGCAG CGGATCGGCG AGATCTGGCT CAGCGTGCAG GGCGAGAGCC TTGGCCCCGC GATCGACATC AAGCCGGGCT ACGAGAACTT CTGGATGTAC ATCCCGCACT TCATCCATTC GCCGTTCTAC GTCTACGCCT ATGCGTTCGG GGATTGCCTG GTGAACTCGC TCTACGCGGT TTACGAGAAC GCCCAGGAAG GCTTTGCCGA ACGCTATCTG GCGATGCTGT CGGCCGGCGG CACCAAGCAT TACTCCGAAC TGCTGCAGCC GTTCGGGCTC 
GACGCCCGCG ATCCGACCTT CTGGGACGGC GGGCTGTCGG TGATCGCCGG GATGATCGAT GAATTGGAGG CAATGGGGTA G
|
Protein sequence | MAKAATSRAS KSVRKPAPRT GAAKKTPPSR KPQAKAAQLP EWDLADLYTS IDSPEVARDL DKIDADCIAF EHDYKGKLAE ETAKGGGGVW LAEAVARYEA IDDLAGRLAS YAGLIHAGDS VDPKLSKFYG DVSERLTAAS VHLLFFALEL NRVDDAVMEM AMQAPELGHY RPWIEDSRKD KPYQLEDRIE QLFHEKSQTG YGAFNRLFDQ TISALRFKLG AKELAIEPTL TLLQDRDPQK RKAAGQALAK TFKANERTFA LITNTLAKDK EISDRWRGFE DVADSRHLAN RVEREVVDAL VASVRAAYPK LSHRYYALKA RWFGKKQLAH WDRNAPLPFA ATGTIGWPEA KDMVLTAYTA FSPEMAKIAE RFFTDRWIDA PVRPGKAPGA FSHPTTPSAH PYVLMNYQGK PRDVMTLAHE LGHGVHQVLA AKNGALMAPT PLTLAETASV FGEMLTFKRL LGQTKSLKQR QALLAGKVED MINTVVRQIA FYSFERAIHT ERRNGELTAQ RIGEIWLSVQ GESLGPAIDI KPGYENFWMY IPHFIHSPFY VYAYAFGDCL VNSLYAVYEN AQEGFAERYL AMLSAGGTKH YSELLQPFGL DARDPTFWDG GLSVIAGMID ELEAMG
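
The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; the two tables differ only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases of the gene are used as input.

```python
# Codon table built from the NCBI base ordering T, C, A, G.
# Amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored, and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
head = "ATGGCCAAAG CAGCGACCTC CCGAGCCAGC"
print(translate(head))  # MAKAATSRAS
```

The output matches the first block of the protein sequence field. Applying the same function to the final bases of the gene (`GAATTGGAGG CAATGGGGTA G`) gives `ELEAMG`, the end of the protein, with the TAG stop codon excluded.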