Gene Information |
Locus tag | RPB_1431 |
Symbol | |
ID | 3908381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1620771 |
End bp | 1622603 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883325 |
Product | peptidase M3B, oligoendopeptidase-like clade 3 |
Protein accession | YP_485052 |
Protein GI | 86748556 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
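The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_1431.
length_bp = gene_length(1620771, 1622603)
print(length_bp)                  # 1833
print(protein_length(length_bp))  # 610
```

Both results agree with the Gene Length (1833 bp) and Protein Length (610 aa) fields of this record.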
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.838587 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCA AGTCGGCGAA GCCCGCGAAA AAGCAGGCGG CCGCGGCGGG CCGGAAGCCG GTCAAGCTGC CGGAATGGAA TCTGGCCGAT CTGTATTCGG CGATCGATGC GCCGCAGGTC GCCGCCGATC TCGACCGGCT CGACGCCGAA TGCGCCGCGT TCGAGGCCGA GTTCAAGGGC AAGCTGGCGG AGCAGGCAGC CGCCGACGGC GGCGGCGCAT GGTTGGCCGG CGCGGTGAAG CGCTACGAGG CGATCGAGGA TCTGGCCGGT CGGCTCGGCT CCTATGCGGG CCTCGTCCAC GCCGGCGACA GCGTCGACCC GGTGAAGTCG AAATTCTACG GCGACGTTTC CGAGCGGCTG ACCGCGGCGT CGGTGCATCT GTTGTTCTTC TCGCTGGAAC TCAATCGCGT CGACGACGCG GTGCTGGAGC GGGCCATGCA GGCACCCGAA CTCGGTCACT ACCGGCCGTG GATCGAGGAT CTGCGCAAGG ACAAGCCGTA TCAGCTCGAC GACCGTGTCG AGCAGCTGTT CCACGAAAAG TCGCAGACCG GCTACGGCGC CTTCAATCGG CTGTTCGACC AGACCATCTC CGGCCTGCGC TTCAAGCTCG GCGGCAAGGA ACTGGCGATC GAGCCGACGC TGAATCTGCT GCAGGACCGT ACACCGGCCA AGCGCAAGGC GGCGGCCGAG GCGCTGGCCA AGACCTTCAA GGCCAACGAG CGCACCTTCG CGCTGATCAC CAATACGCTG GCCAAGGACA AGGAGATTTC CGACCGCTGG CGCGGCTTCG AGGACGTCGC GGATTCGCGG CATCTCGCCA ACCGTGTCGA GCGCGAGGTG GTCGATGCGC TGGTCGCCTC GGTGCGCGCC GCCTATCCGC GGCTGTCGCA TCGCTACTAC AAGCTCAAGG CCGGCTGGTT CGGCAAGAAG AAGCTGCCGC ATTGGGACCG CAACGCGCCG CTGCCGTTCG CGGCCACCGG CAGCATCGCG TGGCCGGAAG CGCGCGAGAT GGTGCTGACC GCCTACAGCG CGTTCTCGCC GGAAATGGCG CGGATCGCCG AGCGGTTCTT CACCGATCGC TGGATCGATG CGCCGGTGCG TCCGGGCAAG GCGCCGGGCG CGTTCTCGCA CCCGACCACG CCGTCGGCGC ATCCTTATGT GCTGATGAAC TACCAGGGCA AGCCGCGCGA CGTGATGACG CTCGCCCATG AACTCGGCCA CGGCGTGCAC CAGGTGCTCG CCGCCGGCAA CGGCGCGCTG ATGGCGCCGA CGCCGCTGAC GCTGGCCGAG ACCGCCAGCG TGTTCGGCGA GATGCTGACC TTCCGCCGGC TGCTGTCGCA GACCAAGAGC GCCAAGCAGC GCCAGGCGTT GCTCGCCGGC AAGGTCGAGG ACATGATCAA CACCGTGGTG CGGCAGATCG CGTTCTACTC GTTCGAGCGC GCGATCCACA CCGAGCGCCG CAGCGGCGAA CTCACCGCGC AGCGGATCGG CGAGATCTGG CTCGGCGTGC AGAGCGAGAG CCTGGGGCCG GCGATCGAGA TCAAGCCGGG CTACGAGAGC TTCTGGATGT ACATCCCGCA CTTCATCCAT TCGCCGTTCT ACGTCTACGC CTACGCGTTC GGCGACTGCC TGGTGAACTC GCTTTACGCG GTCTACGAGC ACGCCCAGGA GGGCTTCGCC GAGCGCTACC TCGCGATGCT CTCGGCCGGC GGCACCAAGC ACTATTCCGA ACTGCTGGCC CCGTTCGGCC TCGACGCCAA GGACCCCAGC TTCTGGGACG GCGGCCTGTC GGTGATCGCC 
GGCATGATCG ACGAACTGGA GGCGATGGGC TAG
|
Protein sequence | MASKSAKPAK KQAAAAGRKP VKLPEWNLAD LYSAIDAPQV AADLDRLDAE CAAFEAEFKG KLAEQAAADG GGAWLAGAVK RYEAIEDLAG RLGSYAGLVH AGDSVDPVKS KFYGDVSERL TAASVHLLFF SLELNRVDDA VLERAMQAPE LGHYRPWIED LRKDKPYQLD DRVEQLFHEK SQTGYGAFNR LFDQTISGLR FKLGGKELAI EPTLNLLQDR TPAKRKAAAE ALAKTFKANE RTFALITNTL AKDKEISDRW RGFEDVADSR HLANRVEREV VDALVASVRA AYPRLSHRYY KLKAGWFGKK KLPHWDRNAP LPFAATGSIA WPEAREMVLT AYSAFSPEMA RIAERFFTDR WIDAPVRPGK APGAFSHPTT PSAHPYVLMN YQGKPRDVMT LAHELGHGVH QVLAAGNGAL MAPTPLTLAE TASVFGEMLT FRRLLSQTKS AKQRQALLAG KVEDMINTVV RQIAFYSFER AIHTERRSGE LTAQRIGEIW LGVQSESLGP AIEIKPGYES FWMYIPHFIH SPFYVYAYAF GDCLVNSLYA VYEHAQEGFA ERYLAMLSAG GTKHYSELLA PFGLDAKDPS FWDGGLSVIA GMIDELEAMG
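The gene and protein sequences above are linked by translation table 11. The following is a rough sketch of that translation, under these assumptions: table 11 assigns the same amino acids to codons as the standard code (it differs in the set of permitted start codons, which this sketch does not handle), the input is in frame, and space-separated blocks of 10 bases as shown in this record are accepted. The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop whitespace (e.g. the 10-base blocks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 33 bases of the gene sequence in this record.
print(translate("ATGGCGAGCA AGTCGGCGAA GCCCGCGAAA"))      # MASKSAKPAK
print(translate("GGCATGATCG ACGAACTGGA GGCGATGGGC TAG"))  # GMIDELEAMG
```

The two fragments reproduce the first and last ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.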