Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2011 |
Symbol | |
ID | 4022493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2249476 |
End bp | 2251056 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962204 |
Product | peptidase S1C, Do |
Protein accession | YP_569147 |
Protein GI | 91976488 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.963752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC GAAATACCGA TCTTCCCTCG CAGCCGTCCC GCCAGCCGGA TCGCCGATCG CTGCTGTCAG CGCGCAAATT CGCGCTGATG GCCTCGGTGG TCGCCGGTCT CGGCGCGGGC GCGTTCGGCC TGTCGCAGAC CTCGACGCCG GTCGATCTGT TCAGCACGCC GGCGCATGCG CAGGTCGGCA ACAAGGTCAA CCAGGCTCAG CAGCCGGTGG GCTTCGCCGA TATCGTCGAG AAGGTGAAGC CGTCGGTGAT CTCGGTGAAG GTCAACATCG CCGAGAAGAC GGCGAAGAAC GAGGATCGCG CCGAAGACTC GCCGTTTCAG CCGGGCTCCC CGATGGAGCG CTTCTTCCGC CGCTTCGGCG GTGAGATGCC CCCCGGTATG CGCGGCCATC GCGGCGGCGG CACGATGACC GGGCAGGGCT CGGGCTTCTT CATCTCGGCT GACGGCTACG CCGTGACCAA CAATCACGTC GTCGACGGCG CCGACAAGGT CGAGGTCACC ACCGACGACG GCAAGACCTA CAAGGCCAAG GTCATCGGCA CCGACCAGCG TACCGATCTG GCGCTGATCA AGGCCGAGGG CCGCACCGAC TTCCCGTTCG CCAAGCTGTC CGAGGGCAAG CCGCGGATCG GTGATTGGGT GCTCGCGGTC GGCAATCCGT TCGGCCTCGG CGGCACCGTC ACCGCCGGCA TCGTCTCGGC CTCCGGCCGC GACATCGGCA ATGGTCCGTA CGACGATTTC ATCCAGATCG ACGCGCCGGT GAACAAGGGC AATTCCGGTG GCCCGGCGTT CGACACCAAC GGCGAAGTGA TGGGCGTCAA CACCGCGATC TACTCGCCGT CCGGCGGCAG CGTCGGCATC GCGTTCTCGA TCCCCGCCAG CACCGTCAAG ACGGTGGTGC AGCAGCTCAA GGACAAGGGT TCGGTGAGCC GCGGCTGGAT TGGCGTGCAG ATCCAGCCGG TCACGCCGGA GATCGCCGAC AGCCTCGGGC TGAAGAAGCC CGACGGCGCG CTGGTCGCCG AGCCGCAGCC CAATGGTCCG GCGGCGAAGG CGGGCATCGA ATCCGGCGAC GTCATCACCG CGGTCAACGG CACGCCGGTG AAGGACGCGC GCGAACTCGC CCGAACCATC GGCGGCTTCG CGCCGGGCAA TACGGTGAAG CTCACCGTGG TGCACAAGGG CGCGGACCGC GAGCTCAACC TGACGCTCGG CCAATTGCCG AACCAGGTCG AGGCCAAGGT CAATGATGGT GGCGACAACG GCAACAGTTC CAGCCGAGGC ACAGAGGTGC CGAGGCTCGG CCTGACGGTC GCGCCGGCCA GCTCGGTCGC AGGCGCCGGC AAGGACGGCG TCGTCGTCAC CGACGTCGAT CCCAAGAGCG CCGCAGCCGA CCGCGGCTTC AAGGAAGGCG ATGTGATTCT CGAGGTCGCG GGCAAGAACG TGGCCAGCCC CGGCGATGTT CGCGAGGCGA TCAACACCGC CAAGGCCGAC AACAAGAACA GCGTGCTGAT CCGGGTTCGC TCGGGTGGTT CGTCGCGTTT CGTCGCGGTG CCGATCTCCG CCAAGGGCTG A
|
Protein sequence | MTDRNTDLPS QPSRQPDRRS LLSARKFALM ASVVAGLGAG AFGLSQTSTP VDLFSTPAHA QVGNKVNQAQ QPVGFADIVE KVKPSVISVK VNIAEKTAKN EDRAEDSPFQ PGSPMERFFR RFGGEMPPGM RGHRGGGTMT GQGSGFFISA DGYAVTNNHV VDGADKVEVT TDDGKTYKAK VIGTDQRTDL ALIKAEGRTD FPFAKLSEGK PRIGDWVLAV GNPFGLGGTV TAGIVSASGR DIGNGPYDDF IQIDAPVNKG NSGGPAFDTN GEVMGVNTAI YSPSGGSVGI AFSIPASTVK TVVQQLKDKG SVSRGWIGVQ IQPVTPEIAD SLGLKKPDGA LVAEPQPNGP AAKAGIESGD VITAVNGTPV KDARELARTI GGFAPGNTVK LTVVHKGADR ELNLTLGQLP NQVEAKVNDG GDNGNSSSRG TEVPRLGLTV APASSVAGAG KDGVVVTDVD PKSAAADRGF KEGDVILEVA GKNVASPGDV REAINTAKAD NKNSVLIRVR SGGSSRFVAV PISAKG
|
| |