Gene Information |
Locus tag | RPD_1117 |
Symbol | |
ID | 4021593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1271529 |
End bp | 1272494 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637961309 |
Product | PDZ/DHR/GLGF |
Protein accession | YP_568256 |
Protein GI | 91975597 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
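The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record for RPD_1117.
gene_len, protein_len = cds_lengths(1271529, 1272494)
print(gene_len, protein_len)  # 966 321
```

Under these assumptions the result agrees with the reported Gene Length (966 bp) and Protein Length (321 aa).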
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0129707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTTCCT TGCCCGAATG GAATGTGCCG GCCGCGATCC GGCCGCGTGC CGCTGACTTT CCGTTCGATC TCGAACGCGC GTTGTCGTCG GTGATCGGGT TGCATTCGAT CATTCCGTCG GACGCCTTCA CGGCGAACAC GCTCGGCACC GAGCGCGCCG GCAATTGCGT GCTGATCGAC GACGGCCTGC TGCTGACCAT CGGCTATCTG ATCACCGAGG CGGAGACGGT CTGGCTTCAT CTCGGCGACG GGCGGGTGGT CGAGGGCCAT GCGCTCGGCT TCGATGCGGA GAGCGGGTTC GGTCTCGTGC AGGCGCTCGG CCCGATCGAT CTGCCGCCGC TGGCGCTCGG CAATTCCGGT GCGGCCAAAG CCGGCGATCG CGTGGTGATC GCCGGCGCCG GCGGACGAAC GCGATCGGTC GCGGGTCGGA TCGCCACAAG GCAGGAATTC GCCGGCTATT GGGAATATCT GCTCGACGAC GCGATCTTCA CCGAACCGTC GCACCCGAAT TGGGGCGGCG CCGGGCTGAT TTCGGCGACG GGCGAACTCA TCGGCATCGG CTCGCTGCAG ATCGAGCGCA GCGGCACCGA CGCGCATTAC AACATGATCG TGCCGATCGA TCTGTTGAAG CCGGCGCTCT CCGATCTGCG CAAATTCGGT CGCGTCGACA AGCCGCCGCG GCCGTGGCTC GGCCTGTATT CGACCGAGAT CGAGGGCAAG ATCGTCGTGG TCGGAATCGC GCCGAAGGGC CCGGCCGCCC GCGCCGAACT GAAGTCCGGC GACGTCATCC TCGCGGTCGC CGGCGAGAAG GTCAGCAGCG AGGGCGAGTT CTATCGCAAG ATCTGGGCGC TCGGCACCGC GGGCGTCGAG GTCCCGCTGA CGCTGTTCAG CGGCGGCGCG ACCTTCGACG TCGTGCTGCA CTCATCCGAC CGCGCCAAAT TCCTCAAGGC CCCGCGGCTG CACTGA
Protein sequence | MPSLPEWNVP AAIRPRAADF PFDLERALSS VIGLHSIIPS DAFTANTLGT ERAGNCVLID DGLLLTIGYL ITEAETVWLH LGDGRVVEGH ALGFDAESGF GLVQALGPID LPPLALGNSG AAKAGDRVVI AGAGGRTRSV AGRIATRQEF AGYWEYLLDD AIFTEPSHPN WGGAGLISAT GELIGIGSLQ IERSGTDAHY NMIVPIDLLK PALSDLRKFG RVDKPPRPWL GLYSTEIEGK IVVVGIAPKG PAARAELKSG DVILAVAGEK VSSEGEFYRK IWALGTAGVE VPLTLFSGGA TFDVVLHSSD RAKFLKAPRL H
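The sequences in this record are printed in space-separated blocks of 10 characters, so any script that reads them has to strip the whitespace first. As a minimal sketch, the hypothetical helper `gc_percent` below does that and computes a GC percentage that can be compared with the GC content field of the record. It assumes the input contains only the bases A, C, G and T plus whitespace.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block separators) is ignored.
    """
    cleaned = "".join(seq.split()).upper()  # remove spaces and newlines
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence, as printed in the record.
print(gc_percent("ATGCCTTCCT TGCCCGAATG"))
```

To check the whole gene, paste the full Gene sequence value into the call in place of the two-block excerpt.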