Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1014 |
Symbol | |
ID | 3909138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1163194 |
End bp | 1164159 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882907 |
Product | PDZ/DHR/GLGF |
Protein accession | YP_484635 |
Protein GI | 86748139 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCCT TGCCCGAATG GAACGTGCCG GCCGCGATCC GGCCGCGCGC TGCGGACTTT CCGTTCGATC TCGATCGCAC GCTGTCGGCG GTGCTCGGCG TGCACGCGAT CATTCCGCCC GACGCCTTCA CCGCGAATAC GCTCGGCACC GAACGCGCCG GCAACGCCGT GCTGATCGAC GACGGCCTGC TGCTGACCAT CGGCTATCTG ATCACCGAAG CAGAGACCGT CTGGCTGCAT CTCGGCGACG GCCGGGCGGT CGAGGGGCAC GCGCTCGGCA TCGATTCCGA CAGCGGCTTC GGCCTGGTGC AGGCGCTCGG CGCGATCGAC CTGCCGCCGC TGCGGCTCGG CCATTCGAGC GCGGCCAAGA CCGGCGATCG CGTGATCGTC GGCGGCGTCG GCGGACGTAT CCGCTCGGTC GCGGGGCGGA TCGCGGCGCG GCAGCCCTTC GCCGGCTATT GGGAATATCT GATCGACGAC GCGATCTTCA CCGAGCCGTC GCACCCGAAC TGGGGTGGGG CCGGGCTGAT CTCCGCGACC GGCGAACTGA TCGGCATCGG CTCGCTGCAG ATCGAGCGCA GCGGCAGCGA CGAGCACTAC AACATGATGG TGCCGATCGA TCTCTTGAAG CCGGTGCTCG GCGATCTGCG CAAATTCGGC CGGGTCGACA GACCGCCGCG GCCGTGGCTC GGGCTGTATT CGACCGAGAT CGAGGACCGG ATCGTCGTGG TCGGGATCGC GCCGAAGGGC CCCGCCGCGC GCGCCGAGCT GAAATCGGGC GATGTCATTC TCGCGGTCGC CGGCGACAAG GTCACGAGCG AGGCGGAATT CTATCGCAAG GTCTGGGCGC TCGGCGCCGC CGGCGTCGAG GTCCCGCTGA CGCTGTTCAG CGGCGGCGCC ACCTTCGACG TCGTGCTGCA TTCGGCGGAC CGCGCCAAAT TCCTCAAGGG ACCGCGGATG CATTGA
|
Protein sequence | MPSLPEWNVP AAIRPRAADF PFDLDRTLSA VLGVHAIIPP DAFTANTLGT ERAGNAVLID DGLLLTIGYL ITEAETVWLH LGDGRAVEGH ALGIDSDSGF GLVQALGAID LPPLRLGHSS AAKTGDRVIV GGVGGRIRSV AGRIAARQPF AGYWEYLIDD AIFTEPSHPN WGGAGLISAT GELIGIGSLQ IERSGSDEHY NMMVPIDLLK PVLGDLRKFG RVDRPPRPWL GLYSTEIEDR IVVVGIAPKG PAARAELKSG DVILAVAGDK VTSEAEFYRK VWALGAAGVE VPLTLFSGGA TFDVVLHSAD RAKFLKGPRM H
|
| |