Gene Information |
Locus tag | RPD_0967 |
Symbol | |
ID | 4021442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1093705 |
End bp | 1094670 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637961158 |
Product | hypothetical protein |
Protein accession | YP_568106 |
Protein GI | 91975447 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
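The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1093705
end_bp = 1094670
reported_gene_length = 966      # bp
reported_protein_length = 321   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.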
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAGGG TAGCAGGAGC GGATTTCCAG AAGGAAGTTC TGTTCGCGTT GGGCGGTGGC GATTCGCAGA TCAACATCGC TGTGCTGGAT GGACCGGTTG ACCGGACCCA TGATTGCTTT CGTGGCGCGC GGCTGGTCCC GCTCGACACC GCGGCGGCTG AGGGTTCCGA CGGACGCGCG ACCGCGCAGG GGACGCATAT CGCCAGCCTG ATCTTCGGCC AGCCGTGCAG TTCGGTCGAA GGCGTCGCTC CGCTGTGCCG CGGATTGATC GCACCGATCT TCAGCGACGA TCGGCTCGAC TGTTCTCAGG CGGAACTGGC GCAGGCGATC ACGATCGCGC TCGATCACGG CGCCCATATC ATCCAGATCA GCGGCGGCCT GTTCGGTGGA CCGCGGCAGC CAACCGCGGA ACTGCTTGAT GCAGTCGCAC GGTGCAACGC GGACAACGCG CTGATCGTCG CCGGCGCCGG CCGTGACGGT TGCGGCAGCC TGCTGCGCCG TTGCGGCGCG ACCAATCTTC TCCCGGTCGG CGCAATCGAC CAGCACGGCC GGCTGATCGA CGGCGGCGAC GCCAGCCTGC ACGAGATCGG CATTGCTGTG CCGGGCGCCA ATCTGATCGG CGCAGCGCCG CAAGGCGGCA TCGACAACCG TCGCGGCGCG AACTACGCGG CCGCGCTGAT TGCCGGGATC GCCGGGCTGC TGCTCGGCAC GCAACGCCAG ACCGGACGGA CGCTTGATCC CGGCGCGGCG GTCGAGGCGA TGCTGAGCAC CGCCACGCCG CGCGTGGCGG CGCGGGCGTA TGAATGCCAC CGCGCTTGGA TCGGCCTGGC GAATGTCGAA GCCGCCGCGG TGCGCCTGAA CGACGGGATG TACGACACCG CCACGGGGAC GCCGTTCCAA CTGTGGCATC GCGCCCAGAC GAGCGCAAAA TGGTCGGATC ACGCAGCGTT CCGGCCGCGG GCGTGA
|
Protein sequence | MVRVAGADFQ KEVLFALGGG DSQINIAVLD GPVDRTHDCF RGARLVPLDT AAAEGSDGRA TAQGTHIASL IFGQPCSSVE GVAPLCRGLI APIFSDDRLD CSQAELAQAI TIALDHGAHI IQISGGLFGG PRQPTAELLD AVARCNADNA LIVAGAGRDG CGSLLRRCGA TNLLPVGAID QHGRLIDGGD ASLHEIGIAV PGANLIGAAP QGGIDNRRGA NYAAALIAGI AGLLLGTQRQ TGRTLDPGAA VEAMLSTATP RVAARAYECH RAWIGLANVE AAAVRLNDGM YDTATGTPFQ LWHRAQTSAK WSDHAAFRPR A
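Both sequences are printed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction; `join_blocks` and `gc_fraction` are hypothetical helper names, and the sample uses only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 68% reported for the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as they appear in the record.
sample = join_blocks("ATGGTAAGGG TAGCAGGAGC")
sample_gc = gc_fraction(sample)

print(sample, len(sample), sample_gc)
```

Pasting the full Gene sequence field into `join_blocks` in place of the sample gives a string that can be compared against the Gene Length and GC content fields.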