Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3637 |
Symbol | |
ID | 6411313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3897476 |
End bp | 3898867 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713517 |
Product | protease Do |
Protein accession | YP_001992612 |
Protein GI | 192292007 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.25969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCGA TCCGCACGTT CGCCGTCCTC TGTGTGTCAC TCGCCCTCAC CACGCCGCTC GCGGCGCAGG ACCGGCGGGT GCCGTCGTCG CCGGCGGAGC TGAAGCTGTC CTACGCGCCG ATTGTGCAGC ATGTGCAGCC GGCGGTCGTG AACGTCTATG CCGCCAAGGT GGTGCAGAAC CGAAATCCGC TGCTGGAAGA TCCGATCTTC CGCCGGTTCT TCGGCGTGCC CGGCCAGCCG GAGCAGATCC AGCGCTCGCT CGGCTCGGGT GTGATGGTCG ATGCCTCGGG CCTCGTCGTC ACCAACAATC ACGTCATCGA GGGCGCCGAT CAGGTCAAGG TCGCGCTCGC CGACAAGCGC GAGTTCGAAG CCGAGATCGT GCTGAAGGAC AGCCGCACCG ATCTGGCGGT GCTGCGGCTC AAGGACACCA GCGAGAAATT CCCCACGCTC GACTTCGCCA ACTCCGACGA CCTGCTGGTC GGCGACGTGG TGCTGGCGAT CGGCAATCCG TTCGGCGTCG GTCAGACGGT GACGCATGGC ATCGTCTCGG CGCTGGCGCG CACTCAGGTC GGCATTACCG ACTATCAGTT CTTCATTCAG ACCGACGCCG CGATCAACCC GGGCAATTCC GGCGGCGCGC TGGTCGATGT CTCGGGCAAG CTGGTTGGTA TCAACACCGC GATCTTCTCG CGCTCGGGCG GCTCGCAGGG GATCGGCTTC GCGATCCCCG CCAACATGGT GCGCGTCGTG GTCGCCTCGG CCAAGAGCGG CGGCAAGGCC GTGAAGCGGC CGTGGCTCGG CGCGCGGCTG CAGGCGGTGA GCCCGGAGAT CGCCGAGACC CTGGGGCTGA AGCGGCCGGG CGGTGCGCTG GTCGCCAGCG TTACCAAGGG CAGCCCGGCG GAGCGGGCAG GGCTGAAATT GTCCGACCTG ATCGTGTCGA TCGACGGCTT TGCGATCGAT GATCCCAACG CGTTCGATTA TCGGTTTGCG ACGCGTCCGC TTGGCGGTGC CGCGCAGCTC GAAGTGCAGC GCAGCGGCAA GGCGGTGAAG CTGTCGATCC CGCTCGAAAC CGCACCGGAC TCCGGCCGCG ACGAGCTGGT GATCACCTCG CGCTCGCCGT TCCAGGGTGC GAAGATCGCC AATATTTCCC CGGCGATCGC CGACGAAATG CGGCTCGATC CGAGCGTCGA AGGCGTCGTG GTCACCGATC TTCCCGACGA CAGCACTGCG GCGAATGTCG GCTTCCAGAA GGGCGACATC ATCGTCGCCG TCAACAACAC CCGGATCGGC AAGACCAGCG ACCTCGAACG CGTAGCCGGC CAAACGGCGC GGCTGTGGCG CATCATGCTG GTCCGCGGCG GCCAGCAGAT CCAAGTCACC TTGGGCGGGT AG
|
Protein sequence | MNPIRTFAVL CVSLALTTPL AAQDRRVPSS PAELKLSYAP IVQHVQPAVV NVYAAKVVQN RNPLLEDPIF RRFFGVPGQP EQIQRSLGSG VMVDASGLVV TNNHVIEGAD QVKVALADKR EFEAEIVLKD SRTDLAVLRL KDTSEKFPTL DFANSDDLLV GDVVLAIGNP FGVGQTVTHG IVSALARTQV GITDYQFFIQ TDAAINPGNS GGALVDVSGK LVGINTAIFS RSGGSQGIGF AIPANMVRVV VASAKSGGKA VKRPWLGARL QAVSPEIAET LGLKRPGGAL VASVTKGSPA ERAGLKLSDL IVSIDGFAID DPNAFDYRFA TRPLGGAAQL EVQRSGKAVK LSIPLETAPD SGRDELVITS RSPFQGAKIA NISPAIADEM RLDPSVEGVV VTDLPDDSTA ANVGFQKGDI IVAVNNTRIG KTSDLERVAG QTARLWRIML VRGGQQIQVT LGG
|
| |