Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3547 |
Symbol | |
ID | 3911349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4059770 |
End bp | 4061440 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885449 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_487153 |
Protein GI | 86750657 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.777163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC GAGCATGGCA AAGGCCGGTA TGGATTGCGG CGGTGGCGGC GGCGACGCTG GTGATGACGG CGGGGGTCGC ATTTTCCCAA TCGCTGGCGC CTCCACCTCT GACCAACCTT CCGCCAGTCA CCACGGCGCC GACCATCACG CCGTCGGTCG GCCCGACCAT TCCCGCGCTA CGCCCGCCAG GGCCGGACGT GCTGCCGTCG GTCGAGCGTG AGTATCGGCG GCTGCGCGAT GCGCCGCTGC CGCCCTCGCG CTGCAACTAT CTCGACAATG GCCGAACCTG CGCCAAGCTG TGGGTGATCG GCGATTGGCG GTCCGTGACC AAGCACAAGC GCGGCAAGGC AAAGGCCGAT CGCGTCCGCC GGGCTCCGCC GCAAATCGTC GCCGCGCGCG GGCAGCGCCC GGCGGTCGCC GCTCGCGACC ATGTTCCCGG CGAAGTGCTG ATCGAATTCG ACGGCGGTCT GTCCGAGCAG CAACTCCGTG CGCTGGCGCG ACGGCACCGG CTCACGCGCG TCAGCGTTGA CACCGTCGCA CTGGTCGGGA CCCGCATCGG ACTGTTTCGG ATCAACGATC GCCGCTCGCC GCAAACGGTC GCACGCGCAC TCGCCGCTGA CGGTCGGATA CGCGCCGCGC AGGCCAATTT CGTCTACACG CTGCAGAGCG ATGCCAAGCC CGCTGCCGCC GACAATTCGA TGCTGTATCC GCACGGCCGG CTGCGGCTGG CCGAGGCGCA TGCGCTGGCG CGCGGACGCG GCATTGTCGT TGCGGTGATC GATTCCGGCG TCGACATCGC CCATCCGGAA CTCGTCGGTG CGCTCGCCGG TTCGTTCGAC CCGCTCGGCA GCAAGGAGAA GCCGCATCAG CACGGCACCG GCGTCGCCGG GGCGATCGTC GCGCGCGCCA AACTCACCGG TGGCGCACCG GAGGCGAAAA TACTGGCGAT CCGGGCGTTC GGCGAGGGCA AGACCGGAAG CCAGAGCAGC ACGTCTTATC TGATCCTCAA AAGCCTCGAT ATCGCCCTCG GTCATGGCGC CCGAATCGTC AATATGAGCT TTGCCGGATC GCAGGACCCG CTGATTGCGC GCGGCGTCGC GGCTGCTGCA GCGCGGGGGA TCGTGATGAT TGCCGCCGCC GGCAATGCCG GGCCGAAGTC CCCGCCATTG TATCCTGCGG CGCTGCCGGG CGTGATCGCG GTCAGCGCGA CGAGTCCGGG CGACACGTTG TTTGCGGCAT CGAACCGTGG CCCGCAAATC GCCGTGGCAG CTCCCGGCGT CGACGTCCTG CTGCCGGCGC CCGACGGCAA ATACGAGGTG ATGACGGGGA CGTCGTTCTC TGCAGCATTC GTCAGCGGTA TCGCCGCGCT GATGATCGAG CGCAATCCCA CGCTCGTGCC CGATCAGGTG CGGGCCGTTC TGAGCCAGAC CGCCCGCGAT CTTGGAGCGC CGGGCCGTGA CGATCTGTTC GGGGCCGGCG AAGCCGATGC ATTTGCTGCC CTGTCGCGGG TGGACAGCCT GGCGGCCCCG CTGGTTGCGG CGCCCGCGAC CGCGCCGCAG CCCTCTGTCG CGACAGCGCA ACCTGCGACG CCCGGCGCTG CGTTGGCAGC GCCCGCCGGG GCAGAGCAGG AGGTCGGCCC GGCGCCGGCC GCCGCGATCC CGGCCCGCTG A
|
Protein sequence | MSDRAWQRPV WIAAVAAATL VMTAGVAFSQ SLAPPPLTNL PPVTTAPTIT PSVGPTIPAL RPPGPDVLPS VEREYRRLRD APLPPSRCNY LDNGRTCAKL WVIGDWRSVT KHKRGKAKAD RVRRAPPQIV AARGQRPAVA ARDHVPGEVL IEFDGGLSEQ QLRALARRHR LTRVSVDTVA LVGTRIGLFR INDRRSPQTV ARALAADGRI RAAQANFVYT LQSDAKPAAA DNSMLYPHGR LRLAEAHALA RGRGIVVAVI DSGVDIAHPE LVGALAGSFD PLGSKEKPHQ HGTGVAGAIV ARAKLTGGAP EAKILAIRAF GEGKTGSQSS TSYLILKSLD IALGHGARIV NMSFAGSQDP LIARGVAAAA ARGIVMIAAA GNAGPKSPPL YPAALPGVIA VSATSPGDTL FAASNRGPQI AVAAPGVDVL LPAPDGKYEV MTGTSFSAAF VSGIAALMIE RNPTLVPDQV RAVLSQTARD LGAPGRDDLF GAGEADAFAA LSRVDSLAAP LVAAPATAPQ PSVATAQPAT PGAALAAPAG AEQEVGPAPA AAIPAR
|
| |