Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4252 |
Symbol | |
ID | 4024773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4720356 |
End bp | 4721633 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637964458 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_571370 |
Protein GI | 91978711 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.82541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.253409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGA AGCCTCCCTC CGACATCCTC GATCGCCGCC GGTTTCTCGG CGCAGCAGGC CTTGCGGGCG CCGGCGCATT GCTGCCCCTC GCGGCGAAGG CGGGCGAGGC GGCGAAGCCG GACCCGGCGA TCACCGAGGT GCAGGACTGG AATCGCTATC TCGGCGACGG CGTCGACAAG AAGCCCTATG GCGTTCCGTC CAAGTTCGAA AAGGATGTGA TCCGCCGCGA CGTGTCGTGG CTCACCGCCT CGCCGGAATC CTCGGTCAAT TTCACGCCGC TGCATGCGAT CGACGGCATC ATCACGCCGT CCGGCGTGTG TTTCGAGCGC CACCACGGCG GCGTCGCCGA GATCAACCCG GCGGAGCACC GGCTGATGAT CAATGGCCTG GTCGACACCC CGATGGTGTT CACCATGGAC GACATCAAGC GGATGCCGCG GGTCAACAAG GTGTACTTCC TGGAATGCGC GGCGAACTCC GGCATGGAGT GGCGCGGCGC GCAGCTCAAC GGCTGCCAGT TCACCCACGG CATGATCCAC AACGTGATGT ACACCGGCGT GCCCCTGAAG GTGCTGCTCG AACAGGCCGG GCTGAAGCCG AACGCGAAAT GGCTGATGCT GGAGGGCGCG GACAGCGCCG GCATGAATCG CTCGCTGCCG GTTGCGAAGG CGCTCGACGA CGTGCTGATC GCGTTCGCGA TGAATGGCGA GGCGCTGCGC CCCGAGAACG GCTATCCGCT GCGCGCGGTG ATCCCCGGCT GGCAGGGTAA TCTCTGGGTG AAATGGCTGC GCCGGATCGA AGCCGGCGAC CAGCCCTGGC AGGCCCGCGA GGAAACCTCG AAATACACCG ATCTGATGCC CGACGGCCGC GCCCGCAAAT ACACCTTCGT GATGGATGCG AAGTCGGTGA TCACCAACCC GTCGCCGCAA GCGCCGCTGA AGTTCAAGGG CCGCAACGTG CTGAGCGGCG TCGCCTGGTC GGGCCGCGGC ACCGTCAAGC GCGTCGACGT CACGATGGAC GGCGGTCGGA ACTGGCGTGA GGCGCGGATC GACGGACCGG TGCTGGACAA GTCGTTGGTG CGTTTCTACG TCGATTTCGA CTGGAACGGT CAGGAACTGA TGCTGCAGTC GCGCGCCATC GACGAGACCG GCTACGTACA GCCGACCAAG GCCGAGCTGC GCAAGGTCCG CGGCGTCAAC TCGATCTATC ACAACAACGG CATCCAGACT TGGCTCGTGC ATCCGGACGG AGTGACTGAA AATGTCGAGA TCGCTTAG
|
Protein sequence | MSQKPPSDIL DRRRFLGAAG LAGAGALLPL AAKAGEAAKP DPAITEVQDW NRYLGDGVDK KPYGVPSKFE KDVIRRDVSW LTASPESSVN FTPLHAIDGI ITPSGVCFER HHGGVAEINP AEHRLMINGL VDTPMVFTMD DIKRMPRVNK VYFLECAANS GMEWRGAQLN GCQFTHGMIH NVMYTGVPLK VLLEQAGLKP NAKWLMLEGA DSAGMNRSLP VAKALDDVLI AFAMNGEALR PENGYPLRAV IPGWQGNLWV KWLRRIEAGD QPWQAREETS KYTDLMPDGR ARKYTFVMDA KSVITNPSPQ APLKFKGRNV LSGVAWSGRG TVKRVDVTMD GGRNWREARI DGPVLDKSLV RFYVDFDWNG QELMLQSRAI DETGYVQPTK AELRKVRGVN SIYHNNGIQT WLVHPDGVTE NVEIA
|
| |