Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4368 |
Symbol | |
ID | 3912183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4951999 |
End bp | 4953276 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637886274 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_487966 |
Protein GI | 86751470 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGA AGCCCCCCTC CGACATCCTC GATCGCCGCC GGTTTCTCGG CGCTGCCGGC CTCGCCGGCG CCGGCGCGCT GCTGCCGCTC GCCGCCAAGG CCGGCGAGGC GGCGAAGCCC GACCCGATGA TCACCGAGCT GCAGGACTGG AATCGCTTTC TCGGCGACGG CGTCGACAAG AAGCCCTATG GCGTGCCGTC GAAATTCGAG AAGGACGTGA TCCGCCGCGA CGTGTCGTGG CTCACGGCGT CGCCGGAATC CTCGGTCAAT TTCACGCCGC TGCACGCGCT CGACGGCATC ATCACGCCGT CCGGCCTGTG CTTCGAGCGC CATCACGGCG GCGTCGCCGA GATCGATCCG GCGCAGCACC GGCTGATGAT CAACGGCCTG GTCGACACCC CGATGGTGTT CACCATGGAC GACATCCGGC GGATGCCGCG GGTCAACAAG GTGTACTTCC TGGAATGCGC GGCGAATTCC GGCATGGAGT GGCGCGGTGC GCAGCTCAAC GGCTGCCAGT TCACCCACGG CATGATCCAC AATGTGATGT ACACCGGCGT GACGCTGAAG ACGCTGCTGG ATCAGGCCGG GCTGAAGCCG AACGCCAAAT GGCTGATGCT GGAGGGCGCG GACTCCGCCG GCATGAACCG CTCGCTGCCG GTGTCGAAAG CGCTCGACGA CGTGCTGATC GCGTTCGCGA TGAACGGCGA GGCGCTGCGT CCGGAAAACG GCTATCCGCT GCGCGCGGTG ATTCCCGGCT GGCAGGGCAA TCTCTGGGTC AAATGGCTGC GCCGCATCGA GGCCGGCGAC ATGCCGTGGC AGGCCCGCGA GGAGACCTCG AAATACACCG ACCTGATGCC GGACGGCCGC GCCCGCAAAT ACACCTTCGT GATGGACGCG AAAAGCGTGA TCACCAATCC GTCGCCGCAG GCGCCGCTGA AATTCAAGGG CCGCAACGTG CTGAGCGGCG TCGCCTGGTC GGGGCGCGGC ACCGTCAAGC GCGTCGACGT CACCATGGAC GGCGGCCGCA ACTGGCGCGA GGCGCGGATC GACGGGCCGG TGCTCGACAA GTCGATGGTG CGTTTCTACG TCGATTTCGA CTGGAACGGC GAAGAGTTGA TGTTGCAGTC GCGCGCCATC GACGAGACCG GCTACGTGCA GCCGAGCAAG GCCGAGCTGC GCAAGGTCCG CGGCGTCAAT TCGATCTACC ACAACAACGG CATCCAGACC TGGCTCGTGC ATCCGGACGG AGTGACTGAA AATGTCGAGA TCGCTTAG
|
Protein sequence | MSEKPPSDIL DRRRFLGAAG LAGAGALLPL AAKAGEAAKP DPMITELQDW NRFLGDGVDK KPYGVPSKFE KDVIRRDVSW LTASPESSVN FTPLHALDGI ITPSGLCFER HHGGVAEIDP AQHRLMINGL VDTPMVFTMD DIRRMPRVNK VYFLECAANS GMEWRGAQLN GCQFTHGMIH NVMYTGVTLK TLLDQAGLKP NAKWLMLEGA DSAGMNRSLP VSKALDDVLI AFAMNGEALR PENGYPLRAV IPGWQGNLWV KWLRRIEAGD MPWQAREETS KYTDLMPDGR ARKYTFVMDA KSVITNPSPQ APLKFKGRNV LSGVAWSGRG TVKRVDVTMD GGRNWREARI DGPVLDKSMV RFYVDFDWNG EELMLQSRAI DETGYVQPSK AELRKVRGVN SIYHNNGIQT WLVHPDGVTE NVEIA
|
| |