Gene Information |
Locus tag | RPC_0433 |
Symbol | |
ID | 3970828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 467127 |
End bp | 468128 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637923548 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_530327 |
Protein GI | 90421957 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
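The coordinate and length fields above are mutually consistent, which a short check can make explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Neither convention is stated in the record, though the reported values fit both. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 467127, End bp 468128.
length_bp = gene_length(467127, 468128)
print(length_bp, protein_length(length_bp))  # 1002 333
```

Both results match the Gene Length (1002 bp) and Protein Length (333 aa) rows. This check only applies because the record marks the gene as unspliced.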
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACGC CATCTCTGAC ACGCCGTCGC CTGATCACCA TCGCGGCGAC CGCCGCGGGC GCAGGCCTGC TCGGCCGCGC CGGTCTGGCG CGCGCCGCCA CCGAGCCGCT GCGCTGGCAG GGCCAGGCGC TCGGCGCCCA GGTGTCGCTC GAGATCTATC ACCCGGACCG CGCCGCGGCG GAGCGGCTGA TCGCGTTGTC GCTCGCCGAA GTGCGGCGGC TGGAGCGGCA GTTCAGCCTG TATCAGCCGG ACTCCGCGAT CTGCGACCTC AACCGCAGCG GCGTGCTGAT CGCGCCGGAC CCCGACATGG TGACGCTGCT GCAGGCCGCG CTGCGCTTCG CGCAACTCAC CGGCGGCGCG TTCGATCCCA CCGTGCAGCC GCTGTGGCGG CTCTACCAGG CGCATTTCGC CGCGGCGCAG CCGGATCCCG AGGGGCCGTC GGCCGAAGCG TTGGCCCAGG CCATGGCGCG GGTCGGCCAT GACGGGCTGC GGGTCACGCC GGACCGCGTC GTGCTGCTGA AGCCGGGTGC CGCGATCACG CTGAACGGCA TCGCGCAAGG CTATGCCACC GACAAGGTGG TGGCGCTGTT GCGCGGCGCG GGGCTGTCGA CCACGCTGGT CGACATGGGC GAAATCCGCG CGCTCGGCGC CCGGCCCGAC GGCACGCCAT GGCGCGTCGG CCTCGCCGAT CCGGACCAGG TCGGCGCCAT CACCGAAACC ATCGATCTCG TCGACCGCGC GGTGGCGACC TCCGCCGGCG CCGGCTTCCG CTTCGATCAC GAAGGCCGCT TCACCCACCT GTTCGATCCG GCAACCGGCC GCAGCCCCGC GCGCTACCGC AGCGTCAGCG TGCTGGCGCC GACCGCCACT GAAGCCGACG CGCTCTCCAC CGCGTTCAGC CTGCTGCCGC GGGACAAGAT CGAAGGCATC GTCGGCGAGC GTCCGGGCGT GCAGGCCCGG ATGATGGATC TGGATGGTCG GTTGAGCTGC TGCGGGGCGT GA
Protein sequence | MPTPSLTRRR LITIAATAAG AGLLGRAGLA RAATEPLRWQ GQALGAQVSL EIYHPDRAAA ERLIALSLAE VRRLERQFSL YQPDSAICDL NRSGVLIAPD PDMVTLLQAA LRFAQLTGGA FDPTVQPLWR LYQAHFAAAQ PDPEGPSAEA LAQAMARVGH DGLRVTPDRV VLLKPGAAIT LNGIAQGYAT DKVVALLRGA GLSTTLVDMG EIRALGARPD GTPWRVGLAD PDQVGAITET IDLVDRAVAT SAGAGFRFDH EGRFTHLFDP ATGRSPARYR SVSVLAPTAT EADALSTAFS LLPRDKIEGI VGERPGVQAR MMDLDGRLSC CGA
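The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be stripped before computing anything from it, such as the GC content reported above. The following is a minimal sketch; `gc_percent` is a hypothetical helper rather than part of any database tooling, and how the record rounds its 73% figure is not known.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two displayed blocks of the gene sequence, used here as a small sample.
sample = "ATGCCGACGC CATCTCTGAC"
print(round(gc_percent(sample), 1))  # 60.0
```

Passing the full Gene sequence field to the same function gives a whole-gene value that can be compared with the GC content row.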