Gene Information |
Locus tag | RPB_3147 |
Symbol | |
ID | 3910948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3599915 |
End bp | 3600949 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637885049 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_486754 |
Protein GI | 86750258 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
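The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3599915
end_bp = 3600949
gene_length_bp = 1035
protein_length_aa = 344

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not appear in the protein.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # 1035 0 344
```

Under these assumptions the span matches the listed gene length (1035 bp), divides evenly into 345 codons, and yields the listed protein length of 344 aa.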
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGCCCC GCTCCCTCAC GCGCCGTCGT TTGATCACCA TCGCCGCCGC GGCTGCCGCA GGATCGCTGT GGCCCGGCCG ATCACGCGCG GCAGCAGGGC CCGAGCCGGT GCGCTGGCAA GGCGCCGCGC TCGGCGCGCA GGTGTCGATC GAGATTCACC ATCCGGATCG CGCCGCCGCC GCGCGACTGG TTGAGCGCTC GATCGCCGAA GTGCGGCGGC TGGAGCGGCA GTTCAGCCTG TATCAGCCGG ACTCCGCGAT CTGCGAACTT AACCGCAGCG GCGTGCTGAT TGCGCCTGAT CCCGACATGG TGACGCTGCT GCAGGCCTCG CTCGGCTATG CCGATCTGAC CGGCGGCGCG TTCGATCCGA CGGTGCAGCC GTTGTGGCGC CTGTATCAGC AGCACTTCTC ATCCGACCGG ACCGATCCCG CAGGCCCCTC CTCGGCATGG CTCGAACAGG CGCTGGAGAA GGTCGGATAT GATGGACTGC GCGTCACGCC CGACCGCATC GTGTTGCTCA AGCGCGGCGC CGCGATCACG CTGAACGGCA TCGCCCAAGG TTATGCGACC GATCGCGTCG TCGAATTGCT CCGGAATGCG GGGCTGTCGA CGACGCTGGT CGATATCGGC GAAGTCCGCG CGCTCGGCGG GCGGCCGGAC GGCACGCCCT GGCGCGTCGG CCTCGCCGAT CCGGACCAGC CCGGCCGATC CGGCGAGATC GTCGAAATCG CCGACCGGGC CGTGGCGACG TCTGCGGGCG CCGGCTTCCG GTTCGATCCG GCGGGCCGCT TCACCCATTT GCTCGACCCG CGGACCGGCC GCAGCCCGCG CTCGTACAAT TCGGTCAGCG TCATCGCGCC GACGGCGACC GCAGCGGACG CGCTGTCGAC CGGCTTCAGC CTGATGCCGC TGCCGATGAT CCAGCGCATC GTCGATCAAT CCCATGGCGT GGAGGCGCGC ATTCTCGACC TCGCCGGTCA GAGGCTCCAC CTCCAGGCGG CGTCCGGACG GAGGGGACAC CGCTCCGCGT CGTGA
Protein sequence | MMPRSLTRRR LITIAAAAAA GSLWPGRSRA AAGPEPVRWQ GAALGAQVSI EIHHPDRAAA ARLVERSIAE VRRLERQFSL YQPDSAICEL NRSGVLIAPD PDMVTLLQAS LGYADLTGGA FDPTVQPLWR LYQQHFSSDR TDPAGPSSAW LEQALEKVGY DGLRVTPDRI VLLKRGAAIT LNGIAQGYAT DRVVELLRNA GLSTTLVDIG EVRALGGRPD GTPWRVGLAD PDQPGRSGEI VEIADRAVAT SAGAGFRFDP AGRFTHLLDP RTGRSPRSYN SVSVIAPTAT AADALSTGFS LMPLPMIQRI VDQSHGVEAR ILDLAGQRLH LQAASGRRGH RSAS
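The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before computing lengths or base composition. The sketch below shows one way to do this; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first three 10-base blocks of the gene sequence above.
sample = clean_sequence("ATGATGCCCC GCTCCCTCAC GCGCCGTCGT")
print(len(sample), sample[:3], gc_fraction(sample))
```

Pasting the full gene sequence into `clean_sequence` should allow the same functions to be used to compare against the listed gene length and GC content.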