Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2047 |
Symbol | |
ID | 3909862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2325976 |
End bp | 2327271 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883940 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_485665 |
Protein GI | 86749169 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAGCA AGGTCTCGCG TCGAAAACTG CTTCACATGG CGTCAGCAGG CACGGCCGCT GCCGCTTTTC CCGCCCCATT CGTGTCCGGC GTCACGCGTG CGGCATCGGC AGATCCGATC CTGCTCGGGG TTCCGACGGC TCAAACCGCC CAGGCGGGCG TCGCGGATCA TCAGGACTAT CTGAACGGGA CGACGTTGGC CCTGGAAGAA ATCAACGGCG CCGGCGGCGT GCTCGGGCGT CAGGTCAAAG CCGTCGTGGT CGATATCGAC CCGCTGTCCC CGGAAAGCGG GCAGGTTGCA ATCAACAAGC TGATCGACGC CAAGGTGCAC GCGATGTCCT GCGCCTTCGT GTTCACGCCG GTCCCGGTGG CGGACGTGTC GGCGCGCTAC AAGGCGCCGT TTCTGTGGGG CCTCACTCAG CGCAACATGA CCGATCTCGT CGCCAAGCAG CCCGACAAAT ACTCGCACGT GTTTCAGACT GACCCGTCCG AGGTCCACTA CGGGCACACG TTCCCGGTGT TTTTGAAAGC GATGAAGGAC CAGGGGGTGT GGAAGCCGCT GAATAACGGC GTGCACATCG TCCAGGAACA GATCGCCTAC AACCAGACGA TCTCGAAGGC GCTGCAGGCG TCGCTCCCCA AGAGCGAGTT CAAGCTCGCC GGCATCACCG ACATCCAGTA TCCGGTGCAG GACTGGGGCA CAGTCATCCA GGAGATCAAG AAGGTCGGGG CCGGGGCGGT GATGATCGAC CATTGGGTCG CCGCCGAATA CGCCGCCTTC GTCAAACAGT ACAGCGCCGA TCCGTTGAAG GGCGCGCTCG TCTATCTGCA ATACGGACCG TCGCAGCCCG AGTTCCTCGA ACTGTCGGGG CCCGCCGCTG AAGGCTTCGT CTGGAGCACC GTGCTCGGCG TCTATGCGGA CGAGAAGGGC AAGGCATTCC GCGCCAAATA CAAGAAGCGG TTTCCCGGCA TCATGGGGCT TTGCTACACC GGCAACGGTT ACGACACGAC GTATTATCTC AAGGCCGCCT GGGAGGCCGT CGGCGATCCG TCGAACTTCA AGGGCGTCAG TGACTGGATC CGCAAGAATT CCTATCGCGG CGTCTGCGGC TTCATGAGCA TGGACAATCC CTATCAGGAA TGTGCGCACT ATCCGGACAC GGGTGATGCG ATCGGAGCCG CTGAGCTCGA GAAGGGCATG GCGCAACTGT TCTTCCAGGT CCAGAACAAC GAGCACAAGA TCATCTATCC GGACGTGCTC GTCGAGAACA AGCTGCAGAA GGCGCCGTGG TGGTGA
|
Protein sequence | MVSKVSRRKL LHMASAGTAA AAFPAPFVSG VTRAASADPI LLGVPTAQTA QAGVADHQDY LNGTTLALEE INGAGGVLGR QVKAVVVDID PLSPESGQVA INKLIDAKVH AMSCAFVFTP VPVADVSARY KAPFLWGLTQ RNMTDLVAKQ PDKYSHVFQT DPSEVHYGHT FPVFLKAMKD QGVWKPLNNG VHIVQEQIAY NQTISKALQA SLPKSEFKLA GITDIQYPVQ DWGTVIQEIK KVGAGAVMID HWVAAEYAAF VKQYSADPLK GALVYLQYGP SQPEFLELSG PAAEGFVWST VLGVYADEKG KAFRAKYKKR FPGIMGLCYT GNGYDTTYYL KAAWEAVGDP SNFKGVSDWI RKNSYRGVCG FMSMDNPYQE CAHYPDTGDA IGAAELEKGM AQLFFQVQNN EHKIIYPDVL VENKLQKAPW W
|
| |