Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0232 |
Symbol | |
ID | 3907859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 264849 |
End bp | 266402 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882114 |
Product | extracellular solute-binding protein |
Protein accession | YP_483854 |
Protein GI | 86747358 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATAC TCGATCTGGG ATCACGAACC CTGCGACGCC GCGATATTCT GGCGCTGATC GGCGGCGGCG CCGCAGCCGC TGCGGTCGGC CTGCCGGCGC TGGCGCAGGA GCCCAAGAAG GGCGGCGTGC TGAAGGTCGC GGCCCCAGCG AATCCGTCGT CGCTCGATCC GGCCACCGGC GGCGCCGGTT CCGACCACAG CATTCTCTGG ACGATCTACG ATACGCTGGT CGAGTGGGAC TACGACACGC TGAAGCCGAG GCCCGGCATG GCGAAATGGT CGTATCCGAA TCCGACCACG ATGGTGATCG ACATCACCCC CGGCATCCAG TTCCACGACG GCACCGCGAT GGACGCCGAG GCGGTGAAGT TCAACCTCGA TCGCAACCGC TCCGATCAGC GGTCCAATAT CAAGTCGGAT CTCGCCAGCA TCGAGTCGAT CGAGGTGACC AGCCCGCTGC AGGTGACGCT GAAGCTGAAG AGCCCGGATA CATCCCTGCC GGCGATCCTG TCCGACCGCG CCGGCATGAT GGTGTCGCCG ACCAACATCA AGGCGCTTGG CAACGAGACA GACCGCAAGC CGGTCGGCGC CGGGCCGTGG AAGTTCGTGC GCTGGAACGA CAACGAAATC ATCGTCGTGG CTCGCCACGA GAACTACTGG CGCAAGGGCC GGCCGTATCT CGACGGCATC GAGTTCAACA TCATCACCGA AAACGCCACG GCGCTGCGGT CGGTGGTCGC CGGCCAGAAC GACATGGCAT TTCAGTTGCC GGCACGGCTG AAGCCGGTGA TTGAGCGCGC CAAGGACCTG ACCATGGTCA GCTCGCCGAC GCTGTATTGC ATTCAGGTGT ATTTCAACTA CGCCCGCGCG CCGCTCGACA ATCTCAAGGT TCGTCAGGCG ATCAATTTCG CGTTCGACCG CGACACCTTC GTCAAGGCGG CGCTGAGCGG GCTCGGCGAA TCGGCCCGGA TGACGCTGCC GAGCTCGCAC TGGGCGTTCA ACAAGGATGT GGCCGGCACC TATCCGCACG ATCCGGAGAA GGCGAAGAAG TTGCTGGCAG AGGCCGGCTA CAAGGACGGC CTCGAGCTGA CGATCGGCGG CTATACCGAT CAGGATTCGG TGCGCCGCGG CGAGGTGATC CAGGATCAGC TCGGCAAGGT CGGCATCCGG CTCAAATTCA CCCGCGGCAC CATCGCGGAA ATCAGCGCGC AGTTCTTCGC GCAGGAGAAG AAGTTCGACC TGTTGGTGTC GGCCTGGACC GGGCGTCCCG ATCCGAGCAT GACCTATGGG CTCGGCTTCG ACAAAGGCGC GTACTACAAC GCCGGCCGCA CCGCCGATCC TGAGCTGTCC AAGCTGATCC TCGAAAGCCG CGTCAGCGAG GATTTGGCCA AGCGCGCCGA AGTGTTCGCC AGGATCCAGC GCATCACGGT CGAACAGGCA CTGTCGGCGC CGCTGGCGTT CCAGTTCGAG CTCGACGCGC TGTCGTCCAA GGTGAAGGGC TTCAAGCCCA ATCTGCTCGG CAAGCCGAAG TTCGAATACA TCTCCCTCGC GTGA
|
Protein sequence | MRILDLGSRT LRRRDILALI GGGAAAAAVG LPALAQEPKK GGVLKVAAPA NPSSLDPATG GAGSDHSILW TIYDTLVEWD YDTLKPRPGM AKWSYPNPTT MVIDITPGIQ FHDGTAMDAE AVKFNLDRNR SDQRSNIKSD LASIESIEVT SPLQVTLKLK SPDTSLPAIL SDRAGMMVSP TNIKALGNET DRKPVGAGPW KFVRWNDNEI IVVARHENYW RKGRPYLDGI EFNIITENAT ALRSVVAGQN DMAFQLPARL KPVIERAKDL TMVSSPTLYC IQVYFNYARA PLDNLKVRQA INFAFDRDTF VKAALSGLGE SARMTLPSSH WAFNKDVAGT YPHDPEKAKK LLAEAGYKDG LELTIGGYTD QDSVRRGEVI QDQLGKVGIR LKFTRGTIAE ISAQFFAQEK KFDLLVSAWT GRPDPSMTYG LGFDKGAYYN AGRTADPELS KLILESRVSE DLAKRAEVFA RIQRITVEQA LSAPLAFQFE LDALSSKVKG FKPNLLGKPK FEYISLA
|
| |