Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0106 |
Symbol | |
ID | 6407749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 115283 |
End bp | 116884 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642710015 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001989144 |
Protein GI | 192288539 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0909337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTTGA CGAAGCGATC CTTTGTAGTG GGAGCCCTCG GCGGCATTGC GCTGATGGGT CTGCCGCTCG AAACCAAAGC GGAAGGTGCG GGGAGCGGCG GAACGCTGGT GATCGGTTCG ACGCAGGTGC CGCGGCATTT CAACGGCGCC GTCCAGTCCG GTATCGCCAC CGCGCTCCCG AGCACGCAGA TCTTCGCCAG TCCGCTGCGC TACGACGACA ACTGGAACCC GCAGCCCTAT CTCGCCGAGT CCTGGGACGT CTCGAAGGAC GGGCTGACGG TGACGCTGAA GCTGGTCAAG AACGCCACCT TCCACGACGG CAAGCCGGTG ACTTCCGAGG ACGTCGCGTT CTCGATCATG ACCATCAAGG CCAATCATCC GTTCAAGACG ATGCTGGCCG CGGTCGAGAG CGTCGATACG CCGGATCCGC ATACGGCGGT GATCAAGCTG GCGCATCCGC ACCCGGCGCT GCTGCTGGCG ATGTCGCCTG CATTGATGCC GATCCTGCCG AAACACGTCT ATGGCGACGG CCAGGACGTC AAATCGCATC CGGCCAACCT GAAGCCGATC GGCTCCGGTC CCTACAAGCT CACCGAGTAC AAGCAGGGCG ACTACTACAC GCTGGAGAAG TTCGACAAAT TCTTCATCCC GGGGCGGCCG AAGCTCGACA AGATCGTGGT GCGGCTGATT TCGGATCCGA ATGCCCTGAT GGTGTCGGCC GAGCGCGGCG ATGTTCATAT GGTGCCCTAC TTCACCGGCG TGCGCGACAT TGAGCGGCTG GAAAAGGCGC CGAACGTCGT CGTCACCGAC AAGGGCTTTG CCGGCATCGG TGCGCTGAAC TGGCTCGCCT TCAACACCAA GAAGAAGCCG CTCGACGACG TTCGCGTCCG TCAGGCGATC GGCTATGCGG CGAACCGCGA GTTCATCGTC AAGAAGCTGA TGGGCGGCAA AGCCTTGCCC TCGACCGGAC CGATCGCGCC GGGCTCGCCG CTGGAAGAGA AGAACGTCGA GCAGTACAAA TTCGACATCG CCAAGGCCAA CAAGCTGCTC GACGAGGCTG GGCTCAAGCC GGACGGCTCC GGCGTGCGCA CCACGCTGAC GATCGACTAC ATCCCCGGCA ACGACGAGCA GCAGCGCAAC GTCGCCGAAT ATCTGCGTTC GGCGCTGAAG CGGGTCGGGA TCAATCTCGA GGTTCGCGCC GCTCCCGACT TCCCGACCTG GGCCCAGCGT GTCGCCAGTT TCGACTTCGA CATGACGATG GACACCGTGT TCAACTGGGG CGACCCGGTG ATCGGCGTCG ACCGGACTTA TCTGAGCTCG AACATCCGCA AGGGCATCAT CTGGTCGAAC ACCCAGCAGT ACGCCAATCC GAAGGTCGAC GAGATCCTCG GCCAGGCCGC CCAGGAGAGC TCGCCGGACA AGCGCAAGGC GCTGTATTCG GAGTTCCAGA AGATCGTCGT CGAAGATGCG CCGATCTTCT ACATCAACGC CACTCCGTAC CACACCTCGT TCGCCAAGGG GCTCGGCAAC CTGCCGACCA CGGTGTGGGG CGTCGCCTCG CCGCTCGACG AGCTGTACTG GGTGACTCCG CCGAAGAACT GA
|
Protein sequence | MMLTKRSFVV GALGGIALMG LPLETKAEGA GSGGTLVIGS TQVPRHFNGA VQSGIATALP STQIFASPLR YDDNWNPQPY LAESWDVSKD GLTVTLKLVK NATFHDGKPV TSEDVAFSIM TIKANHPFKT MLAAVESVDT PDPHTAVIKL AHPHPALLLA MSPALMPILP KHVYGDGQDV KSHPANLKPI GSGPYKLTEY KQGDYYTLEK FDKFFIPGRP KLDKIVVRLI SDPNALMVSA ERGDVHMVPY FTGVRDIERL EKAPNVVVTD KGFAGIGALN WLAFNTKKKP LDDVRVRQAI GYAANREFIV KKLMGGKALP STGPIAPGSP LEEKNVEQYK FDIAKANKLL DEAGLKPDGS GVRTTLTIDY IPGNDEQQRN VAEYLRSALK RVGINLEVRA APDFPTWAQR VASFDFDMTM DTVFNWGDPV IGVDRTYLSS NIRKGIIWSN TQQYANPKVD EILGQAAQES SPDKRKALYS EFQKIVVEDA PIFYINATPY HTSFAKGLGN LPTTVWGVAS PLDELYWVTP PKN
|
| |