Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1660 |
Symbol | |
ID | 6409317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1779123 |
End bp | 1780742 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642711548 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001990663 |
Protein GI | 192290058 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.40005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGTC GTGATTTCCT CAAGTCCGCT ACAGCCCTCG CCGCCGGCGC GATGGTGCCA GCACCGGCGA TTTGGTCGGC CGCCAAGGCC GACGCCCGGT CCGAAACCCT GCTGGTTGTC TCCGAGAGCG GCCCGAACAA CCTCGACATC CACGGCGTCG GCACCAACGT GCCCGGCTAC GAGGTGTCGT GGAATTGCTA CGACCGGCTG ATCACCCACG AAATGAAGGA AGGCCCCGGC GGCGTTCCCT ACTACGATAA GGACAAGTTC AAGGGCGAGC TCGCCGACGA CATGGTCATC GGCGACATGT CTGCGACCTT CAAGCTGAAG AAGAACGCCA CCTTCCAGGA CGGCACCCCG GTCACCGCCA AGGACGTGAA GTGGTCGCTC GACCGCTCGG TCAGCGTCGG CGGCTTTCCG ACCTTCCAGA TGAGCGCCGG CTCGCTGACC AAGCCCGAAC AGTTCGTGGT GGTCGACGAT CACACCGTGC GGGTCGACTT CCTGAAGAAG GACAAGCTCA CGATCCCGGA TCTCGCGGTG ATTGTGCCCT GCGTCGTCAA TTCCGAACTG GTGAAGAAGA ACGCGACCGA AAAAGACCCG TGGGGTCTCG AATACACCAA GCAGAACACA GCCGGTTCCG GCGCCTATCG GGTGGTGAAG TGGACCGCCG GCACCGAAGT GATCATGGAG CGCAACGACA AGTGGGTCGG CGGCCCGCTG CCGAAAATCA AGCGCGTGAT CTGGCGCATG GTGCCGCAGG CCGGCAACCG CCGGGCACTG CTGGAGCGCG GTGACGCCGA CATCTCCTAT GAGCTGCCGA ACCAGGACTT CGCCGAGATG AAGCGCGACG GCAAAGTCAA CGTGGTGTCG TTGCCGATCT CCAACGGCAT CCAGTATCTC GGCATGAACG TCACCCAGCC GCCGTTCAAC AACCCGAAGG TGCGTGAGGC GGTCGCCTAC GCGGTGCCAT ATCAGAAGAT CATCGACGCG GTGATGTTCG GCCTCGCCAA CCCGATGTTC GGCGCGGCGG CCGACAAGGC GACCGAAGTG AAGTGGCCGC AGCCGACCAA GTACAATACC GACATGGCGA AGGCCAAGGC GCTGATGGCA GAAGCCGGCT ACGCGAACGG CTTCGACACC ACGCTGTCGT TCGACCTCGG CTTCGCCGGC GTCAACGAGC CGATGTGCAT CCTGATCCAG GAAAGCCTGG CGCAGATCGG CATCCGCTGC ACCATCAACA AGATCCCCGG CGCCAACTGG CGCACCGAGC TGAACAAGAA GGTGATGCCG CTCTACGTCA ACATCTTCTC GGGCTGGCTC GATTATCCGG AGTACTTCTT CTACTGGTGC TACCACTCCG GCAAGTCGAT CTTCAACACC ATGGGCTACG ACTCGCCCGA GATGGACAGG CTGATCGACA GCTCCCGCAT CGCCGCAGCA ACCGGCGAAA CCGCGACCTA CGACAGCGAC GTCAAGGGCT TCGTCGACCT CGCCTTCAAG GACATCCCGC GCGTCCCACT GTACCAGCCC TACCTCAACG TCGCGATGCA GAAGAACGTC TCCGGCTTCG CCTACTGGTT CCACCGCCGG CTCGACTACC GGACGATGGT GAAGGGCTGA
|
Protein sequence | MKRRDFLKSA TALAAGAMVP APAIWSAAKA DARSETLLVV SESGPNNLDI HGVGTNVPGY EVSWNCYDRL ITHEMKEGPG GVPYYDKDKF KGELADDMVI GDMSATFKLK KNATFQDGTP VTAKDVKWSL DRSVSVGGFP TFQMSAGSLT KPEQFVVVDD HTVRVDFLKK DKLTIPDLAV IVPCVVNSEL VKKNATEKDP WGLEYTKQNT AGSGAYRVVK WTAGTEVIME RNDKWVGGPL PKIKRVIWRM VPQAGNRRAL LERGDADISY ELPNQDFAEM KRDGKVNVVS LPISNGIQYL GMNVTQPPFN NPKVREAVAY AVPYQKIIDA VMFGLANPMF GAAADKATEV KWPQPTKYNT DMAKAKALMA EAGYANGFDT TLSFDLGFAG VNEPMCILIQ ESLAQIGIRC TINKIPGANW RTELNKKVMP LYVNIFSGWL DYPEYFFYWC YHSGKSIFNT MGYDSPEMDR LIDSSRIAAA TGETATYDSD VKGFVDLAFK DIPRVPLYQP YLNVAMQKNV SGFAYWFHRR LDYRTMVKG
|
| |