Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2571 |
Symbol | |
ID | 6410233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2778422 |
End bp | 2780011 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642712449 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001991559 |
Protein GI | 192290954 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.13754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAC GTGAATTCGC CAAACTGGGC CTGATGGCCG GCGCGCTCGG CATCGGCGGC ATCCCGCTCG GCATCACGCG GGCCGCGGGA CAAACCCGCG GCGGCACGCT CAACACCATC ATTCAGCCGG AGCCGCCGAT CCTGGTCACC GCGCTCAACC AGCAGCAGCC GACGCTGACG CTGGGCGGCA AGATCTACGA GAGCCTGCTG CGCTATGATT TCGATCTCAA GCCGCTGCCG GGCCTCGCCC AGTCCTGGGA AGTGTCGCCG GACAAGCTGA CCTACACGTT CAAGCTGTTC CCCAACATCA CCTTCCACGA CGGCACGCCG CTGACCTCGG AAGACGTGGT GTTCTCGATC ACGAAGATTC TGATGGAGAA CCACGCGCGC GCCCGCAACA CGTTCTCGCG CATCGACAAG GCCGAGGCGC CGGATCCGCT CACGGTGGTC TTCACCTTGA AGAAGCCGTT CGCGCCGTTC CTCACCGCGT TCGATTGCAC CACGGCGCCG ATCGTGCCGA AGCACATCTA CGACGGCACC GACTATCGCA AGAACCCGGC CAACGCCAAG GCGATCGGCT CCGGCCCGTT CAAGTTGAAG GAATGGGTGC GCGGTTCGCA CGTCCATCTG GTCAAGCACG AGGGTTACTA TCGTCCGGGT GAGCCTGTCC TCGACGAGAT CATTTATCGC GTCATCCCGG ATTCCGCGTC GCGTTCGGTG GCACTGGAGC AGGGGACCGT TCAGCTCACG CAGTGGACCG ACGTTGAGCT GTTCGAGGTG CCGCGGCTGT CGAAGCTGCC GCATCTGACG ATGACCACCA AGGGCTACGA GTTCTTTGCG CCGCATACCT GGCTGGAGAT CAACAACCGC ATCGCGCCGA TGAACGACAA GCGGTTCCGG CAGGCGGTGA TGTATGCGAT CGACCGCAAG GCGCTGCTGA ACCGGATCTA TTTCGGCCTC GGCAAGGTTG CGACCGGCCC CGTGTCGTCG AAGACCAAGT TCTACGAAAA GGACGTCAAG AAGTACGACT TCTCGCCCGA GAAGGCGAAG GCGTTGCTCG ACGAGATGGG GCTGAAGCCG GGCCCCGACG GCAAGCGCGT GACGATTCCC TTCCTGGTGC CGCCCTACGG CGAAACCCAT CAGCGGACCG CCGAATTCCT GCGACAGTCG CTCGCCCGCG TCGGCATCGA TCTGCAACTG CAGGGCATCG ATGTCGCGGG ATGGGCCGAG AAATTCAGCA ACTGGGATTT CTCGATGACT ACGACCACGG TCTATCAGTT CGGCGATCCG GCGCTCGGCG TGTCGCGGAG CTACGTCTCC TCCAACATCC GCAAGGGCAT CCTGTTCTCC AACACCTGCG GCTATTCCAA TCCGGAAGTC GATCGGCTGT TCGAGGAGGC CGCGACCGCG ACGTCGGACG ACAAGCGTCA GGAGCACTAC AGCGCGCTGC AGAAGATCAT GGTCGATGAG GTGCCGGTCA TCTGGCTGCT CGAGATCGAC TATCCGAACC TCATGGACAA GCGGCTGAAG AACGTGGTGA CGTCGGCGAT CGGCGTGCAC GACACCTTCG GGACGGTTTC GTTCGGATGA
|
Protein sequence | MNRREFAKLG LMAGALGIGG IPLGITRAAG QTRGGTLNTI IQPEPPILVT ALNQQQPTLT LGGKIYESLL RYDFDLKPLP GLAQSWEVSP DKLTYTFKLF PNITFHDGTP LTSEDVVFSI TKILMENHAR ARNTFSRIDK AEAPDPLTVV FTLKKPFAPF LTAFDCTTAP IVPKHIYDGT DYRKNPANAK AIGSGPFKLK EWVRGSHVHL VKHEGYYRPG EPVLDEIIYR VIPDSASRSV ALEQGTVQLT QWTDVELFEV PRLSKLPHLT MTTKGYEFFA PHTWLEINNR IAPMNDKRFR QAVMYAIDRK ALLNRIYFGL GKVATGPVSS KTKFYEKDVK KYDFSPEKAK ALLDEMGLKP GPDGKRVTIP FLVPPYGETH QRTAEFLRQS LARVGIDLQL QGIDVAGWAE KFSNWDFSMT TTTVYQFGDP ALGVSRSYVS SNIRKGILFS NTCGYSNPEV DRLFEEAATA TSDDKRQEHY SALQKIMVDE VPVIWLLEID YPNLMDKRLK NVVTSAIGVH DTFGTVSFG
|
| |