Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3960 |
Symbol | |
ID | 6411641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4249865 |
End bp | 4251190 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642713841 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001992931 |
Protein GI | 192292326 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACT TTACCCTCGA TCGCCGCACG TTGTTGAAGG GTGGCGCGAT CACGTTGGCC ACGGCAGCGA CGATGTCCGC CGAGCAGTTG CTTGGTTACG CCAAGGCCTG GGCGCAGGCC TCGCCGTGGA AGCCCGAACC GGGCGCCAAG ATCAATCTGT TGCGCTGGAA GCGGTTCGTC GAAGCCGAAG ACGTCGCCTT CATGAAGATC GTCGATGCCT TCCAGAAGGC CAACAACGTC ACCATCAACG TCTCCAACGA GTCCTACGAC GACATCCAGC CGAAGGCGTC GGTGGCTGCC AACACCGGGC AGGGGCTCGA CATGGTGTGG GGCCTGTACT CGCTGCCGTT CCTGTTCCCG AACAAATGTA CCGACGTCAG CGACGTCGCC GATTATCTCG CCAAGAAGTG CGGCGGCTGG AGCGACTCCG GCAAGGCCTA CGGCATGTAT AACGGCAAGT GGATCGGCAT TCCGGTGGCG GCTACCGGCG GCCTCGTCAA CTACCGGATC AGCGCGGCCG AGAAGGCCGG CCACAAGGAG TTTCCGAAGG ATCTCGGCGG CTTCTCCGAT CTGGTGAAGG GCCTGAACAA GAACGGCACG CCGGCCGGCA TGGCGCTGGG CCACGCCTCG GGCGACGCCA ACGGCTGGCT GCACTGGGCG CTGTGGGCGC ACGGCGGCGC GCTGATCGAC AAGGACAGCA AGGTCGTCGT CAACTCACCA GAGACCGCCA AGGCGCTCGA ATACGTCAAG GGGCTGTACG AGAACTTCAT TCCCGGCACC GCGTCGTGGA ACGATGCTTC CAACAACAAG GCGTTTCTCG CCGGCCAGCT TTATCTGACC ACCAACGGCA TCTCGATCTA CGTCACGGCG AAGAAAGACA ACAAGGAGAT GGCGGCGGAT ATCAACCACG CGCATCTGCC CGCCGGCCTC AACGGCAAGA CCCGCGAGCT GCATCTCGGC TTCCCGATCC TGATCTACAA CTTCACCAAG TTCCCCCAGA CCTGCAAGGC ATTCACCGCT TTCATGATGG AGCCGGAGCA GTTCAACCCG TGGGTCGAGG CCGCGCAGGG CTATCTGTCA CCGTTCCTAC TCGACTACGA GAAGAACCCG ATGTGGACCG CGGACCCGAA AAACACGCCG TATCGCGACG TCGCCCGCAC CGCCTCGACG CCGGCCGGCG ATGCCCAGAT GGGCGAGAAC GCCGCCGCGG CGATCGCCGA CTTCGTCGTG GTCGACATGT TCGCCAACTA CTGCACCGGC CGCGAAGACG TGAAGACCGC CATGAGCAGC GCCGAACGCG CGGCGAAGCG GATCTTCCGG GCGTAA
|
Protein sequence | MTDFTLDRRT LLKGGAITLA TAATMSAEQL LGYAKAWAQA SPWKPEPGAK INLLRWKRFV EAEDVAFMKI VDAFQKANNV TINVSNESYD DIQPKASVAA NTGQGLDMVW GLYSLPFLFP NKCTDVSDVA DYLAKKCGGW SDSGKAYGMY NGKWIGIPVA ATGGLVNYRI SAAEKAGHKE FPKDLGGFSD LVKGLNKNGT PAGMALGHAS GDANGWLHWA LWAHGGALID KDSKVVVNSP ETAKALEYVK GLYENFIPGT ASWNDASNNK AFLAGQLYLT TNGISIYVTA KKDNKEMAAD INHAHLPAGL NGKTRELHLG FPILIYNFTK FPQTCKAFTA FMMEPEQFNP WVEAAQGYLS PFLLDYEKNP MWTADPKNTP YRDVARTAST PAGDAQMGEN AAAAIADFVV VDMFANYCTG REDVKTAMSS AERAAKRIFR A
|
| |