Gene Information |
Locus tag | Rpal_0129 |
Symbol | |
ID | 6407772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 142414 |
End bp | 143694 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642710038 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001989167 |
Protein GI | 192288562 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
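The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based and inclusive (this matches the 1281 bp figure) and that the protein length excludes a single terminal stop codon. The function names are illustrative and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(142414, 143694)
print(length_bp)                  # 1281
print(protein_length(length_bp))  # 426
```

Because the record reports the gene as unspliced, no intron bookkeeping is needed; a spliced gene would require summing exon lengths instead.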
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.57996 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAGA GATGGATCGC CGCAGCCGCG ATGACGCTGG TTGCGGGCAT CGCCCACGCC CAGCAGCAGA CCGAAGTGGT GCTGCAGTTC CCGTATCCGG AACTGTTCAG CGAGACCCAC AAGCGGATCG CCGAAGAGTT CGCCAAGGTG CATCCGGAGA TCAAGGTCAC GTTCCGGGCG CCGTACGAGT CGTACGAAGA GGCCACCCAG AAGGTGCTGC GCGAGGCGGT GACCAATCAG CTGCCGGACG TCACCTTCCA GGGCCTGAAC CGCATCCGCG TGCTGGTCGA CAAGACCATC CCGGCGCCGC TCGATGGCTA CATCGCGGCC GAGAAGGATT TCGACAAGCA GGGCTTCCAT CAGGCGATGT ACGACATCGG CACCGCCAGC GGCAAAGTCT ATGCGCTGCC GTTCGCGATC TCGCTGCCGA TCGTCTACGT CAATCTCGAT CTGGTGAAAC AGGCCGGCGG CGATGTGAAC AACCTGCCGA CCACCTGGGA CGGCCTGTTG GATCTGGCCA AGAAGGTCAA GGCGCTCGGC CCCGACACCA ACGGCATCAC GTATGCCTGG GACATCACTG GCAACTGGCT GTGGCAGGCG CCGGTGTTCG CACGCGGCGG CACCATGCTC AACGCCGATG AAACCAAGGT GGCGTTCGAC GGCCCCGAAG GCCAGTTCGC GATGCGCACC ATCGCCCGCC TGGTCACCGA AGGCGGCATG CCGAATCTCG ACCAGCCGTC GATGCGCGCC ACCTTCGCGG CCGGCAAGAC CGGAATCCAC ATCACCTCGA CCTCGGACCT GAAGAAGACC ACCGACATGA TCGGCGGCAA GTTCGCGCTG AAGACCATCG CGTTCCCGGA CGTCGTCAAG CCGAACGGCC GGTTGCCGGC GGGCGGCAAT GTCGTGCTGA TCACCGCCAA GGACAAAGCC AAGCGCGATG CCGCCTGGGA AGTGGTGAAG TTCTGGACCG GTCCGAAGGG CGCGGCGATC ATGGCTGAGA CCACCGGCTA CATGCCGCCC AACAAGGTCG CCAACGACGT CTATCTGAAG GATTTCTACG CCAAGAACCC GAACAACTAC ACCGCGGTCA GCCAGCTCGC GCTGCTGACC AAGTGGTATG CGTTCCCGGG CGACAACGGC CTGAAGATCA CCGACGTGAT CAAGGATCAT CTCAACTCGA TCGTCTCCGG CGCCCGCGCC AAGGAGCCGG ATGCGGTGCT GGCCGACATG ACCCGCGACG TCCAGAACCT GCTGCCGAAG ACCGTCGGCG CCGCCAAGTA A
|
Protein sequence | MLKRWIAAAA MTLVAGIAHA QQQTEVVLQF PYPELFSETH KRIAEEFAKV HPEIKVTFRA PYESYEEATQ KVLREAVTNQ LPDVTFQGLN RIRVLVDKTI PAPLDGYIAA EKDFDKQGFH QAMYDIGTAS GKVYALPFAI SLPIVYVNLD LVKQAGGDVN NLPTTWDGLL DLAKKVKALG PDTNGITYAW DITGNWLWQA PVFARGGTML NADETKVAFD GPEGQFAMRT IARLVTEGGM PNLDQPSMRA TFAAGKTGIH ITSTSDLKKT TDMIGGKFAL KTIAFPDVVK PNGRLPAGGN VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKVANDVYLK DFYAKNPNNY TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVSGARA KEPDAVLADM TRDVQNLLPK TVGAAK
|
| |
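The gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) and is grouped in blocks of ten bases, so the spaces must be stripped before any analysis. The following is a minimal sketch, not the database's own pipeline, of how the protein sequence and GC content could be derived from it. It builds the codon table from the NCBI amino-acid string, whose assignments for translation table 11 are the same as for the standard code. It does not handle the alternative start codons that table 11 permits, which is harmless here because this gene starts with ATG. All function and variable names are hypothetical, and only the first three and the last two sequence blocks from the record are used to keep the example short.

```python
from itertools import product

# NCBI codon ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = "ATGCTGAAGA GATGGATCGC CGCAGCCGCG"  # first three blocks of the gene sequence
tail = "GCCGCCAAGTA A"                     # last two blocks of the gene sequence

print(translate(head))  # MLKRWIAAAA
print(translate(tail))  # AAK
print(round(gc_fraction(head), 3))
```

The two translations agree with the start (MLKRWIAAAA) and end (AAK) of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content reported above.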