Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0174 |
Symbol | |
ID | 3907779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 188622 |
End bp | 189902 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637882056 |
Product | extracellular solute-binding protein |
Protein accession | YP_483797 |
Protein GI | 86747301 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.396862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCGAA CCTGGATGGC AGCGGCGGCT TTCACGCTCG CCGCCGGCTG CGCTCACGCG CAGACGCAGA CCGAAGTCGT GCTGCAATAT CCCTATCCGG AGCTGTTCAC CGAGACCCAC AAGCAGATCG CGGCCGAATT CGCCAAGGTG CATCCGGAAA TCAAGGTGAC GTTCCGCGCG CCTTACGAAT CCTATGAAGA AGGCACCCAG AAGGTGCTGC GCGAGGCGGT CACCAATCAG GTCCCCGACG TCACCTTCCA GGGCCTGAAC CGCGTCCGCG TGCTGGTCGA CAAGAACATT CCGGCCGAAC TCGACGGCTA CATCGCCGCC GAAAAGGATT TCGACAAGCA GGGCTTCCAC CAGGCGATGT ACGACATCGG CACCGCCAGC GGAAAGGTCT ACGCGCTGCC GTTCGCGATC TCGCTGCCGA TCGTCTACGT CAATGTCGAT CTGGTGAAAC AGGTCGGCGG CGATCCGAAC AATCTGCCGA CCAGCTGGGA CGGCCTGATC GACCTCGCCA AGAAGGTCAA GGCGCTCGGC CCGGACTATA ACGGCATCAC CTATGCGTGG GACATCACCG GCAACTGGCT GTGGCAGGCG CCGGTGTTCG CCCGCGGCGG CACCATGCTG AACGCGGACG AAACCAAGGT GGCGTTCGAT GGTCCCGAAG GCCAGTTCGC CATGAAGCAG ATCGCCCGCC TCGTCACCGA GGGCGGCATG CCGAATCTCG ACCAGCCGTC GATGCGCGCC GCCTTCGCGG CGGGCAAGAC CGGCATCCAC ATCACCTCGA CCTCCGATCT CAACAAGACC ACGCAGATGA TCGGCGGCAA GTTCACGCTG AAGACCCACA TCTTCCCGGA CGTGGTCAAG CCGAACGGCC GTCTGCCGGC CGGCGGCAAC GTGGTGCTGA TCACCGCCAA GGACAAGGCC AAGCGTGACG CGGCCTGGGA AGTGGTGAAG TTCTGGACCG GCCCGAAGGG CGCCGCGATC ATGGCGGAGA CCACCGGCTA CATGCCGCCC AACAAGGTCG CCAACGACGT CTATCTGAAG GACTTCTACG AGAAGAACCC GAACAACTAC ACCGCGGTGA GCCAGCTCGC GCTGCTGACC AAATGGTACG CGTTCCCGGG CGACAACGGC CTCAAGATCA CCGACGTGAT CAAGGATCAT CTCAACTCGA TCGTGACCGG AACCCGGGCC AAGGAGCCGG ACGCGGTGCT CGCCGACATG ACCAAGGACG TGCAGAAGCT GCTGCCGAAA TCGGTCGGCG CGGCGCGCTG A
|
Protein sequence | MLRTWMAAAA FTLAAGCAHA QTQTEVVLQY PYPELFTETH KQIAAEFAKV HPEIKVTFRA PYESYEEGTQ KVLREAVTNQ VPDVTFQGLN RVRVLVDKNI PAELDGYIAA EKDFDKQGFH QAMYDIGTAS GKVYALPFAI SLPIVYVNVD LVKQVGGDPN NLPTSWDGLI DLAKKVKALG PDYNGITYAW DITGNWLWQA PVFARGGTML NADETKVAFD GPEGQFAMKQ IARLVTEGGM PNLDQPSMRA AFAAGKTGIH ITSTSDLNKT TQMIGGKFTL KTHIFPDVVK PNGRLPAGGN VVLITAKDKA KRDAAWEVVK FWTGPKGAAI MAETTGYMPP NKVANDVYLK DFYEKNPNNY TAVSQLALLT KWYAFPGDNG LKITDVIKDH LNSIVTGTRA KEPDAVLADM TKDVQKLLPK SVGAAR
|
| |