Gene Information |
Locus tag | RPB_0646 |
Symbol | |
ID | 3908571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 731907 |
End bp | 733232 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637882536 |
Product | extracellular solute-binding protein |
Protein accession | YP_484268 |
Protein GI | 86747772 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
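The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(731907, 733232)
    print(length, protein_length(length))
```

Under these assumptions the sketch gives 1326 bp and 441 aa for RPB_0646, which agrees with the Gene Length and Protein Length fields.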
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.206924 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTTTTC GTTTGAAAGC TCTGACCGCT GCGATGGCGA CGACGATCGC CGCCACGCTG CTGATCGCGC CGGCGCAGGC GGCGACCGAG ATCCAGTGGT GGCACGCCAT GACCGGCGGC AACAACGATG TCGTGGTCAA GCTCGCCAAC GACTTCAACG CCGCGCAGAG CGACTACAAG GTCGTCCCGA CCTACAAGGG CAGCTACGCC GACACCATGA ACGCCGGCAT CGCGGCGTTC CGCGCCGGCA ACGCCCCGCA CATCATGCAG GTGTTCGAGG TCGGCACCGC CACCATGATG GCGGCGACCG GCGCGGTGAA GCCGGTCTAC AAGCTGATGC AGGAGACCGG CGAGACATTC GATCCCAACG CCTACCTGCC GGCGATCACC GGCTACTACT CGACCTCCAA GGGCGAGATG CTGTCGTTCC CGTTCAACTC GTCCTCGACG GTGATGTGGG TCAATCTCGA CGCGCTCAAG AAGGCCGGGA TCGCCGAAGT TCCGAAGACC TGGCCGCAGG TGTTCGAGGA TGCCAAGAAG CTGAAGGCGG CCGGCTACGC CACCTGCGGC TTCTCCACCG CCTGGGTCAC TTGGGTCAAT CTCGAGCAGC TCTCCGCCTG GCACAATGTA CCGCTGGCCA GCAAGGCCAA CGGCCTCGAC GGCTTCGACA CCAAGCTGGA GTTCAACGGC CCGGTGCAGG TCAAGCATCT GGAGACGCTG ATCGAGCTGC AGAAGGACAA GACCTACGAT TATTCCGGCC GCACCAACAC CGGCGAGGGC CGCTTCACCT CCGGCGAGTG CCCGATCTTC CTGACCTCGT CGGGTTTCTT CGGCAACGTC AAGTCTCAGG CCAAGTTCGC CTGGACCAAC GCGCCGATGC CGTACTACCC GGACGTTGCC GGCGCGCCGC AGAATTCGAT CATCGGCGGC GCCTCGCTGT GGGTGATGGG CGGCAAGAGC GCCGACGAAT ACAAGGGCGT CGCCAAGTTC CTCGCCTTCC TGTCCGACAC CGACCGTCAG GTCGCGGTGC ACAAGGCGTC GGGCTATCTG CCGATCACCA AGGCGGCCTA CGAGAAGGCC AAGGCCGACG GCTTCTACAA CGACCAGCCC TATCTCGAGA CCCCGATCAA GGAACTGACC AACAAGCCGC CGACCGAGAA TTCCCGCGGC CTGCGGCTCG GCAACATGGT TCAGCTCCGC GACGTCTGGG CCGAGGAAAT CGAGCAGGCG CTGGCCGGCA AGAAGACCGC CAAGGAAGCG TTGGATGCAG CCGTCACCCG CGGCAACGTG ATGCTGCGGC AGTTCGAAAA GACCGCGGTG AAGTAA
Protein sequence | MTFRLKALTA AMATTIAATL LIAPAQAATE IQWWHAMTGG NNDVVVKLAN DFNAAQSDYK VVPTYKGSYA DTMNAGIAAF RAGNAPHIMQ VFEVGTATMM AATGAVKPVY KLMQETGETF DPNAYLPAIT GYYSTSKGEM LSFPFNSSST VMWVNLDALK KAGIAEVPKT WPQVFEDAKK LKAAGYATCG FSTAWVTWVN LEQLSAWHNV PLASKANGLD GFDTKLEFNG PVQVKHLETL IELQKDKTYD YSGRTNTGEG RFTSGECPIF LTSSGFFGNV KSQAKFAWTN APMPYYPDVA GAPQNSIIGG ASLWVMGGKS ADEYKGVAKF LAFLSDTDRQ VAVHKASGYL PITKAAYEKA KADGFYNDQP YLETPIKELT NKPPTENSRG LRLGNMVQLR DVWAEEIEQA LAGKKTAKEA LDAAVTRGNV MLRQFEKTAV K
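The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the sequences are copied as shown, in space-separated blocks of ten characters, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The sketch does not handle alternative start codons or ambiguous bases, and all names in it are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (third base fastest).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the RPB_0646 gene sequence.
    print(translate("ATGACTTTTC GTTTGAAAGC TCTGACCGCT"))  # prints MTFRLKALTA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the Protein sequence and GC content fields of the record.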