Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1487 |
Symbol | |
ID | 3908800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1676719 |
End bp | 1677723 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883382 |
Product | extracellular solute-binding protein |
Protein accession | YP_485108 |
Protein GI | 86748612 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4521] ABC-type taurine transport system, periplasmic component |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family [TIGR01729] taurine ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.143655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAC TGATTGCTTC ACTCGCCGTC GCCGGTTTGG CGCTGACCGC GGCGCAGGCC GCCGACAAGC CCGCCAAGAT CACGGTCGGC TATCTCAATC TGGTCAACGC CCAGTTGGTG ACCAAGAACC TCGGCCTGTT GGCCAAAGAG ATGCCGGGTG TCGAGATCAA ATACGTCAAA TTCGGCGGCG GCGGCGACAT GCTGCGCGGC ATCGCCGGCA ACGACGTCGA TTTCGGCGGG CTCGGCAATC CGCCGACCGC GATCGGCATC ACCCGCGGGC TGCCGATCAA GGGCATCCTG GTGCTCAACA TGCTCGGCGA CGTCGAGTCG ATGGTGGTGC GCACCTCGAA GAACATCAAG TCGCTGAAGG ATCTGAAGGG CAAGACGGTG GCGGCGCCGT TCGGCTCCAC CACGCACTAC TTGTTGCTGC AGGCGCTGGC CGACGAGGGC GTCGAGCCGT CGTCGATGAA GATCCTCGAT CTGCCGCCCT CCGACATCGC AACCGCCTGG ATCCGCGGTG ATCTCGACGC CGCCTGGCTG TGGGAGCCCA ATCTGGACAA GGCGGTGAAG AACGGCGGCC ACATCTACAT GTCGTCCGGG CTGATGGAGA AGCGCGGCTA CCCGACCTGG GACATCGGCG TGGTGATGAA CGGATTCGCG GAGAAGTACC CCGACTATGT CGAGAAATTC GTCAAGGCGG AATGCGCCGG CATCGACTTC TGGATCAAGA ACCCGGACAA GACCGCGGCG ATCATCGCCG AGGAGCTGTC GCTGCCGCCG GAAGACGCGA TGCGGATGAT GAACGGCACC GCCATGGTGC CTTGCGACAA GCAGCTGACC GCAACCTATC TCGGCACCAC GGCCAAGAAA GGCCAGTTCG TCGACACGCT GCTGGCCACC GGCGACTTTC TGGTGAAGCA GGAGCGGCTC CCGAAACTGC TGCCGCGCAA GGATTTCGAA GCCTTCCTGG TGCCTGGCTA TATCGAAAAA GTAGTCGGCA AGTAA
|
Protein sequence | MKRLIASLAV AGLALTAAQA ADKPAKITVG YLNLVNAQLV TKNLGLLAKE MPGVEIKYVK FGGGGDMLRG IAGNDVDFGG LGNPPTAIGI TRGLPIKGIL VLNMLGDVES MVVRTSKNIK SLKDLKGKTV AAPFGSTTHY LLLQALADEG VEPSSMKILD LPPSDIATAW IRGDLDAAWL WEPNLDKAVK NGGHIYMSSG LMEKRGYPTW DIGVVMNGFA EKYPDYVEKF VKAECAGIDF WIKNPDKTAA IIAEELSLPP EDAMRMMNGT AMVPCDKQLT ATYLGTTAKK GQFVDTLLAT GDFLVKQERL PKLLPRKDFE AFLVPGYIEK VVGK
|
| |