Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0729 |
Symbol | |
ID | 3970551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 797077 |
End bp | 798033 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637923844 |
Product | extracellular solute-binding protein |
Protein accession | YP_530619 |
Protein GI | 90422249 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.446742 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.30254 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTATA AGACCCGACC GGCGCGGACC TCCTCAATGC AACCACGGAT TGTTTTGATC ATCAACGACA GATTGCTGTG CCGCGCCCGT GCAACCGCGA TCGCGGCGCT ATCGCTCGGG CTGCTGCTGG TGGGCGCGCC GCTCGCCCCG GCAATTGCGC AGGCCCCCGC CAAGGGCGTG TCGCAGCCGG CGCCGAAAGC GGTGCCGAGC TTCTGGGATC CGCGACGGCG GCCGGAGCGG CCGGACCTGT CGCGGCTCAC CGTGATCCGC TTTCTCACCG AAACCGATTA TCCGCCGTTC AACTTCACCG GCGCCGATGG CAATCCGGCG GGCTTCAATG TCGATCTGGC GCGGGCGTTG TGCGAGGAGA TCAAGGTCAC CTGCACGGTG CAGATGCGCC GCTTCGAGAC CCTGCTCGAT GCGGTCGCCA GCAACCGCGG CGACGCCATC ATCGCCTCCT TGGCGGTGAC GCCGCAGATG CGCGCCCGGG TGGATTTCAC CGATCCGTAT TACCGCGCCC CGGCGCGCTT CGTGTCGCGC AAGGACGGCG TGCTGGCCGA GATGCGGCCG GAATATCTCG AGGGCAAGAA GGTCGGCGCG ATCAGTGGCT CCTCGCACGA GGCCTATCTG AAAGTGATGT TCACCGACGC CGAGCTGGTG CCCTATCCGA ACGACGACGC GCTGCGCGCC GCACTGCGCC GCGGCGAGGT CGACTACATC TTCGGCGACG CCATCTCGCT GGCGTTCTGG ATCAACGGCA CCGATTCCGC GGATTGCTGC GCGTTTTCCG GCGGCCCGTT CGTCGAAAGC CGGTATTTCG GCGAGGGCGT CGGCATCGCG GTCCGCAAGG GCAACGATCT GTTGCGCCAG GCCTTGAACT GGGCGCTGTT CCGGGTCTGG GAAAAAGGCC GCTACACCGA CCTCTGGCTG CGCTATTTTT CCGTCAGCCC GTTCTAA
|
Protein sequence | MVYKTRPART SSMQPRIVLI INDRLLCRAR ATAIAALSLG LLLVGAPLAP AIAQAPAKGV SQPAPKAVPS FWDPRRRPER PDLSRLTVIR FLTETDYPPF NFTGADGNPA GFNVDLARAL CEEIKVTCTV QMRRFETLLD AVASNRGDAI IASLAVTPQM RARVDFTDPY YRAPARFVSR KDGVLAEMRP EYLEGKKVGA ISGSSHEAYL KVMFTDAELV PYPNDDALRA ALRRGEVDYI FGDAISLAFW INGTDSADCC AFSGGPFVES RYFGEGVGIA VRKGNDLLRQ ALNWALFRVW EKGRYTDLWL RYFSVSPF
|
| |