Gene Information |
Locus tag | RPC_3729 |
Symbol | |
ID | 3971474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4149958 |
End bp | 4151775 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637926839 |
Product | extracellular solute-binding protein |
Protein accession | YP_533583 |
Protein GI | 90425213 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
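The coordinate, length, and protein-length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the reported gene length agrees with it) and that the final codon is a stop codon that adds no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4149958
end_bp = 4151775

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS is read in consecutive triplets.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1818
print(remainder)          # 0
print(protein_length_aa)  # 605
```

Both results match the Gene Length (1818 bp) and Protein Length (605 aa) fields in the table.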
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCGCGA TCGGCGTTAC GTCGACGCAG GCGGCGCCGG CCGAACCGGT CTGGCGTCAC GGCCTGTCGC TGTTCGGCGA CGTCAAATAT CCCGCCGACT TCAAGCGATT CGACTACGTC GATGCCAACG CCCCGAAAGG CGGCGCGGCG CGGCAGATTT CGATCGGCAC CTTCGATAAT TTCAATCTGG CGGTGGCCGG CGTGAAGGGC TCGATCGCGC CCGCGGTCGG GCTGATCTAC GAAACCCTGA TGACGCAATC GCAGGACGAG GTCGGCGCCG AATACGGGCT GCTCGCGGAA GCGGCGGCGC ATCCCGACGA TCACTCCTCG GTGACCTATC GGCTGCGCGC CAATGCGCGC TGGCACGACG GCAAACCGGT GACCCCGGAG GATGTGATCT TCTCGCTGGA GGCGCTGAAG AAATACAGCC CGCGTTATGC CTCGTATTAT CGCCACGTCG TGAAGGCCGA GAAGACCGGC GACCGCGAGA TCAAATTCAG CTTCGACATG CCGGGCAACC GCGAATTGCC GACCATCGTC GGCGAACTCG TCGTGTTGCC GAAGCATTGG TGGGAGGGCA GCGACGAGCA GGGCCGGCCG CGCGACATTT CCGCGACCAC GCTGGAAAAG CCGCTCGGGT CGGGTCCGTA TCGCATCAAG GATTTCGTCG CCGGCCGTTC GGTGACGCTG GAACGGGTCA AGGACTATTG GGGCGCCGCG GTGCCGGCGC GGGTCGGCCA GAACAATCTC GACGAACTGC GCTACGAATT CTTCCGCGAC AATCTGGTGG CGCTGGAAGC CTTCAAGGCC GACCAGGCCG ACTGGATCTT CGAGAATTCC GCCAAGCAAT GGGCCACCGC CTATGACTTC CCGGCGGTGA CCGAGAAGCG CGTCGTCAAG GAAGAATTCC CGATCAACGA TTCCGGGCGG ATGCAGGCCT TCATCTTCAA TCTGCGCCGC GAGATGTTCC AGGATGCGAG GCTGCGCCGC GCCTTCAACT ACGCGTTCGA TTTCGAGGAG ATGAACAAGC AACTGTTCTA CGGACAATAC AACCGGATCA ACAGCTACTT CGAAGGTACC GAACTGGCCT CCAGCGGGCT GCCGCAGGGT GCCGAACTGG CGCTGCTGGA GCCGTTGCGC GACAAGTTGC CCGCCGAGCT GTTCACCACG CCCTACGCCA ACCCGGTCGG CGGCAATTCG GACGCGGTGC GCGGCAATCT GCGCGAGGCG ATGCGGCTGT TGAAGGAGGC GGGATTCGAA GTGCGCGACC GCCGGCTGGT CGATGCCGCC GGCAAGCCGG TGCTGGTGGA GATCCTGGTG CGGGATCCCT CCTCGGAGCG GATCGCGCTG TTCTACAAGC CGTCGCTGGA ACGGATCGGC GTCACGGTGT CGATCCGCAC CGTCGACGAC GCGCAGTACG AGAACCGGGT TCGCGCGTAC GATTTCGACA TGATCACCGA TCTGTGGGGC CAGTCGCTGT CGCCCGGCAA CGAGCAGCGC GACTATTGGG GCTCGCAGGC CGCCGATCAG CCGGGCTCGC GCAACACCAT CGGCATCAAG AATCCCGCGG TCGATGCGCT GATCGAGAAA GTGATCTTCG CCAAGGACCG CGCCTCGCTG GTCGCCGCCA CCCGCGCGCT CGATCGCGTG TTGCTATGGA ATTTCTATCT GGTGCCGCAG TTCACCTACG GCTATGCGCG CTACGCGCGC TGGGATCGCT TCAGCCACGC CGAGCTGCCG AAATACGCCC GCGCCGGGTT GCCGTCGCTG TGGTGGTACG ACGCCGACAA GGCCGCCCGG 
ATCGGCAAAC GCTCTTGA
|
Protein sequence | MSAIGVTSTQ AAPAEPVWRH GLSLFGDVKY PADFKRFDYV DANAPKGGAA RQISIGTFDN FNLAVAGVKG SIAPAVGLIY ETLMTQSQDE VGAEYGLLAE AAAHPDDHSS VTYRLRANAR WHDGKPVTPE DVIFSLEALK KYSPRYASYY RHVVKAEKTG DREIKFSFDM PGNRELPTIV GELVVLPKHW WEGSDEQGRP RDISATTLEK PLGSGPYRIK DFVAGRSVTL ERVKDYWGAA VPARVGQNNL DELRYEFFRD NLVALEAFKA DQADWIFENS AKQWATAYDF PAVTEKRVVK EEFPINDSGR MQAFIFNLRR EMFQDARLRR AFNYAFDFEE MNKQLFYGQY NRINSYFEGT ELASSGLPQG AELALLEPLR DKLPAELFTT PYANPVGGNS DAVRGNLREA MRLLKEAGFE VRDRRLVDAA GKPVLVEILV RDPSSERIAL FYKPSLERIG VTVSIRTVDD AQYENRVRAY DFDMITDLWG QSLSPGNEQR DYWGSQAADQ PGSRNTIGIK NPAVDALIEK VIFAKDRASL VAATRALDRV LLWNFYLVPQ FTYGYARYAR WDRFSHAELP KYARAGLPSL WWYDADKAAR IGKRS
|
| |
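The gene sequence begins with TTG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted initiation codon and an initiation codon is translated as methionine. The sketch below illustrates this on the first and last few codons of the gene sequence. The function name `translate_cds` and its `is_cds_start` parameter are hypothetical, and the start-codon set is only a subset of the initiators that table 11 lists.

```python
# Codon table laid out in NCBI order: bases T, C, A, G at each position.
# Table 11 assigns the same amino acids as the standard code; it differs
# in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna, is_cds_start=True):
    """Translate a sense-strand DNA fragment, stopping at the first stop codon.

    When is_cds_start is True and the first codon is a known initiator,
    it is emitted as M regardless of its usual amino acid.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            amino_acid = "M"
        else:
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
            amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 bases and last 18 bases of the gene sequence in this record.
head = translate_cds("TTGTCCGCGA TCGGC")
tail = translate_cds("ATCGGCAAAC GCTCTTGA", is_cds_start=False)

print(head)  # MSAIG
print(tail)  # IGKRS
```

These fragments agree with the start (MSAIG) and end (IGKRS) of the protein sequence in the record. The sequence shown is already in the coding orientation, so no reverse complement is needed even though the gene lies on the minus strand.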