Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4214 |
Symbol | |
ID | 6411898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4519001 |
End bp | 4520896 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642714096 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001993185 |
Protein GI | 192292580 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.159532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACGTTGA CCCGACGACA TCTTCTGCAG GGCGGCTTGT TGGCCGCAGC GACCCCGGCT TTGAGCTTCA GCCCCGGCCT GTTTGGGGCG AGCGCGGCGC GCGCCGAAAC CGCGGTCGAT GGGGCGGCAT GGCGCCACGG TCTGTCGCTG TTCGGGGAGC TGAAATATCC GGCCGGCTTC GCGCAGTTCG ATTATGTGAA TCCGAAAGCA CCGAAGGGCG GTGCCGCGCG GCAGATCGCG CTAGGCACCT TCGACAATTT CAATCTCGCG GTCGCCGGCG TGAAAGGTAA CATCGCCGGG CCGGTGGGGT ATCTCTACGA AACCCTGATG ACGCCGTCGC AGGACGAGGT CGGCACCGAA TACGGCTTGC TCGCCGAGGG CGCTGCGCAT CCCGACGATT TTTCCTGGGT GATCTATCGC GTGCGCAAGG AAGCGCGCTG GAACGACGGC AAGCCGGTCA CGGCCGACGA CGTCGTCTTT TCGTTCGATG CGCTGAAGAA ATACAGCCCA CGCTACGCCT CGTATTATCG TCACGTCGTC AAGGCCGAGA AGGTCGGCGA GCGCGACGTC CGCTTCACCT TCGATGCGCC GGGCAACCGC GAACTGCCGA CCATCGTCGG CGAATTGATG GTGCTGCCGA AGCATTGGTG GGAGGGCACT GACGCCCAGG GGCGCAAGCG CGACGTCTCG GCAACAACGC TGGAGCCTCC GCTCGGCTCG GCGCCCTACA AGATCAAGGA CTTCGTCGCC GGCCGTTCGA TCGTGCTGGA GCGCGTGAAG GACTATTGGG GCGAGAAGCT GCCGGTGCGG ATCGGCCAGA ACAATTTCGA CGAGCTGCGG TTCGAGTACT TCCGTGACAA CACCGTCGCA CTGGAGGCCT TCAAGGCCGA CCAGGCCGAC TGGATCATGG AGAACTCCGC CAAGCAGTGG GCGACTGCCT ACGACTTTCC CGCGGTGAAC GACAAGCGTG TTGTCAAAGA AGAATTCCCG ATCAACGATT CGGGACGGAT GCAGGCGTTC GTGCTGAATA CCCGCCGCGA GATGTTCAAG GATCCGCGGG TGCGGCGCGC GTTCAACTAC GCGTTCGATT TCGAAGAGAT GAACAAGCAG CTGTTCTATG GACAGTACAA GCGGATCGCG AGCTTCTTCG AAGGCACCGA GCTCGCCTCC AGCGGACTAC CTGAAGGGCA GGAACTGGCG CTGCTCGAAA CCGTGCGCGA CAAGGTGCCG GCCGAGCTGT TCACGCAGCC CTATACCAAT CCAGTCGGCG GCAACCCGGA GGCGGTACGC GCCAATCTCC GTGAGGCGAT CAAGCTGGTG AAAGAGGCTG GCTTCGACAT CAAGGATCGC AAACTGGTCG ATCCGTCCGG CAAGCCGGTC GCTGTCGAGA TCCTGGTGCA GGACCCGTCG TCGGAGCGGA TTGCGCTGTT CTACAAGCCT TCGCTGGAGC GGCTCGGCGT CACCGTCTCG ATCCGCGTGG TCGACGACGC GCAGTATCAG AACCGGATTC GCGCGTTCGA TTTCGACATC ATCACCGACC TGTGGGGCCA GTCGCTGTCG CCCGGTAATG AACAGCGCGA TTATTGGGGA TCACAGGCGG CCAATGAGCA GGGCTCGCAC AACACCATCG GCATCAAGAA TCCGGCCGTC GATGAGCTGA TCGAAAAGGT GATCTACGCC AAGGACCGGC CCTCGTTGAT TGCGGCGACG CGAGCGCTCG ACCGCGTGCT GCTGTGGAAC TTCTATGTCG TCCCACAATT CACCTACGGC TTCATGCGCT ACGCGCGCTG GGACCGGTTT GGGCACGCGC CGCTGCCGAA ATACGCTCGC TCTGGTCTGC CGGCGTTGTG GTGGTACGAC GCCGACAAGG CCGCCAATCT CGGCAAGCGC TCTTGA
|
Protein sequence | MTLTRRHLLQ GGLLAAATPA LSFSPGLFGA SAARAETAVD GAAWRHGLSL FGELKYPAGF AQFDYVNPKA PKGGAARQIA LGTFDNFNLA VAGVKGNIAG PVGYLYETLM TPSQDEVGTE YGLLAEGAAH PDDFSWVIYR VRKEARWNDG KPVTADDVVF SFDALKKYSP RYASYYRHVV KAEKVGERDV RFTFDAPGNR ELPTIVGELM VLPKHWWEGT DAQGRKRDVS ATTLEPPLGS APYKIKDFVA GRSIVLERVK DYWGEKLPVR IGQNNFDELR FEYFRDNTVA LEAFKADQAD WIMENSAKQW ATAYDFPAVN DKRVVKEEFP INDSGRMQAF VLNTRREMFK DPRVRRAFNY AFDFEEMNKQ LFYGQYKRIA SFFEGTELAS SGLPEGQELA LLETVRDKVP AELFTQPYTN PVGGNPEAVR ANLREAIKLV KEAGFDIKDR KLVDPSGKPV AVEILVQDPS SERIALFYKP SLERLGVTVS IRVVDDAQYQ NRIRAFDFDI ITDLWGQSLS PGNEQRDYWG SQAANEQGSH NTIGIKNPAV DELIEKVIYA KDRPSLIAAT RALDRVLLWN FYVVPQFTYG FMRYARWDRF GHAPLPKYAR SGLPALWWYD ADKAANLGKR S
|
| |