Gene Information |
Locus tag | Rpal_4631 |
Symbol | |
ID | 6412317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4991082 |
End bp | 4992092 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642714510 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001993597 |
Protein GI | 192292992 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
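The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4991082
end_bp = 4992092

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1011, matching "Gene Length"
print(protein_length)  # 336, matching "Protein Length"
```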
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.464324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGTT TCCGTGCCGC TGCCCTCGCA CTTGCCGCCG TTTTCGTGCC GTTCGCCGCC GGGGCCGCCG AGCAGGTCAA CGTCTACACC TACCGCGAGA CCAAGCTGGT GCAGCCGCTG TTCGATGCCT TCACCAAGGA CACCGGGATC GCCGTCAACG TCATTTCGGC GAGTTCGGGC CTGGAACAGC GGATCAAGGC CGAGGGCGCT GACAGCCCGG CCGACGTGCT GCTGACGGTC GACATCGGCC GGATCGACGA CGCGGTCGCG GCCGGCATCA GCCAGCCGAT CAACTCGGCC GTGATCGACG AGATCGTGCC GGCACAGTTC CGCGATCCCA ACGGCCAGTG GGCCGGCATC TCGATGCGGG CGCGGGTGAT CTACGCCTCG AAAGATCGCG TCAAGCAGGA AGCGATCACC TATGAGGAAC TGGCCGACCC GAAGTGGAAG GGCAAGATCT GCATCCGCTC CGGCCAGCAC ATCTACAACA ACGCGCTGTT CGCCGCTTAC GTCGCCAAGC ACGGCGAGGC CAAGGCCGAG GAATGGCTGC GCGGCCTGAA GGCCAATCTG GCGCAGAAGC CGTCGGGCGG CGACCGCGAG ACCGCGCGCG ACGTCGCGGC CGGCAAATGC GATCTCGGCA TCGGCAACAC CTACTACTGG GCGCTGATGC TGAACGATCC CGACAAGAAG GCCTGGGCGG ATGCAACCCG CGTGGTGCTG CCGACCTTCG AAGGCGGCGG CACCCACGTC AACCTGTCGG GCGTGGTGCT CGCCAAGCAC GCGCCCAACA AGGCCAACGC GGTGAAGCTG ATCGAATGGC TCGTCGGTGA GAAGGCGCAG CAGATCTACG CCGACGCCAA CTACGAATAT CCGATCCGCG CCGGCGTGCC GCTCAATCCG ATCATCGCCG GCTACGGCAA GCTGAAGCCG GATCCGCTGC CGATCGCCAA GATCGCCGCC AACCGCAAGG CCGCCTCGAC GCTGGTCGAC AAGGTCGGAT TCGACAACTG A
|
Protein sequence | MSRFRAAALA LAAVFVPFAA GAAEQVNVYT YRETKLVQPL FDAFTKDTGI AVNVISASSG LEQRIKAEGA DSPADVLLTV DIGRIDDAVA AGISQPINSA VIDEIVPAQF RDPNGQWAGI SMRARVIYAS KDRVKQEAIT YEELADPKWK GKICIRSGQH IYNNALFAAY VAKHGEAKAE EWLRGLKANL AQKPSGGDRE TARDVAAGKC DLGIGNTYYW ALMLNDPDKK AWADATRVVL PTFEGGGTHV NLSGVVLAKH APNKANAVKL IEWLVGEKAQ QIYADANYEY PIRAGVPLNP IIAGYGKLKP DPLPIAKIAA NRKAASTLVD KVGFDN
|
| |
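The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. A minimal sketch of both calculations follows, using only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the example input is just the first 30 bases of the gene sequence rather than the full record. Table 11 shares its codon-to-amino-acid assignments with the standard code and differs in which codons may act as start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in the usual TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases (assumes a non-empty sequence)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGCCGTT TCCGTGCCGC TGCCCTCGCA"
print(translate(prefix))  # MSRFRAAALA, the first ten residues of the protein
print(round(gc_content(prefix), 2))
```

Passing the full Gene sequence field to `translate` and `gc_content` in the same way would allow a comparison against the Protein sequence and GC content fields of the record.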