Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1439 |
Symbol | |
ID | 4021916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1607588 |
End bp | 1608601 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637961631 |
Product | extracellular solute-binding protein |
Protein accession | YP_568577 |
Protein GI | 91975918 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCC ACCGCGCCGC TGCCCTCGCC TTCGTTGCCG TTCTGCTGCC GCTGCAGGCA GGAGCCGCCG AACAGGTCAA TGTCTACACC TATCGCGAAA CCAAGCTGGT TCAGCCTTTG TTCGACGCCT TCACCAAGGA CACCGGCATC GCCGTCAACG TGATCTCGGC GAGTTCCGGG CTGGAGCAGC GGATCAAGGC GGAAGGCGCC AACAGCCCGG CCGACGTGCT GTTGACGGTG GATATCGGAC GCATCGACGA AGCGGTGCAG GCCGGCATCA CCCAGCCGAT CAAGTCCGCA GTGATCGACG AGACCGTGCC GCCGCGCTAT CGCGATCCCG ACGGGCACTG GGCCGGCATC TCGATGCGCG CGCGGGTGAT CTACGCCTCG AAGGAGCGCG TCAAGCAGAA CGCGATCACC TACGAAGAGT TGGCCGACCC GAAATGGAAG GGCAAGATCT GCATCCGGTC CGGCCAGCAC ATCTACAACA ACGCGCTGTT CGCGGCCTAT GTCGCCAAAT ACGGCGAGGA GAAAGCCGAA GCCTGGCTGC GCGGGCTGAA AGCCAATCTG GCGCAGAAGC CGTCGGGGGG CGACCGTGAG ACCGCGCGCG ACGTCGCCGC CGGCAAATGC GACCTCGGCA TCGGCAATAC CTATTACTGG GCGCTGATGA TGAATGGCGA CCCCGACAAG AAGCCGTGGG CCGAGGCGAC CCGGGTGATC CTGCCGACCT TCGAAGGCGG CGGCACCCAC GTCAATCTCT CCGGCGTGTT GCTGGCGAAG AACGCGCCGA ACAAGGACAA CGGCGTGAAG CTGATCGAAT GGCTGGCCGG CGAGAAAGCG CAGCAGATCT ACGCCGACGC CAACTACGAA TATCCGATCC GCCCCGGCGT ACCGCTCAAT CCGACCATCG CCAGCTACGG CCGGCTGACG GCCGATCCGT TGCCGATCGC CAAGATTGCG GCGCAACGCA AGGCCGCCTC GACCCTGGTC GACAAGGTCG GATTCGACAA CTGA
|
Protein sequence | MSRHRAAALA FVAVLLPLQA GAAEQVNVYT YRETKLVQPL FDAFTKDTGI AVNVISASSG LEQRIKAEGA NSPADVLLTV DIGRIDEAVQ AGITQPIKSA VIDETVPPRY RDPDGHWAGI SMRARVIYAS KERVKQNAIT YEELADPKWK GKICIRSGQH IYNNALFAAY VAKYGEEKAE AWLRGLKANL AQKPSGGDRE TARDVAAGKC DLGIGNTYYW ALMMNGDPDK KPWAEATRVI LPTFEGGGTH VNLSGVLLAK NAPNKDNGVK LIEWLAGEKA QQIYADANYE YPIRPGVPLN PTIASYGRLT ADPLPIAKIA AQRKAASTLV DKVGFDN
|
| |