Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3947 |
Symbol | |
ID | 3969234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4397087 |
End bp | 4398103 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927051 |
Product | extracellular solute-binding protein |
Protein accession | YP_533792 |
Protein GI | 90425422 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000383863 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCCA TCCGCGCCGC CGCCCTCGCT GTCGCCGCCG CCCTGCTCCC GTTGCAGACG GCCGCCGCCG CCGAACAGGT CAACGTCTAC ACCTATCGCG AGAGCAAGCT GGTGCAGCCG TTGTTCGACG CCTTCACCAA GGACACCGGC ATCGCCGTCA ACGTGATCTC GGCCTCCTCC GGCCTCGAGC AGCGGATCAA AGCGGAAGGC GCCGGCAGCC CCGCCGACGT GTTGCTGACG GTCGACATCG GCCGCATCGA CGACGCGGTC GCCGCCGGGG TCAGCCAGCC GATCAATTCG CCGGTGATCG ACGAGATCGT GCCGCCGCAA TATCGCGATC CGGACGGCCA TTGGGCCGGC ATCTCGATGC GGGCGCGGGT GATCTACGCC TCGAAGGATC GCGTCAAGCA GCAGGCCATC ACCTACGAGG AACTCGCCGA TCCGAAATGG AAGGGCAAGA TCTGCATCCG CTCCGGCCAG CACATCTACA ACAACGCGCT GTTCGCCGCC TATGCGGCTC ACCACGGCGA AGCCAAGGCC GAGCAATGGC TGCGCGGCTT GAAGGCCAAT CTGGCGCAGA AGCCGTCGGG CGGCGACCGC GAGACCGCGC GCGACGTCGC CGCCGGCAAA TGCGACCTCG GCATCGGCAA CACCTACTAC TGGGCGCTGA TGATGAACGG CGATCCCGAC AAGAAGCCGT GGGCGGAAGC CACCCGCGTG ATCCTGCCGA CCTTCGAGGG CGGCGGCACC CACGTCAACC TGTCCGGCGT ACTGCTGGCC AAGCACGCGC CGAACAAGGC CAATGCGCTG AAGCTGATCG AATGGCTGGC CGGCGACAAG GCGCAGCAGA TCTATGCCGA CGCCAACTAC GAATATCCGA TCCGCCCCGG CGTCGCGCTC AATCCGACCA TCGCGAGCTA CGGACGATTG ACCGCCGATC CGATGCCGAT CGCCAAGATC GCCGCGCAGC GCAAGACCGC CTCGACCCTG GTCGACAAGG TCGGGTTCGA CAACTGA
|
Protein sequence | MIAIRAAALA VAAALLPLQT AAAAEQVNVY TYRESKLVQP LFDAFTKDTG IAVNVISASS GLEQRIKAEG AGSPADVLLT VDIGRIDDAV AAGVSQPINS PVIDEIVPPQ YRDPDGHWAG ISMRARVIYA SKDRVKQQAI TYEELADPKW KGKICIRSGQ HIYNNALFAA YAAHHGEAKA EQWLRGLKAN LAQKPSGGDR ETARDVAAGK CDLGIGNTYY WALMMNGDPD KKPWAEATRV ILPTFEGGGT HVNLSGVLLA KHAPNKANAL KLIEWLAGDK AQQIYADANY EYPIRPGVAL NPTIASYGRL TADPMPIAKI AAQRKTASTL VDKVGFDN
|
| |