Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1463 |
Symbol | |
ID | 3908413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1652516 |
End bp | 1653532 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883357 |
Product | extracellular solute-binding protein |
Protein accession | YP_485084 |
Protein GI | 86748588 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.46603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCC GCTTTCGCAC CGCTGCCCTC GCCCTCGCCG CTGTCGTCTT GCCAGCGCAG GCGTTCGCAG CCGAACAGGT CAACGTCTAC ACCTATCGCG AGACCAAGCT GGTCCAGCCG CTGTTCGACG CTTTCACCAA GGACACCGGC ATCGCCGTCA ACGTGATTTC GGCGAGTTCG GGGCTGGAGC AGCGGATGAA GGCGGAAGGC GCCAACAGCC CGGCCGACGT GCTGCTGACG GTCGATATCG GGCGGATCGA CGAGGCGGTG GCGGCCGGCG TCACCCAGCC GATCCAGTCG GCGGTGGTCG ACGAGATCGT GCCGCCGCGC TATCGCGATC CCGACGGCCA CTGGGCCGGC ATCTCGATGC GGGCGCGGGT GATCTACGCC TCGAAGGACC GCGTCAAGCA AGACGCGATC ACCTACGAGG AACTGGCCGA TCCAAAATGG AAGGGCAAGA TCTGCATCCG CTCTGGCCAG CACATCTACA ACAACGCGCT GTTTGCCGCC TATATCGCCA AGCACGGCGA GGAGAAGGCC GAGGCCTGGC TGCGCGGCCT CAAGGCCAAT CTGGCGCAGA AACCGTCGGG CGGCGACCGC GAGACGGCGC GCGACGTGGC GGCGGGCAAA TGCGACATCG GCATCGGCAA CACCTACTAC TGGGCGCTGA TGATGAACGG CGATCCCGAC AAGAAGCCGT GGGCGGAAGC GACCCGCGTG ATCCTGCCGA CCTTCGAGGG CGGCGGCACC CACGTCAATC TGTCGGGCGT GCTGCTGGCC AAGAACGCGC CGAACAAGGC CAACGGCGTC AAGCTGATCG AATGGCTGCT CGGCGAGAAG GCGCAGCAGA TCTACGCCAA CGCCAACTAC GAATATCCGA TCCGCCCCGG CGTGCCGCTC AACCCGACCA TTGCCGGCTA CGGCAAGCTG ACCGCCGACT CGCTGCCGAT CGCCAAGATC GCCGCGCAGC GCAAGGCCGC CTCGACGCTG GTCGACAAGG TCGGGTTCGA CAACTGA
|
Protein sequence | MSRRFRTAAL ALAAVVLPAQ AFAAEQVNVY TYRETKLVQP LFDAFTKDTG IAVNVISASS GLEQRMKAEG ANSPADVLLT VDIGRIDEAV AAGVTQPIQS AVVDEIVPPR YRDPDGHWAG ISMRARVIYA SKDRVKQDAI TYEELADPKW KGKICIRSGQ HIYNNALFAA YIAKHGEEKA EAWLRGLKAN LAQKPSGGDR ETARDVAAGK CDIGIGNTYY WALMMNGDPD KKPWAEATRV ILPTFEGGGT HVNLSGVLLA KNAPNKANGV KLIEWLLGEK AQQIYANANY EYPIRPGVPL NPTIAGYGKL TADSLPIAKI AAQRKAASTL VDKVGFDN
|
| |