Gene Information |
Locus tag | RPC_1273 |
Symbol | |
ID | 3973031 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1388495 |
End bp | 1389496 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637924383 |
Product | putative periplasmic solute-binding protein |
Protein accession | YP_531154 |
Protein GI | 90422784 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.384428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.646198 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGT CTCGCTGGGT GTTCGCGCTG GTTGCCGTGC CGCTGTTGCT GCAGCCGGCC AAGGCGGCGG ATGTGATCTG CTACAATTGT CCGCCGGAAT GGGCGGACTG GGCCTCGATG CTGAAGGCGA TCAAGACCGA CCTCGGCTAT GACATCCCGC ACGACAACAA GAATTCCGGC CAGGCGCTGG CGCAGATCCT CGCCGAAAAG GCCAACCCGG TCGGCGACAT CGGCTATTTC GGCGTCACCT TCGGCATGAA GGCCAAGACG CAGGACGCGC TGGAGCCCTA CAAGCCCGCG CATTGGGACC AGGTCGATGC CGGCCTGAAG GATCCCGACG GCTATTGGAC CACGATCCAT TCCGGCACGC TCGGACTGTT CGTCAATAAG GATGCGCTCG GCGGCAAGCC GGTGCCGAAG TGCTGGAAGG ACCTGCTGAA GCCCGACTAC AAGGGCATGG TCGGCTATCT CGATCCGTCC TCGGCGGCGG TCGGCTATGT CGGCTCGGTC GCGGTCAATC TCGCGCTCGG CGGCTCCGCG AGCGATTTTT CGCCCGCGAT CAATTTCTTC AAGGACCTGC AGAAGAACCA GCCGATCGTG CCGAAGCAAA CCTCCTACGC CCGCGTGGTG TCGGGTGAGA TCCCGATCCT GTTCGACTAC GACTTCAACG CCTATCGGGC GAAATATTCC GAGGCCGGCC ATTTCGAATT CGTGATTCCC TGCGAAGGCT CGGTGGTGTT TCCCTATGTC GTCGGTCTGG TGAAGAACGC GCCGGACAAG GACAAGGCCA AGAAGGTGAT GGACTATCTG TTGTCCGACA AGGGCCAGGC GATCTGGACC AACGCCTATC TGCGCCCGGC GCGCAAGATC GCTCTGCCGG AAGCGGTGAA GGCCAAGTTC CTGCCGGACA GCGATTACGA TCGCGCCAAG AGCGTCGATT GGGGCGAGAT GGAAACCGCG CAAAAGGGCT TCGTCGAACG CTATCTCGCC GACGTCCACT GA
|
Protein sequence | MKLSRWVFAL VAVPLLLQPA KAADVICYNC PPEWADWASM LKAIKTDLGY DIPHDNKNSG QALAQILAEK ANPVGDIGYF GVTFGMKAKT QDALEPYKPA HWDQVDAGLK DPDGYWTTIH SGTLGLFVNK DALGGKPVPK CWKDLLKPDY KGMVGYLDPS SAAVGYVGSV AVNLALGGSA SDFSPAINFF KDLQKNQPIV PKQTSYARVV SGEIPILFDY DFNAYRAKYS EAGHFEFVIP CEGSVVFPYV VGLVKNAPDK DKAKKVMDYL LSDKGQAIWT NAYLRPARKI ALPEAVKAKF LPDSDYDRAK SVDWGEMETA QKGFVERYLA DVH
|
| |
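The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes a single terminal stop codon (the gene sequence listed here ends in TGA). The function names gene_length and protein_length are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1388495, 1389496)
print(length_bp)                  # 1002, matches Gene Length
print(protein_length(length_bp))  # 333, matches Protein Length
```

Both results agree with the Gene Length and Protein Length fields. The strand field does not affect this check, because the length depends only on the coordinate span.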