Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3988 |
Symbol | |
ID | 4898698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 1132302 |
End bp | 1133924 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640114591 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045838 |
Protein GI | 126464725 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC TGCTCGCCTC CACCGCCGTG ATGCTGGCGC TCGCCCTGCC CGCCGCAGCC CAGGACTATA CGCCCGACCC GAACGCCAGG CCCGGCGGCA CGATCACCAT CACCTACAAG GACGATGTGG CGACGCTCGA TCCGGCCATC GGCTACGACT GGCAGAACTG GTCGATGATC AAATCGATCT TCGACGGGCT GATGGATTAC GTCCCCGGCA CGACCGAGCT GCGCCCGGGT CTCGCCGAGA GCTACGAGAT CTCGGAGGAC GGGCTCACCT ATACGTTCAA GCTGCGCCCG GGCGTGACAT TCCACAACGG CCGCGAGATG ACGGCCGAGG ATGTGAAATA TTCGCTCGAC CGCGTGACGC TGCCCGCGAC CCAGTCGCCG GGAGCGGGCT TCTTCGGCTC GATCAAGGGC TTCGATGCGA TGGCCGACGG CTCGGCCACC ACGCTCGAGG GCGTGACGGT GGTCGATCCC TCGACCGTGA AGATCGAGCT CTCGCGTCCC GACGCCACCT TCCTGCATGT GATGGCGCTG AACTTCGCCT CGGTGGTGCC GAAGGAGGCC GTCGAGGCGG CGGGCGCCGA CTTCGGCAAG CAGCCGGTCG GCACCGGGGC CTTCAAGCTC GCCGAATGGA CCCTCGGCCA ACGGCTCGTC TTCGAGAAGA ACGCCGATTA CTGGCGCGAG GGCGTGCCCT ATCTCGACAG CATCGTCTTC GAAGTGGGAC AGGAGCCGAT TGTGGCGCTG CTGCGGCTGC AGAACGGCGA GGTGGACGTG CCCGGCGACG GCATTCCGCC TGCGAAATTC ACCGAAGTGA TGGCCGATCC GGCGCAGGCC GAGCGCGTGG TCGAGGGCGG CCAGCTGCAC ACGGGCTACA TCACGATGAA CGTGACCCAG CCGCCCTTCG ACAATCTGCA GGTCCGTCAG GCCGTCAACA TGGCGATCAA CAAGCAGCGG ATCACCCAGA TCATCAACGG CCGCGCGATC CCCGCGACCC AGCCGCTGCC GCCCTCGATG CCGGGCTATA CCGAAGGCTA CGAGGGCTAT CCGCACGATG TCGAGAAGGC CAAGGCGCTG CTCTCCGAGG CGGGCTTCGC CGACGGGTTC GAGACCGAGC TCTATGTGAT GAACACCGAC CCGAACCCGC GCATCGCGCA GGCGATCCAG CAGGATCTGT CGCAGATCGG CATCAAGGCC GCGATCCAGA GCCTCGCGCA GGCCAATGTG ATCGAGGCCG GCGGCAATGG CTCGGCGCCG ATGATCTGGT CGGGCGGCAT GGCCTGGATC GCGGATTTCC CCGATCCGTC CAACTTCTAC GGCCCGATCC TCGGCTGCGC GGGCGCGGCT GACGGCGGCT GGAACTGGTC GAAATTCTGC GACGAGGCGC TCGACGCCAA GGCCACCGAG GCCGACAGCC TCGCCGATCC GGCCCGTGCC GAGGAGCGGC TGAAGCTCTG GTCCGACGTC TATATGGGCG TGATGGAGAA GGCGCCGTGG GTGCCCGTCT TCAACGAACA GCGCTACACG ATGAAATCCG CGCGCATGGG CGGCGACGAC AGCCTCTATG TCGATCCCGT CTCGATCCCC GTGAACTACG ACTATGTCTT CGTGACCGAG TAA
|
Protein sequence | MKRLLASTAV MLALALPAAA QDYTPDPNAR PGGTITITYK DDVATLDPAI GYDWQNWSMI KSIFDGLMDY VPGTTELRPG LAESYEISED GLTYTFKLRP GVTFHNGREM TAEDVKYSLD RVTLPATQSP GAGFFGSIKG FDAMADGSAT TLEGVTVVDP STVKIELSRP DATFLHVMAL NFASVVPKEA VEAAGADFGK QPVGTGAFKL AEWTLGQRLV FEKNADYWRE GVPYLDSIVF EVGQEPIVAL LRLQNGEVDV PGDGIPPAKF TEVMADPAQA ERVVEGGQLH TGYITMNVTQ PPFDNLQVRQ AVNMAINKQR ITQIINGRAI PATQPLPPSM PGYTEGYEGY PHDVEKAKAL LSEAGFADGF ETELYVMNTD PNPRIAQAIQ QDLSQIGIKA AIQSLAQANV IEAGGNGSAP MIWSGGMAWI ADFPDPSNFY GPILGCAGAA DGGWNWSKFC DEALDAKATE ADSLADPARA EERLKLWSDV YMGVMEKAPW VPVFNEQRYT MKSARMGGDD SLYVDPVSIP VNYDYVFVTE
|
| |