Gene Information |
Locus tag | Rsph17029_3982 |
Symbol | |
ID | 4899129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 1124354 |
End bp | 1125865 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640114585 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045832 |
Protein GI | 126464719 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
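The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length counts the stop codon while the Protein Length does not. These conventions are assumptions, not something the record states; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Rsph17029_3982.
length = gene_length(1124354, 1125865)
print(length)                           # 1512
print(expected_protein_length(length))  # 503
```

Under these assumptions both results agree with the Gene Length (1512 bp) and Protein Length (503 aa) fields.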
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGAA AATGGGCCCG CGTCGGCCTC ATCGCGCTGA GCGTGACCCT CGGCCTCGGC AGCATTGCCG AAGCCGCAGG CGTGCTGACC ATCGGCCGGC GCGAGGACGG CACCACCTTC GACCCGATCG CCACGGCGCA GAACGTGGAT TTCTGGGTCT TCTCGAACGT CTATGACGTG CTCGTGCGCG TGGACAAGAC CGGGACCAAG CTCGAGCCCG GTCTGGCCGA AAGCTGGGAG ATCTCGGAGG ACGGGCTCAC CTACACCTTC CATCTGCGCG ACGCGAACTT CTCCGACGGC TCGCCCATCA CCGCCGAGGA CGCGGCCTTC ACGCTGCTGC GCATCCGCGA CAGCGAACTG TCGCTCTGGT CGGACAGCTA TGCGGTGATC GAGACCGCCG AGGCGACCGA TCCGAAGACT CTCGTCGTCA AGCTGAAGAC CCCCTCCGCG CCCTTCCTGT CCACCATGGC CATGCCCGCC GTCTCGATCC TGTCCAAGGC AGGCGTCGAA GCCATGGGCG AGGAAGCCTA CGCCGAGAAG CCCGTGGCCT CCGGCGCCTT CACCGTCGAG GAATGGCGCC GCGGCGACCG GGTGATCCTG AAGAAGAACC CGGAATTCTG GCAAGCCGAC CGGGTGAGCC TCGACGGGGT CGAATGGATC TCGATCCCCG ACGACAACAG CCGGATGCTG AGCGTGCAGG CGGGCGAACT CGACGCCGCG ATCTTCGTGC CCTTCTCGCG GGTGGCGGAA CTGAAGAAGG ATACCAACCT CAAGGTCATG GTCGAGCCCT CGACGCGCGA GGACCATCTG CTCCTCAACC ACGAGCACGA GCCACTGAAC GATCCCAAGG TGCGCGAGGC GATCGACCTC GCCATCGACA AGCAGGCGAT CGTCGACACG GTGACCTTCG GGCAGGCGCA GATTGCCAAT TCCTACATCC CGGCGGGCGC GCTCTATCAC AACGACAACA ACCTGCTGCG GCCCCACGAC CCCGAGAAGG CCAAGGCGCT TCTGGCCGAG GCGGGCGTGT CCGACGTGAG CCTCGATTAT GTGGTGAACG CGGGCAACGA GGTGGACGAG CAGATCGCGG TCCTTCTCCA GCAGCAGCTG GGTCAGGCCG GGGTCACGGT CAATCTGCAG AAGATGGACC CGAGCATGAC CTGGGACATG CTGGTGAATG GCGAATACGA CCTGTCGGTC ATGTATTGGA CGAACGACAT CCTCGATCCC GACCAGAAGA CCACCTTCGT TCTGGGTCAC GACGTCAACA TGAACTACAT GACGCGCTAC GAGAACGAGA CGGTGAAGCA GCTCGTGGCC GATGCCCGGC TCGAGATGGA CCCGGCCAAG CGCGAGGCGA TGTATACCGA GATCCAGGAA CTGTCGAAGG CCGATACCCA CTGGATCGAC CTCTATTACA GCCCCTTCAT CAACGTGACG CGCGCCAATA TCGAGAACTT CAACCAGAAC CCGCTGGGCC GCTTCTTCCT CGAGGATACG GTGAAGAACT GA
|
Protein sequence | MTGKWARVGL IALSVTLGLG SIAEAAGVLT IGRREDGTTF DPIATAQNVD FWVFSNVYDV LVRVDKTGTK LEPGLAESWE ISEDGLTYTF HLRDANFSDG SPITAEDAAF TLLRIRDSEL SLWSDSYAVI ETAEATDPKT LVVKLKTPSA PFLSTMAMPA VSILSKAGVE AMGEEAYAEK PVASGAFTVE EWRRGDRVIL KKNPEFWQAD RVSLDGVEWI SIPDDNSRML SVQAGELDAA IFVPFSRVAE LKKDTNLKVM VEPSTREDHL LLNHEHEPLN DPKVREAIDL AIDKQAIVDT VTFGQAQIAN SYIPAGALYH NDNNLLRPHD PEKAKALLAE AGVSDVSLDY VVNAGNEVDE QIAVLLQQQL GQAGVTVNLQ KMDPSMTWDM LVNGEYDLSV MYWTNDILDP DQKTTFVLGH DVNMNYMTRY ENETVKQLVA DARLEMDPAK REAMYTEIQE LSKADTHWID LYYSPFINVT RANIENFNQN PLGRFFLEDT VKN
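The relationship between the Gene sequence and Protein sequence fields can be sketched as a codon-by-codon translation. The sequence as printed begins with ATG and ends with TGA, so this sketch assumes it is already given in the coding orientation even though the gene lies on the minus strand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard table is used here. The helper names `clean`, `translate`, and `gc_percent` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the Gene sequence field, as printed in the record.
prefix = "ATGACAGGAA AATGGGCCCG CGTCGGCCTC"
print(translate(prefix))  # MTGKWARVGL
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_percent` would allow the same comparison against the whole protein and the reported GC content.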