Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3587 |
Symbol | |
ID | 4898155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 679707 |
End bp | 680744 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640114196 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045450 |
Protein GI | 126464337 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0949603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA CACGCGCGAC CGGCCTATCC GTGCTTGCGG CCCTCCTGCT CGCCGGAGCG GGCACCGCCG CCTTCGCCCA GTCGGGCGAC ACGCTGGCCG CGGTGAAGGA GCGGGGCGAG GTCCTCTGCG GCGTCCATCC CGCGCGCCAC GGCTTCGCGG CCCCCGACAG CCAGGGCAAA TGGAGCGGCT TCGAGGTCGA TTTCTGCCAC GCCATCGCGG CGGCGGTGTT CGGCGATGCG AACAAGGTTC GGTTCGTGGC GCTCTCCTCG CAGCAGCGCT TCCCGGCGAT CCAGTCGGGC GAGGTCGATG TGCTGGCGCG CAACGTGACC GCCACGCTCA GCCGCGACAC GGCGCTCGGG CTGAACTTTG CGCCGCCGAT CTTCTACACC GGCACGGGCT TCCTCGTGCG CGCGGCCGAC GGGATCGAGA AGGTCGAGGA TCTGGACGGC GCGGCGATCT GCATGGCGCC CGGCTCCACC ACCGAGCGCA ACGTGGCGCA GATCTTCGCT GCCCGCGGGC TCAGCTACAC GCCCGTCGTG ATCGAGAACA ACAAGCAGCT GGTCGATGCC TATGTCACGG GGCGCTGCGA CGCGCTGACC AAGGACAAGG CGGCGCTTCC GGGCGTGCGG GCCTTCGACA CCGAGGTTCC CGGAGACCAT GTGCTGCTGC CCGGCATCTA TTCCAAGGAG CCGCTCGCCA TGGCCGTGCG TCAGGGCGAC GACAAATGGT ACGATCTGGT GAAATGGGTG ACCTACGCCA CCTTCAACGC CGAGGAACTG GGCGTGACCC AGGCCAATGT CGACGAGATG AAAGCCTCGG ACGATCCCGA CATCCAGACC CTTCTGGGCG TGATCGGCGA CAACGGCACG AAGCTCGGGG TGCCCAACGA CTGGGCCTAT GCGATCGTGA AGCAGGTCGG CAACTACGAA GACATCTACA TGCGCCATTT CGGGCCCGAC ACCCCCGTGG CGCTCGACCG CGACCAGAAC CAGCTCTGGA CCGAGGGCGG GCTGCTCTAC GGCTTCCCGA TGCACTGA
|
Protein sequence | MNQTRATGLS VLAALLLAGA GTAAFAQSGD TLAAVKERGE VLCGVHPARH GFAAPDSQGK WSGFEVDFCH AIAAAVFGDA NKVRFVALSS QQRFPAIQSG EVDVLARNVT ATLSRDTALG LNFAPPIFYT GTGFLVRAAD GIEKVEDLDG AAICMAPGST TERNVAQIFA ARGLSYTPVV IENNKQLVDA YVTGRCDALT KDKAALPGVR AFDTEVPGDH VLLPGIYSKE PLAMAVRQGD DKWYDLVKWV TYATFNAEEL GVTQANVDEM KASDDPDIQT LLGVIGDNGT KLGVPNDWAY AIVKQVGNYE DIYMRHFGPD TPVALDRDQN QLWTEGGLLY GFPMH
|
| |