Gene Information |
Locus tag | Rsph17029_1437 |
Symbol | |
ID | 4896444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1490987 |
End bp | 1492798 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640112025 |
Product | extracellular solute-binding protein |
Protein accession | YP_001043319 |
Protein GI | 126462205 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
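The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues for an unspliced CDS with one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1490987, 1492798)
print(length, protein_length_aa(length))
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.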
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0112279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATACAGG AATTCCGGTC CCGCCTGAGG GGCGCCGCGA TGGCTCTCGG GCTTGCCGTT CTGGGCACGA CGCTGGCCGC CGAACCCCGG CACGGCATAG CTATGTATGG CGAACCGGCG CTTCCACCGG ATTTTGTGTC TCTGCCATAT GCAAATCCCG ACGCGCCGAA GGGCGGCCGG ATCGTGCTGG GCGAGACGGG CGGATTCGAT TCCCTCAACC CCTACATCGT GAAGGGTCGC GCCCCCTATT CGCTCGCGCC GCTGACGGTC GAGACGCTGC TCGGCCGGTC GCTCGACGAG CCCTTCACGC TCTACGGGCT TCTGGCCGAA TCGGTCGAGA CGGACCCCGC CCGGACCTGG GTCGAATTCA CGCTGCGCGA GGGCGCGCGG TTCTCGGACG GCACGCCCGT GACGGTCGGG GATGTGCTCT GGTCGTTCGA GATTCTCGGC ACGAAAGGGA TGCCGCGCTA CTGGGGCGCG TGGCAGAAGA TCGCGAGCGC CGAGCAGACG GGCCCGCGCT CGGTCCGCTT CACCTTCTCC GAGCAGGACC GCGAACTGCC GCTGGTGCTG GGTCTGCGCC CGATCCTGAA GAAGGCGCAG TGGGAGGGGC GCGACTTCGG CTCCTCGGGC TTCGAGGCGC CGATCGGCTC GGGCCCTTAC ATCCTCGAGA GCTTCGAGCC GGGCCGGGTG CTGCGCTACC GCCGGAACCC CGACTGGTGG GGGCGCGACC TGCCCTTCAA CCGCGGCCTG CACAATCTCG ACGAGGTGGT GGTCGAGTAT TTCGGCGATG CGAGCGTGGC CTTCGAGGCG TTCAAGGCCG GCGCCCTCTC GGTCTATCGC GAGACGAGCG CCGCCCGCTG GGCCACTCAC TACGGCTTCC CGGCCGTGCA GAGCGGCGCG ATGGTCAAGT CCGAGATCCC GCATGGCCGC CCGTCCGGAA TGGAGGGGCT GGTGATGAAC ACGCGCCGCG CCCCTTTCGC GGACTGGCGC GTGCGCGAGG CGATGCTCTT GGCCTTCGAC TTCGACCTCA TCAACCGCAC GCTGACCGGC GGCGCCGAGC CGCGGATCGC CTCCTACTTC TCGAACTCCG CGCTCGGGAT GGAGGCGGGG GCGCCTGCCA CGGGGCGCGA GCGGGCGCTG CTCGAGCCCT TTGCGGCAGA TCTGCTGCCC GGCACGCTCG ACGGCTATGC CCTGCCCGCC ACGCACGGCG CCTCGAACCG CGGCAACCTC CGCAAGGCCG CCCGGCTTCT CTCGGAGGCC GGCTGGCGGA TCGAGGACGG GATGCTGGAA GGCCCCGGCG GCGAGCCCTT CGCCTTCGAG ATCCTGCTGC CGCAGGGCGC GGATGCGATG ATCGCGGCCG CGATCATCTA CCGGCAGGCC CTGACGCGGC TCGGGATCTC GGCCCGCATC ACGACCGTCG ATCCGGCGCA GTTCAAGCAG CGGGTCGACA ATCAGGATTT CGACATGACG AGCTTCCTGC GCTCGCTCAC CCTCTCGCCC GGCAACGAGC AGCTCCTCTA CTGGTCGGCC GAGGACAAGG ATCTGCCCGG CTCGCGCAAC CTGATGGGGA TGGAGAGCCC CGCCGCCGAG GCCGTGATCC GGCACATGCT GGCCACCGAC GATGCCGAGG AGTTTCAAGC CTCCGTGAGG GCGCTCGACC GGGTCCTGAC CGCCGGTAGA TATGTCATTC CGATGTGGTA TTCTCGGGTC TCCCGGCTGG CGCACGACAG GCACCTGCGC TATCCGGCAA AAACACCCAT CTATGGCGAC TGGCCGGGCT TCCTGCCGGA CGTCTGGTGG 
CAAGAAAAGT GA
|
Protein sequence | MIQEFRSRLR GAAMALGLAV LGTTLAAEPR HGIAMYGEPA LPPDFVSLPY ANPDAPKGGR IVLGETGGFD SLNPYIVKGR APYSLAPLTV ETLLGRSLDE PFTLYGLLAE SVETDPARTW VEFTLREGAR FSDGTPVTVG DVLWSFEILG TKGMPRYWGA WQKIASAEQT GPRSVRFTFS EQDRELPLVL GLRPILKKAQ WEGRDFGSSG FEAPIGSGPY ILESFEPGRV LRYRRNPDWW GRDLPFNRGL HNLDEVVVEY FGDASVAFEA FKAGALSVYR ETSAARWATH YGFPAVQSGA MVKSEIPHGR PSGMEGLVMN TRRAPFADWR VREAMLLAFD FDLINRTLTG GAEPRIASYF SNSALGMEAG APATGRERAL LEPFAADLLP GTLDGYALPA THGASNRGNL RKAARLLSEA GWRIEDGMLE GPGGEPFAFE ILLPQGADAM IAAAIIYRQA LTRLGISARI TTVDPAQFKQ RVDNQDFDMT SFLRSLTLSP GNEQLLYWSA EDKDLPGSRN LMGMESPAAE AVIRHMLATD DAEEFQASVR ALDRVLTAGR YVIPMWYSRV SRLAHDRHLR YPAKTPIYGD WPGFLPDVWW QEK
|
| |
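The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any analysis. As a minimal sketch, the hypothetical helpers below strip the whitespace and compute the GC fraction of a nucleotide string; the short fragment used in the example is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGATACAGG AATTCCGGTC")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Applying the same two steps to the complete gene sequence should give a value comparable to the GC content field, assuming that field is computed over the gene itself rather than over the whole replicon.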