Gene Information |
Locus tag | Rsph17025_0526 |
Symbol | |
ID | 5082794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 526397 |
End bp | 528328 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640482081 |
Product | extracellular solute-binding protein |
Protein accession | YP_001166737 |
Protein GI | 146276578 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
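The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 526397
end_bp = 528328

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1932 643
```

Both values agree with the listed gene length (1932 bp) and protein length (643 aa).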
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0241638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.268494 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGGAG TCGCGGCGCG CACGGCGCAG GGCAGGGTCG CAACCTCGAG ACTTCCCGAC GTGCAGTCAT GGCTTCTGGG GGGGCTTGGC CTTCTGCTCG TCACGGCCGC GGCGCTTCCG GTCCGCGCGC AGGAGGCTGA GACGATCATC CGCTCGCATG GCATCTCGAC CTTCGGCGAA CTGAAATACC CGGCCGATTT CACGCATCTC GCCTATGTGA ACCCCGACGC GCCCAAGGGG GGCGAGATCT CGGAATGGAC CTTCGGCGGG TTCGATTCGA TGAACCCCTA CTCGGTGAAG GGCCGGGCCG CCGCGCTCTC GTCGATCATG TACGAATCGA TCCTCACCGG CACTGCCGAC GAGATCGGCG CCTCCTATTG CCTGCTTTGC GAGACGCTGG AATATCCCGA GGATCGCAGC TGGGTGATCT TCAACCTGCG CCCCGAGGCG AAGTTCTCGG ACGGAACGCC GGTGACGGCC GAGGATGTGG TCTTTTCCTA CGAGACCTTC GTCACCAAGG GGCTGACCGA TTTCCGCACC GTCTTTGCCC AGCAGGTCGA AAGCGCCGAG GCGCTCGACA GTCACCGGGT CAGGTTCACC TTCAAGCCGG GCATCCCCAC CCGCGACCTG CCGCAGGACG TGGGCGGCCT GCCGGTCCTG TCCAAGGCCC AGTACGAGCG CGAGGGGCTG GATCTGGAGG AGGGCAGCCT CAAGCCCTTC CTCGGCTCGG GGCCCTATGT GCTCGACCGG ATGAACGTGG GCCAGACGGT CGTCTATCGC CGCAATCCCG ACTACTGGGG CAATGATCTG CCGATCAACC GCGGGCGGGG CAATTTCGAC ACGATCCGCA TTGAATATTA CGCCGATTAC AACGCGGCCT TCGAGGGCTT CAAGGGCGGC AGCTACACCT TCCGCAACGA GGCCTCCTCG ATCCTCTGGG CCACGGGCTA CGACTTTCCT TCGGTCGACG CAGGCCATGT GACGAAGGTC GAACTGCCCT CGGGCGCCAA GGCCACCGGG CAGGGCTGGA TGATGAACCT GCGCCGCGAG AAGTTCCAGG ACCCCCGCGT GCGCGAGGCG CTGGGCCTGA TGTTCAACTT CGAATGGTCC AACGCGACGC TGTTCTACGG CCTCTATACC CGTGTCGATT CCTTCTGGGA AAACAGCTAC CTAGAGGCCG AGGGTTCGCC GTCCGAGGCC GAGGTGGCGC TGATGAAGCC GCTGGTGGAC GAGGGGCTGC TGCCCGAGTC GATCCTGACG GACCCTCCCG TCAGCCCCGC CGGATCGGGC GACCGGCAAC TCGACCGTGG CAACCTGCGC GCGGCGAGCC GGCTTCTGGA CGAGGCCGGC TGGGCGGTGG GCGCCGACGG CCTGCGCCGC AACGCCAGCG GCGAGGTGCT GCGGATCGAG TTCCTGAACG ACAGCCAGAC CTTCGACCGC GTCATCAACC CGTTCATCGA GAACCTGCGG GCGCTCGGGA TCGACGCGCT GATGACGCGG GTGGACAACG CCCAGATGGA AAGCCGCACC CGGCCTCCGA GCTACGATTT CGACATCACC ACCGGGAACG CGCGCACGAA CTACATCTCC GGCTCGGAAC TCAAGCAGTA TTACGGGTCG GAAACCGCCG ATGTCTCCAG CTTCAACATG ATGGGGCTCA AGAGCCCGGC GGTGGACCGG ATGATCGAGA TGGTGCTGGC GGCCCATACC TCGGACGAGC TTGAGGTGGC GACCAAGGCG CTGGATCGTG TGCTGCGGCT TCAGCGGTTC TGGGTGCCGC AATGGTACAA GGCCAGCCAC 
ACCGTCGCCT ATTACGACAT GTACGAGCAT CCCGAGGAAC TTCCGCCCTA TGCGCTGGGA GAGCTGGACT TCTGGTGGTT CAACCCCGAC AAGGCCGAGG CCTTGCGCGC GGCGGGCGCG CTGAGACGCT AA
|
Protein sequence | MGGVAARTAQ GRVATSRLPD VQSWLLGGLG LLLVTAAALP VRAQEAETII RSHGISTFGE LKYPADFTHL AYVNPDAPKG GEISEWTFGG FDSMNPYSVK GRAAALSSIM YESILTGTAD EIGASYCLLC ETLEYPEDRS WVIFNLRPEA KFSDGTPVTA EDVVFSYETF VTKGLTDFRT VFAQQVESAE ALDSHRVRFT FKPGIPTRDL PQDVGGLPVL SKAQYEREGL DLEEGSLKPF LGSGPYVLDR MNVGQTVVYR RNPDYWGNDL PINRGRGNFD TIRIEYYADY NAAFEGFKGG SYTFRNEASS ILWATGYDFP SVDAGHVTKV ELPSGAKATG QGWMMNLRRE KFQDPRVREA LGLMFNFEWS NATLFYGLYT RVDSFWENSY LEAEGSPSEA EVALMKPLVD EGLLPESILT DPPVSPAGSG DRQLDRGNLR AASRLLDEAG WAVGADGLRR NASGEVLRIE FLNDSQTFDR VINPFIENLR ALGIDALMTR VDNAQMESRT RPPSYDFDIT TGNARTNYIS GSELKQYYGS ETADVSSFNM MGLKSPAVDR MIEMVLAAHT SDELEVATKA LDRVLRLQRF WVPQWYKASH TVAYYDMYEH PEELPPYALG ELDFWWFNPD KAEALRAAGA LRR
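The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short script. This is a sketch only: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is the standard amino-acid assignment that translation table 11 shares with table 1, and the special handling of alternative start codons in table 11 is ignored (this gene begins with ATG, so it does not matter here). Only the first 30 bases of the gene are used as input; pasting the full sequence in place of `prefix` should work the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
prefix = "ATGGGTGGAG TCGCGGCGCG CACGGCGCAG"
print(translate(prefix))  # MGGVAARTAQ
```

The printed peptide matches the first ten residues of the protein sequence listed above.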