Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3409 |
Symbol | |
ID | 4898961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 476359 |
End bp | 477573 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640114006 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045274 |
Protein GI | 126464161 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.280364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.614723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA CCATCAGCGC GCTGGCCCTT CTGGCCGGCC TCGCGCCGGG GATCGCCTCG GCCGACAGCA CCGTCCGCTT CTGGTATCAC TTCGACAATC CGGAAAACCC GATGTCGGAT CTGGTGGCCA AGTTCGAGGA AGCGAACCCG GGCATCAGGA TCGACGCGCA GAACGTGCCC TGGAACAGCT ATTACGACAA TCTCTACACG GCCATCGTTG GCGGCAACGC GCCCGACGCG GCCATGCTGA AGATGTTCGC CCTGCCGCGC CTTCTCGAGA TGGAGGCGCT CGAGCCGCTC GACGAGATGA TCGCGGGCTG GGAGGGCCGC GACGACATCC TCGACAATCT CTTCGACCTG ACCGAGGCCG AGGATGGCAA GCACTATTAC CTGCCGGTGC AATATGTGGC GCTCTACCTC TATTACCGCG CCGACATGTT CGAGGAGCTG GGCCTCGCCC CGCCCGAGAC CTGCGACCAG TTCCGCGAAG CCGCGATCAA GCTCACCCGC GACACGAACA ACGACGGCAA GATCGACACC TACGGGTTCG GCTTCCGCGG CGGCAAGTCC GGGCACGAAC ATTGGGGCGC CTTCACCCTC GGCCGCGAGG GCGTGGCGCT CGATGACAGC CTCACCTCCG AGGCCGGCGT GGCGGGCACG CAGTTCGTGG TGGATCTCTT CCAGAAGGAC AAGGTCTTCC CGCCCTCGGC CCCGAACGAC GGCTTTCAGG AGATCATCGG CGCCTTCAAG ACCGGCGTGA CCGCGATGAC GATCCATCAT GTCGGCTCCT CGAACGATCT GGTGGCGGCG CTGGGCGACA AGGTCGCGGC CGTGCCGGTG CCGGAATGCG GCGGCGGGCG CTGGACCACC TTCGGCGACG AATCCACCGG CGTCTTCAGC AATGCCAGCG ACAAGGAAGC CGCCTGGAAG TGGATCGCGT TCCTTTCGTC GGAGGGCAAC AACGCGCTCT TCAACAGCGC CACCGGCCAG CTTCCGGTGA CCAAGAGCGA CAGCGCCACC TGGGACCAGC ACGAGAAGCG CTTCGTCGAC GCGACCCAGG CCTCGCTCCC CTTCGCCCAT CTGCTGCCCG CCTCCTCGGC CACGCCCGAG TTCGTGAACA CCGTCTGGCC CACGAACATG CAGCGCGCGC TGAACGGGGA GATCACCGCC GCCCAGATGA ACGAAGCCAT CGCCAAGCTC TTCGCCGAAG AGTGA
|
Protein sequence | MKRTISALAL LAGLAPGIAS ADSTVRFWYH FDNPENPMSD LVAKFEEANP GIRIDAQNVP WNSYYDNLYT AIVGGNAPDA AMLKMFALPR LLEMEALEPL DEMIAGWEGR DDILDNLFDL TEAEDGKHYY LPVQYVALYL YYRADMFEEL GLAPPETCDQ FREAAIKLTR DTNNDGKIDT YGFGFRGGKS GHEHWGAFTL GREGVALDDS LTSEAGVAGT QFVVDLFQKD KVFPPSAPND GFQEIIGAFK TGVTAMTIHH VGSSNDLVAA LGDKVAAVPV PECGGGRWTT FGDESTGVFS NASDKEAAWK WIAFLSSEGN NALFNSATGQ LPVTKSDSAT WDQHEKRFVD ATQASLPFAH LLPASSATPE FVNTVWPTNM QRALNGEITA AQMNEAIAKL FAEE
|
| |