Gene Information |
Locus tag | Rsph17025_1147 |
Symbol | |
ID | 5084579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1181348 |
End bp | 1182982 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640482705 |
Product | extracellular solute-binding protein |
Protein accession | YP_001167353 |
Protein GI | 146277194 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
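The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with one stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length(1181348, 1182982)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Both results agree with the Gene Length and Protein Length fields listed in the record.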
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGGGC CCGCCGCTGC GGGTCCGCGA TTGCAGCGGG CGCCGGATCC CGGCCGCCCA TGGGAAGGAG CCGGCGCGTC GCCCCGGCCG CCTCCCTCGC CTGACGGAAC GGGAGAGACG GTCGCCAGTT CCCCGGCGGG CCGGGAACCA TCGGGCGCGA GGCGGCGTTG CGGGTCAGGA AGCGGGCATC TGCCGCATGC GCTTGTGCTT TCCAGCCACG GGAGGGGTGG GAGGGCCGGA GGAGCGCCGC ATCCGGGTTC GTTTCGGTGG GGGGAGGACG CAATGAAGGC GATGGTTCCC GCGATGGCGG CCACGCTCGC GCTGGCATCC GGCCCGCTCG CGGGGCAGGA GCTGACGTTT CCTCCCGGCG AGGACGACCG CTTTCACTGG AACAGTTTCG AGACCCTCAG GGAGCAGGAT TTCTCTGGGC AGCGGCTCAC GATCCTCGGG CCGTGGCTCG GCCCGGACCG GACGCTGTTC AATTCCGTCA TCGCCTATTT CGAGGCCGCG ACCGGAGCGG CCGTCACCTA CAACGGCTCG GACAATTTCG AGCAGCAGAT CGTGATCGAC GCGAGCGCGG GCTCGCCGCC GGACATCGCG ATCTTTCCCC AGCCCGGCCT TGCGCGGGAC CTGGCCTCGA AGGGGCAGCT GGCGCCCCTC GATCCGTCGC TCGGCGAGTG GCTGCGCGAG AATTACGCGG CCGGCGACAG CTGGGTGAAT CTTGGCACAT TTCCCGGCCG GGACGGGCAG GAGGCTCTCT ACGGGTTCTT CTACAAGATC GATGTGAAGT CGCTCGTCTG GTATGTGCCC GAGAATTTTG CCGACTTCGG CTATGAGGTT CCGGGCACGA TGGAGGAGCT TCTGGCGCTG TCCGAGCGGA TGGTGGAGGA TGGGGTGACG CCCTGGTGCA TCGGCCTCGC CTCGGGCGGG GCCACCGGCT GGCCCGCGAC CGACTGGGTC GAGGACATGA TGCTGCGCAT CAACCCGCCC GAAGTCTATG ACCAGTGGAC CCTGAACGAG ATCCCGTTCG ACGATCCGCA GGTGGTGGCG GCGATCGAGG AGTTCGGCCG GTTCGCGCGC GACGGACGCT TCGTGGCCGG TGGGCCCAAT GCGGTGGCCG CGACCGATTT CCGCGACAGC CCCAAGGGTC TCTTCGCCGC CCCGACGCAA TGCTTCATGC ACAAGCAGGC AAGCTTCATC CCCTCCTTCT TTCCCGAGGG GACGGTGATC GGCGAGGATG CCGACTTCTT CTACCTTCCC GCCTACGAGA GCCGCGACCT GGGCCAGCCG GTGCTGGGTG CGGGAACGGT CTTCGGCATC ACCCGCGACA CGCCGGTGGC GCGCGCCTTC ATCGACTTTC TCAAGACGCC GATCGCGCAC GAGGTCTGGA TGGCCCAGAC CGGCTTTCTC ACGCCGCACA CGGGCGTGAA CACCGATGTC TATGGCGATC CCACGCTGCG CAAGATGGGC GACATCCTGC TCGAGGCCAC GACCTTCCGC TTCGACGGAT CCGACCTGAT GCCGGGCGCG GTGGGCGCAG GCGCCTTCTG GACCGGAATG ATCGACTACA TGGGCGGACA GCCGGCCGAG ACCGTGGCGG CCGGCATCCA GCGCACCTGG GACACGTTCA AGTGA
|
Protein sequence | MEGPAAAGPR LQRAPDPGRP WEGAGASPRP PPSPDGTGET VASSPAGREP SGARRRCGSG SGHLPHALVL SSHGRGGRAG GAPHPGSFRW GEDAMKAMVP AMAATLALAS GPLAGQELTF PPGEDDRFHW NSFETLREQD FSGQRLTILG PWLGPDRTLF NSVIAYFEAA TGAAVTYNGS DNFEQQIVID ASAGSPPDIA IFPQPGLARD LASKGQLAPL DPSLGEWLRE NYAAGDSWVN LGTFPGRDGQ EALYGFFYKI DVKSLVWYVP ENFADFGYEV PGTMEELLAL SERMVEDGVT PWCIGLASGG ATGWPATDWV EDMMLRINPP EVYDQWTLNE IPFDDPQVVA AIEEFGRFAR DGRFVAGGPN AVAATDFRDS PKGLFAAPTQ CFMHKQASFI PSFFPEGTVI GEDADFFYLP AYESRDLGQP VLGAGTVFGI TRDTPVARAF IDFLKTPIAH EVWMAQTGFL TPHTGVNTDV YGDPTLRKMG DILLEATTFR FDGSDLMPGA VGAGAFWTGM IDYMGGQPAE TVAAGIQRTW DTFK
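The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. It assumes plain A/C/G/T input and the three stop codons of translation table 11 (TAA, TAG, TGA); the helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove the block-of-ten spacing (any whitespace) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def ends_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    s = normalize(seq)
    return len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# First two blocks of the gene sequence above, used only as a small example.
first_blocks = "ATGGAAGGGC CCGCCGCTGC"
print(normalize(first_blocks))    # ATGGAAGGGCCCGCCGCTGC
print(gc_fraction(first_blocks))  # 0.75
```

Passing the full gene sequence string to `gc_fraction` and `ends_like_cds` gives a quick consistency check against the GC content and Gene Length fields. The start codon is deliberately not checked, because translation table 11 permits alternative start codons.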