Gene Information |
Locus tag | Rsph17029_4022 |
Symbol | |
ID | 4898728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 1167130 |
End bp | 1168425 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640114625 |
Product | extracellular solute-binding protein |
Protein accession | YP_001045872 |
Protein GI | 126464759 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTC GCACATTCCT CACCGCCACG GCGCTTGCGC TGATCGTTCA GGCGGGTGCC GCCGCGGCCC AGACCGAGAT CAGCTGGTGG CACGCCATGA CGGGCGCCAA CGCGGAAGTG GTCGAGAAGA TCGCCGCGGA TTTCAACGCG AGCCAGTCGG ACTACAAGGT GACAGCGGTC TTCAAGGGCA CCTACCCCGA GACGCTGAAC GCCGGCATCG CAGCCTTCCG CGCCGGTCAG GCCCCCGATA TCATCCAGGT CTTCGACGTG GGCACCGGCG TCATGATGGC GGCCGAGGGC GCGATCAAGC CGGTGGCCGA GGTGCTGGGC GACAGTTTCG ACAAGTCGGC CTACCTGCCG GGGATCGTGG CCTATTATTC CAAGCCCGAC GGCACGATGC TGTCCTTCCC CTACAACTCG TCCTCGCCGA TCCTCTATTA CAACAAGGAC ATCTTCGAGA AGGCGGGCCT CGATGCAGAC ACCCCGCCCA AGACCTGGAC CGAGGTCTGG GACATGGCGA AGAAGATCAA GGAGAGCGGC GCCGCCCCCT GCGGCTACAC CTCGACCTGG CTCACCTGGA TTCATACCGA GAATTTCGCG GCCTGGAACG ACGTGCCCTT CGCCACGAAC GAGAACGGGC TTGCCGATGT GAATGCCGAG CTGAAGATCA ACGAGCCGAT CTTCGTCAAC CACTTCCAGG CGCTGGCCGA TCTCGCCAAG GACGGCACGT TCAAATACGG CGGCCGCACG TCCGAGGCCA AGCAGATCTT CCTTGCGGGC GAATGCGGGA TCTTCACCGA AAGCTCGGGC GGGCTCGGCG ACATCGTGAA ATCGGGCATG AACTACGGCA TCGGCCAGCT GCCCTATGAC GAGGCGGGCA ACGGGCCGCA GAACACGGTG CCGGGCGGCG CGAGCCTCTG GGTGATGGGC GGCAAGTCGG ACGAGACCTA TGAGGGCGTC GCCGCCTTCT TCAACTATCT CTCGCAGACC GACGTGCAGG AATATCTGCA CCAGACGTCG GGCTATCTGC CGGTGACGAT GGAGGCCTAC GAGGCGACCA AGGCCTCGGG CTTCTACGAG AAGAACCCGG GCCGCGAGGT GCCGATCACC CAGATGATGG GCAAGGAGCC GACCGCCAAC TCCAAGGGCG TGCGCCTCGT GAACCTGCCG CAGGTGCGCG ACATCGAGAA CGAGGAGTTC GAGAAGATGC TCGCCGGAGA GCAGACCGCA CAGGAAGCGC TCGACGCGGC CGTCTCGCGC GGCAACGAGG CGATTCGCCA GGCCATCGGC GGCTGA
|
Protein sequence | MKRRTFLTAT ALALIVQAGA AAAQTEISWW HAMTGANAEV VEKIAADFNA SQSDYKVTAV FKGTYPETLN AGIAAFRAGQ APDIIQVFDV GTGVMMAAEG AIKPVAEVLG DSFDKSAYLP GIVAYYSKPD GTMLSFPYNS SSPILYYNKD IFEKAGLDAD TPPKTWTEVW DMAKKIKESG AAPCGYTSTW LTWIHTENFA AWNDVPFATN ENGLADVNAE LKINEPIFVN HFQALADLAK DGTFKYGGRT SEAKQIFLAG ECGIFTESSG GLGDIVKSGM NYGIGQLPYD EAGNGPQNTV PGGASLWVMG GKSDETYEGV AAFFNYLSQT DVQEYLHQTS GYLPVTMEAY EATKASGFYE KNPGREVPIT QMMGKEPTAN SKGVRLVNLP QVRDIENEEF EKMLAGEQTA QEALDAAVSR GNEAIRQAIG G
|
| |
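The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes inclusive 1-based coordinates and a single terminal stop codon, and the helper names (`gene_length`, `protein_length`, `join_blocks`) are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def join_blocks(seq: str) -> str:
    """Remove the spaces that separate the 10-character display blocks."""
    return "".join(seq.split())


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(1167130, 1168425)
    print(length)                  # compare with the Gene Length field
    print(protein_length(length))  # compare with the Protein Length field
    # First two display blocks of the Gene sequence field.
    print(join_blocks("ATGAAACGTC GCACATTCCT"))
```

The gene sequence is displayed starting with ATG and ending with TGA even though the strand is "-", so the record appears to list it in coding orientation; the sketch therefore does not reverse-complement anything.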