Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0092 |
Symbol | |
ID | 3834285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 105873 |
End bp | 107150 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637824162 |
Product | extracellular solute-binding protein |
Protein accession | YP_425184 |
Protein GI | 83591432 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAGA CCACGGGTGG CGGGTTCGGT GTCTCGTGTT TGAGTGCTTT GGCGCTGGTC GCCGGGCTGG CGATCGCCGC GCCCAAACCG GCGGCGGCGG GCGGTTCGGT CGAGGTGTTG CATTGGTGGA CGGCGGGCGG CGAAGCCAAG GCGGTTTCGG CGCTGAAGGA TCAGTTCGAG GCCGAGGGCG GCACCTGGAT CGATTCGCCG GTGGCCGGCG GTGGCGGCGA CGCGGCGATG ACCGCTTTGC GCTCGCGGGT GATCGCCGGC AATCCGCCCT CGGCCGTGCA GCTCAAGGGG CCGTCGATCC AGGAATGGGC GGCCGAGGGC GTGGTCGCCA ATCTCGATGA CATCGCCAAG GCCGAAAACT GGGACAAGCT GCTGCCCGCC CTGCTGAAGT CGGTGGTGAC CTACGAGGGG CATTACGTCG CCGTTCCGGT CAATATCCAC CGTGTCGATT GGCTGTGGGC CAATCCGGCG GTTCTGGCCA AGGCCGGCGT CGCCGTGCCG ACCACCTGGG ACGAGTTCAA TACCGCCGCC GAGGCCCTGA AGGCCAAGGG GATCATTCCG CTGGCCCATG GCGGCCAGCC CTGGCAGGAC GCCACCTTGT TTGAAGTGGT GGTTCTGGGC CTGGGCGGGC CGGCCTTCTA CCACAAGGCC CTGGTCGAAT TGGACGACGC CGCCCTGCGC GGCGATACCA TGGTCAAGGT GTTCGACCAG ATGCGCCGCC TGCGCGGCTT CGTCGATCCC AACTTCTCGG GCCGCGACTG GAACCTTGCC ACCGCCATGG TGATCAACGG CGAGGCCGGC TTCCAGATCA TGGGCGACTG GGCCAAGGGC GAATTCCTGG GCGCGGGCAA GGTCCCGGGC AAGGATTTCC TGTGCATCGC CGCGCCGGGC AAGGGTTTCT TGCTCAATTC CGACAGTCTG GTGATGTTCG ACGTCAAGGG CGCCGATAAG ATCGAGGGTC AGAAGACCCT GGCCCGCTTG GTGCTGGGCG AAACGTTCCA ACGCACCTTC AACACCCTGA AGGGCTCGAT CCCCGCCCGT CAGGGCATGG ATCTGGCCGA TTTCGACGCT TGCGCCCAGA AATCCCAAGC CGACCTGACC AAAGCCATCG CCGCCGATAG CCTGGAGCCA AGCATGGCCC ATGAAATGGC CGTTCCACGC TCGGTGCGCG GGGCGATCAT GGATGTGGTC ACCGCGCATT TCAATTCAAG CGAATCCTCG GCCGAGGCCG TGGCCCATCT CGCCGACTCC ATCGCCCAGG CCCGTTAG
|
Protein sequence | MRKTTGGGFG VSCLSALALV AGLAIAAPKP AAAGGSVEVL HWWTAGGEAK AVSALKDQFE AEGGTWIDSP VAGGGGDAAM TALRSRVIAG NPPSAVQLKG PSIQEWAAEG VVANLDDIAK AENWDKLLPA LLKSVVTYEG HYVAVPVNIH RVDWLWANPA VLAKAGVAVP TTWDEFNTAA EALKAKGIIP LAHGGQPWQD ATLFEVVVLG LGGPAFYHKA LVELDDAALR GDTMVKVFDQ MRRLRGFVDP NFSGRDWNLA TAMVINGEAG FQIMGDWAKG EFLGAGKVPG KDFLCIAAPG KGFLLNSDSL VMFDVKGADK IEGQKTLARL VLGETFQRTF NTLKGSIPAR QGMDLADFDA CAQKSQADLT KAIAADSLEP SMAHEMAVPR SVRGAIMDVV TAHFNSSESS AEAVAHLADS IAQAR
|
| |