Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2356 |
Symbol | |
ID | 3835790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2742295 |
End bp | 2743884 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637826464 |
Product | extracellular solute-binding protein |
Protein accession | YP_427443 |
Protein GI | 83593691 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.458822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAAA TAGTGATTGG CGCGGCTTCG GCCGTTATCC TCGCCATGGC GGCGAGCGGG GCCCAGGCCA AGACGCTGGT CTATTGCTCG GAAGGCAGCC CCGAGGGCTT CAATCCGGCT TTTTACACCA CCGGCACGAC CTTCGACGCC ACCAGCAAGA ACATTTTCGA CAAGCTCGTT CTTTTCAAGC GCGGCACCAC GGAGATCGAA CCCGGTCTGG CCGAGAGCTG GGAGGTTTCG CCCGACGGCA AGACCTATAC CTTCCACCTG CGCAAGGGCG TGACCTTCCA CGACAGCGAC ATCTTCAAGC CGACGCGGCA ATTCAACGCC GATGACGTGA TCTGGAGCTT CGAGCGTCAG TTGAAGAAGG ATCACCCCTA TCACGCGGTT TCCGGCGGCA CCTACGACTA CTTCGAAGGC ATGTCGATGA ACACCCTTCT CGAAAAGATC GAGAAGGTCG ACGATTATAC GGTGGTCTTC CACCTGAGCC GCCCCGAAGC GCCGATGCTG GCCAATCTGG CCATGGACTT CGCCTCGATC TTCTCGGCCG AATACGCCGA TAAGATGATG AAGGCCGGAA CCCCGGAAGT CGTTGACCAG AAGCCGATCG GCACCGGTCC CTTCATGTTC CGCGGTTACC AGAAGGACGC CCAGATCCGC TACGAGGCCA ATCCGACCTA TTGGCAGGGC AAGGCCGCCA TCGACCGCCT GGTTTTCGTC ATCACCCCCG ACGCCAGCGT GCGCTACGCC AAGCTGAAGG CCGGCGAATG CCATGTGATG CCCTATCCCA ATCCGGCCGA CCTGGAAGCC ATGAAGACCG ACAAGGCGGT CAACCTGATG CACCAGGAAG GCCTGAACGT CGGCTATCTG GCCTATAACG TCGAGAAGAA GCCCTTCGAC GACGTGCGCG TGCGCAAGGC CCTCAATCTG GCGATCGACA AGAAGGCGAT CATCGACGCC GTTTATCAGG GCGCCGGCAC CGCCGCCACC AACCCGATCC CGCCGACGAT CTGGTCCTAC AACAAGGCCG TCAAGGACGA CGCCTTCGAT CCGGCCGCCG CCAAGAAGCT GCTGGCCGAA GCCGGGGTGA AGGATCTCAA GACCACCATC TGGGCAATGC CCGTCCAGCG CCCCTACAAC CCCAATGCCC GCCGCATGGC CGAAATCCTT CAGGCCAACT GGAAGGCCGT GGGCGTGGAT GCCGAAATCA CCTCCTACGA ATGGGGCGAA TACCTCAAGC GCGCCAAGGC CGGCGAGCAT GAGACGGCGC TGTTTGGCTG GACCGGCGAC AATGGCGATC CCGATAATTT CCTGGCGGTT CTGCTGGGCT GCGACGCCAT CCCCGGCAAC AACTATGCGC GCTGGTGCGA CAAGTCCTTT GAAAACCTGA TCCAGAAGGC CAAGATCGCC ACCAGCCAGG AAGAGCGGGT GAAGCTCTAC GAAGAGGCTC AGGTCATCTT CAAGGAGCAG GCCCCCTGGG CGACGATCGC GCATTCGGTG GTCTACGAGC CGATTCGCAA GGAAGTTATC GACTATAAGA TAGATCCGCT TGGCGGACAT ATCTTCTACG GCGTCGACCT CAAGAAATAG
|
Protein sequence | MRKIVIGAAS AVILAMAASG AQAKTLVYCS EGSPEGFNPA FYTTGTTFDA TSKNIFDKLV LFKRGTTEIE PGLAESWEVS PDGKTYTFHL RKGVTFHDSD IFKPTRQFNA DDVIWSFERQ LKKDHPYHAV SGGTYDYFEG MSMNTLLEKI EKVDDYTVVF HLSRPEAPML ANLAMDFASI FSAEYADKMM KAGTPEVVDQ KPIGTGPFMF RGYQKDAQIR YEANPTYWQG KAAIDRLVFV ITPDASVRYA KLKAGECHVM PYPNPADLEA MKTDKAVNLM HQEGLNVGYL AYNVEKKPFD DVRVRKALNL AIDKKAIIDA VYQGAGTAAT NPIPPTIWSY NKAVKDDAFD PAAAKKLLAE AGVKDLKTTI WAMPVQRPYN PNARRMAEIL QANWKAVGVD AEITSYEWGE YLKRAKAGEH ETALFGWTGD NGDPDNFLAV LLGCDAIPGN NYARWCDKSF ENLIQKAKIA TSQEERVKLY EEAQVIFKEQ APWATIAHSV VYEPIRKEVI DYKIDPLGGH IFYGVDLKK
|
| |