Gene Information |
Locus tag | Rru_A0439 |
Symbol | |
ID | 3834005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 516351 |
End bp | 517445 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637824523 |
Product | extracellular solute-binding protein |
Protein accession | YP_425531 |
Protein GI | 83591779 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
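
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention explicitly, and the helper names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, ASSUMING the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length = gene_length(516351, 517445)
print(length, protein_length(length))  # 1095 364
```

Under these assumptions the computed values agree with the Gene Length (1095 bp) and Protein Length (364 aa) fields.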
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGATT CGGTTCTGAC CTCTCGCGCT CTGTGTCGTG GCCTCGCGGT GGTTCTGGCC ATCGGTTTCG CTCTGCTGTC GCCGCTGGCC GGTCGCGCCG AGCAAGCACC GGCGTCCCCG ATCTCCGCAG CGGAGGGGGA CACCCTTAGG CAGGTGCGCG AACGGGGGTA TCTGACCTGC GGCGTCAATG CCCGGCTGAC CGGCTTGGGA TCCGTCGACC CCGACGGGGT GTGGCGCGGC TTTGACATCG ATTATTGCCG GGCGGTGGCG GCGGCCGCCC TTGGCGACAG CGGACGGGTC CGATTCGTTC CCCTGGATAT CCGTTCGCGC CTGCTGGCCC TGACCAGCGG CGAGGTCGAT CTGCTGTCGC GCAACACCAC CTGGACGTTG GAGCGCGATG CCGGCCAAGG CATCTCGTTC GTTGGCGTCA GTTTATTCGA TGGACCGGGG CTGCTCGCCT GGAAGGATCT TCCCGGCGAC GATCTGGCCT CGCTGCCGCC GACGGCCCGT CTCTGCGTCC AAAGCGCGGC GACGGCGGCC CGGGTGCTTC CCGCCCGCAT GGCCGCCGCC GGTTTGTCTT TTACCATTCT GCCCTTTCGC TCGCTGGAAG AGGCGCGCAT GGCCTTGTTC ACCCGGGCCT GCGACGGTTA TGTCGCCGAT CGCACGGCGC TGGCCTCGCT GGCGACCTTC GAGGCGCCGC GCCCCGACGC CCTGCGCCTG TTGCCCGATC TGTTGGCCGT TGAGCCGCTG GGGCCGGCGG TGCGCGACGG CGATGCCAAT TGGTTCGACA TCGCCCGCTG GACCCTGTTC GCCCTGATCA AGGCCGAGGA AGAGGGGTTA AGCCACGATC GCCTGCCCGC GGTGCTTGAA AGCACCCAGG ATCCGGAGCT GCGCCGCTTT CTGGGGCTTG ATCCCGGAGT GGGGGCGCCC CTTGGTCTGG ACGACGCCTG GGTCCGCCGG ATTGTCAGTC AGGTCGGCAC CTACGGCGAG GTCTTCGACC GCAATCTGGG CCAGGGCAGC CCTCTGAAGC TTGAGCGCGG GCCCAATGCC CTGCGGCGCG ACGGCGGGTT GATGGTCGCC CCGCCGTTTT TCTAA
Protein sequence | MEDSVLTSRA LCRGLAVVLA IGFALLSPLA GRAEQAPASP ISAAEGDTLR QVRERGYLTC GVNARLTGLG SVDPDGVWRG FDIDYCRAVA AAALGDSGRV RFVPLDIRSR LLALTSGEVD LLSRNTTWTL ERDAGQGISF VGVSLFDGPG LLAWKDLPGD DLASLPPTAR LCVQSAATAA RVLPARMAAA GLSFTILPFR SLEEARMALF TRACDGYVAD RTALASLATF EAPRPDALRL LPDLLAVEPL GPAVRDGDAN WFDIARWTLF ALIKAEEEGL SHDRLPAVLE STQDPELRRF LGLDPGVGAP LGLDDAWVRR IVSQVGTYGE VFDRNLGQGS PLKLERGPNA LRRDGGLMVA PPFF
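
The relationship between the two sequences can be sketched in code. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons. The sketch below ignores alternative start codons, which is sufficient here because the gene begins with ATG. The function names are hypothetical, and the input is the first three 10-base blocks of the gene sequence above.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGGAGGATT CGGTTCTGAC CTCTCGCGCT"
print(translate(first_blocks))  # MEDSVLTSRA
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.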