Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0779 |
Symbol | |
ID | 3834085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 931092 |
End bp | 932144 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637824870 |
Product | extracellular solute-binding protein |
Protein accession | YP_425870 |
Protein GI | 83592118 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAAGC AGTTGAAAAA AATCGCTTTC AAGGCCCAAG CCGCCGGTCT CGCCGCCGCC CTGGCCTTAT GCGGGCTGGG ATTTTCCTCC TCGGCCGAGG CTGGCGAGAC CCTGGACGCC ATCAAGGCGC GCGGGGTCAT CAAGATCGGC GTCGGCGGAA ATTCTCCGGG CTTTTCGGCG CCTGACAGCG CCGGTCGCTG GCAGGGCTTT TTCATCGATA TCGGCCGCGC CCTGGCGGTG ACGGTGTTCA ATGATCCCGA GAAGGCCGAA TTCGTCAATT CCTCGCCCCA GCAACGCCTG CCGGCCCTGC AATCGGGCGA ATTCGATATC TTGCTGTCGG GCGTGACCCA GACCATCACC CGGGCGACCA AGCTGGGCTT CCATTTCGGC CCGGTGGTCT TCTACGACGG CCAGGGATTG CTGGTGCCCA AGAAGCTGGG GATCACCAAG GGCAGCGAGC TTGATGGCGC CACCGTCTGC GTGCAGACGG GTACGACGGG CGAGTTGAAC ATCGCCGACT TCTTCCGCCA GCATAAAATC TCGTTCAAGC CGGTGGTGAT CGAGGAGTCC AACGAGTTCC TCAAGGCCTT TGCGTCGGGA CGCTGCGACG TGCTGACCCA GGACAGCTCC GATCTGGCGA TCCGCCGCAC CTTGCTGCCC AATGCCGCCG ATTATGTGCT GCTGCCCGAG CGCATCTCCA AGGAACCGCT GGCTCCGGCC ATCCGCTATG GCGATGACCG CTGGCTGGAA ATCGTCAACT GGACGGTCTA CGCCCTGATC GAGGCCGAGG AATTGGGCAT CACCCAGGCC ACTATCGACA GCTTCCTTGG CAGCGATAAT CCGAGCATCC GCCGCTTCCT GGGCGTTGAT CCCAGTCTGG CCGAGGCCAC CGGCCTCGAT GCCAAATTCG CCTATAACAT CATCAAGGCC CTGGGCAATT ACGGCGAGGT GTTCGAACGC TCGGTCGGCA AGGCCAGCAA GCTCGGCTTC GAACGGGGCT ATAACCAGCC CTGGACCCAA GGCGGCTTGC TGTATTCGCC GCCGTTCCGC TGA
|
Protein sequence | MVKQLKKIAF KAQAAGLAAA LALCGLGFSS SAEAGETLDA IKARGVIKIG VGGNSPGFSA PDSAGRWQGF FIDIGRALAV TVFNDPEKAE FVNSSPQQRL PALQSGEFDI LLSGVTQTIT RATKLGFHFG PVVFYDGQGL LVPKKLGITK GSELDGATVC VQTGTTGELN IADFFRQHKI SFKPVVIEES NEFLKAFASG RCDVLTQDSS DLAIRRTLLP NAADYVLLPE RISKEPLAPA IRYGDDRWLE IVNWTVYALI EAEELGITQA TIDSFLGSDN PSIRRFLGVD PSLAEATGLD AKFAYNIIKA LGNYGEVFER SVGKASKLGF ERGYNQPWTQ GGLLYSPPFR
|
| |