Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2121 |
Symbol | |
ID | 3835548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 2462962 |
End bp | 2464158 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637826223 |
Product | hypothetical protein |
Protein accession | YP_427208 |
Protein GI | 83593456 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.160147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTTG TCTATCTGGG CACCTATGAC CACGACAAGC CGCGCAACCG CCTGATGATC GCGGCCCTGC GCGCCGCCGG CGTCGAGGTG CGCGAGTGTT GCGTCGATGT CTGGGCCGGC GTGGCCGACA AAAGCCTGAT CGGCGGCTGG GCCCGCGTAA CCCGGCTTTT GCGCTGGTTG GCGGCCTATC CGCTGCTGAT CGTCCGCTAT CTGCGCTGTC CGCCCCATGA CGCCGTGGTT TTTGGCTATC TCGGCCATCT CGATGTGCTG ATTCTCGGCC CCTTCGCCCG CCTGCGCCGG GTCGCCGTTG TTTGGGACGT CTTTCTTTCC CTTCCCGATA CGGTGGTTGG CGACCGCAAG ATGATCTCGC CGCGCCATCC GCTGGCCCAT CTCTTACGCT GGGGCGAAGG CTTGGCCTGC CGCCGCGTCG ATCTGGCGCT GATGGATACC CGCGCCCAAT CGCGGCTGCT GGAAGACGCC TATGGCCTGC CGCTGGCGCG CTCGGGCCAT GTTTTCGTCG GCGCGGAACT CGATCGCTTC CCCTATCTGC CGCCCCGCCC GCCCCCCGGC CCCGGCCAGC CGCTTACCGT GTTGTTCTAC GGCCAGTTCA TCGCCTTGCA TGGCATTGAA ACCATCCTGC GCGCCGCCGC CCTTGATCGC GAGCGGCGGG TGGCGTGGCG GCTGATCGGC AGCGGCCAGG AAGCCGGCAA AATCCAGGCG ATGATGGAGG TCACCCCCGG TCTGGCCATT CACTGGCAGC GGTGGGTCGA TTACGAGCGG TTGATCGAGG CGGTGGCGGC GGCCGATGTC TGCCTGGGGA TCTTCGGCGG CTCGGATAAG GCGGGGCGGG TCATCGCCAA TAAGGTGTTC CAGATCGCCG CGACCGGCCG GCCTTTGGTG ACCCGCCAAT CGCCGGCGAT CGGCGAATTA TTCGAGCCGG GACTGGCCGG TATCCGCCTA ATCCCCCCCG AAGACCCCGC CGCCCTGCTC CAAGCCGTTC TCGAGCTTGG CCACAGCGGG GAAAGCCTTC CCCAGGATCT GCGCGAGCGC TTCTCGCCCG CCGCCCTGGG CGCCCGCCTA AGCGCCCTGA TCGCCCGCAC ACTCGCGGCG AAGGCGGATC CGCCGCGCCA GCAGCGACCC GATGACGAAG CCCATCCCAT CGCCGAGGAC GGTGACAAGG CGCAGCAGAA GCGATAG
|
Protein sequence | MRVVYLGTYD HDKPRNRLMI AALRAAGVEV RECCVDVWAG VADKSLIGGW ARVTRLLRWL AAYPLLIVRY LRCPPHDAVV FGYLGHLDVL ILGPFARLRR VAVVWDVFLS LPDTVVGDRK MISPRHPLAH LLRWGEGLAC RRVDLALMDT RAQSRLLEDA YGLPLARSGH VFVGAELDRF PYLPPRPPPG PGQPLTVLFY GQFIALHGIE TILRAAALDR ERRVAWRLIG SGQEAGKIQA MMEVTPGLAI HWQRWVDYER LIEAVAAADV CLGIFGGSDK AGRVIANKVF QIAATGRPLV TRQSPAIGEL FEPGLAGIRL IPPEDPAALL QAVLELGHSG ESLPQDLRER FSPAALGARL SALIARTLAA KADPPRQQRP DDEAHPIAED GDKAQQKR
|
| |