Gene Information |
Locus tag | Rru_A3157 |
Symbol | |
ID | 3836603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3648198 |
End bp | 3649415 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637827272 |
Product | hypothetical protein |
Protein accession | YP_428239 |
Protein GI | 83594487 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
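
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends with a single stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3648198, 3649415)
print(length_bp)                  # 1218
print(protein_length(length_bp))  # 405
```

Under these assumptions the results agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields.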
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.842057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCAAGCC CGCAATCGAC GCCAGACGAT CACTCCCCCA AGAACACATC CGAAGGAAAA CGGCGCCGTT CTCCGGCGGC GTCTGCCCTG TGGATCGTTG TTTTCCTGCT TTTTCTGATC TGCGTCGGGC TTTCGCTTTA TCTGTTTTTC AGAACCCCCG AAATCACCCG GAAAACGGCC GCCGTCGCCA CCCCGGGCGC GACCCAGGAC ACCGGCGAGC GCGCGGCTTA TCTCGCCCAG CAGATCGCCC TGCGCCGGGT CGAGTTAGGC GACCTTGTCG CCGCCATCGA TCCGCCGCGC TGCGTCGCCC CGGCCCAAAT CGACGAAAGC CGCCTGCGCC TTTTGCGCCA GCAGCGCGGC GACGCCCTGT CCCTTTGGCG ATCCTTGAGC ACCGGGGGCA AAACCGGGGC CACCCAGGCG ATCCCCGACA CCATCCCCGA ACCGGCGGTG GCCACGGCTT TGCCCGGGGC GACGCCGCCG CTTGCCCCCG CCACGCCGCA AGCCGCCCCT GGGGAAAGCG CACCGACGAT GGGGATCGGG CCGTTGCGCG ATCGGCTGGA AAAGGCCTCG GTCATCGTCC TTGGCCTGCC CAAGGGCGAG GCCGACGGCC TGATGACCGG CACCGGTTTT TTCATCGCCG ATAACCTTGT CGTGACCAAC CGCCATGTGA TCGACGCCGC CGATCCGGCC AAGCTGTTCA TCACCAGTTC CAGCCTGGGC AAGATGGCCC AGGTGTCGCT GATCGCCACC ACCGTCTCGG CGGTTCCGGG GGAAGCCGAT TACGCGGTGC TGGGAACGGG AAGCGTGCGC GCCCCGGCCA CCCTCGCCCT GTCGCTCGAC GCCGGCAAGC TGACGCCGGT GATCGCGGCG GGCTATCCCG GCATGGCGCT GCTGGGCGAT CAGGGCTTCC AAAAGCTGAT CCAGGGCGAT CTGTCCTCGG CCCCCGATCT CAACATGAAC CGGGGCGAGG TCCGCTCGGT GCGCCCGGTC GGCGCCATCG TCCAGATCAT CCATACCGCC GATGTTCTGC AAGGCTACAG CGGCGGTCCG CTGCTTGACA CCTGCGGCCG GGCGATCGGC GTCAACACCT TCATCCAGGT CGATCGCGAT CAGGCCGCCA AGCTCAACAG CGCCCAGAAG GTTGATACCC TGCTGGCCTT CCTGCAAAAA AAGGGCATCA CCCCGGCCCT CGACAGCCGC GCCTGTCAGC CCGGCTGA
Protein sequence | MSSPQSTPDD HSPKNTSEGK RRRSPAASAL WIVVFLLFLI CVGLSLYLFF RTPEITRKTA AVATPGATQD TGERAAYLAQ QIALRRVELG DLVAAIDPPR CVAPAQIDES RLRLLRQQRG DALSLWRSLS TGGKTGATQA IPDTIPEPAV ATALPGATPP LAPATPQAAP GESAPTMGIG PLRDRLEKAS VIVLGLPKGE ADGLMTGTGF FIADNLVVTN RHVIDAADPA KLFITSSSLG KMAQVSLIAT TVSAVPGEAD YAVLGTGSVR APATLALSLD AGKLTPVIAA GYPGMALLGD QGFQKLIQGD LSSAPDLNMN RGEVRSVRPV GAIVQIIHTA DVLQGYSGGP LLDTCGRAIG VNTFIQVDRD QAAKLNSAQK VDTLLAFLQK KGITPALDSR ACQPG
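
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal Python sketch for joining such blocks and computing a GC fraction. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
prefix = join_blocks("ATGTCAAGCC CGCAATCGAC GCCAGACGAT")
print(len(prefix))                    # 30
print(round(gc_fraction(prefix), 3))  # 0.567
```

Applying `join_blocks` to the full gene sequence gives a string whose length can be compared against the Gene Length field, and `gc_fraction` on that string can be compared against the GC content field.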