Gene Information |
Locus tag | RPB_1080 |
Symbol | |
ID | 3908932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1239310 |
End bp | 1240470 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882973 |
Product | radical SAM family protein |
Protein accession | YP_484701 |
Protein GI | 86748205 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1533] DNA repair photolyase |
TIGRFAM ID | |
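
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated 1161 bp) and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1239310
END_BP = 1240470


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) rows.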
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.981176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.301506 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGAG CATCCAATGC CCTCAAGCGC CCGCCGGTCA CGGCGCCCTC CGAGCCGGTG GGCGCAATCT CTCCGTTTCC TGAGATTGAA ATTGCGATCG ACAAGCAGCG GCGGCGCGGC CGCGGCGCGC AATCCAACGA GTCGGGCCGC TACGAAGCCG AGGCGCGGGT CGCGTTCGAT GATGGCTGGC AGAGCCTCGA CGAGCTGCCG CCGTTCAAGA CCACGGTCGC GCTCGACACC GCGCGGAAAG TCATCACCCG CAACGAGTCG CCGGATATCG GCTTCGATCG TTCGATCAAT CCGTATCGCG GCTGCGAGCA CGGCTGCGTC TATTGCTTCG CGCGGCCGAC CCATGCCTAT CTCGGCCTGT CGCCGGGGCT GGATTTCGAA TCGCGGCTGT TCGCCAAGCC GGATGCGCCG GCGCTGCTGG AGAAAGAACT CGCCGCTGCC GACTATCAGC CGCGGATGAT CGCGATCGGT ACCAATACCG ACCCGTATCA GCCGATCGAG CGCGAGCACA AGATCATGCG GGGCGTTCTC GAAGTGCTGG AGAAGACCGG CCATCCGGTC GGCATCGTCA CCAAATCGGC GCTGGTCACG CGTGACATCG ACATTCTGGC GCGGATGGCG AAGCGCCAGC TCGCCAAGGT CGCGCTGTCG GTGACATCGC TGGATCCGAA ACTGGCGCGC ACCATGGAGC CGCGCGCCTC CGCGCCTGAG AAGCGGCTGG AAGCGCTGAA GCGGCTCTCC GAGGCCGGGA TTCCGACCAC CGTGATGGTG GCGCCGGTGA TCCCGGCGCT CAACGATGTG GAGATCGAGC GCATCCTCGA CGCCGCCGCC CATGCCGGCG TCAAGGAGGC CAGCTACGTG ATGCTGCGGC TGCCGCTGGA AGTGCGCGAC CTGTTCCGCG AATGGCTGAT GGCGAACTAT CCGGATCGCT ACCGCCACGT CTTCACCCTG ATCCGCGACA TGCGCGGCGG CCGCGACTAC GATTCGCAAT GGGGCACGCG GATGAAAGGC ACCGGCCCGA TCGCCTGGAT GATCGGTCGC CGCTTCGAGA CCGCCTGCGC GCGGCTCGGC CTCAACAAGC GCCGCTCGAA ATTGACGACG GATCATTTCG AAAAGCCGGA GCGGGCGGGG CAGCAGCTGA GTTTGTTCTA G
|
Protein sequence | MSRASNALKR PPVTAPSEPV GAISPFPEIE IAIDKQRRRG RGAQSNESGR YEAEARVAFD DGWQSLDELP PFKTTVALDT ARKVITRNES PDIGFDRSIN PYRGCEHGCV YCFARPTHAY LGLSPGLDFE SRLFAKPDAP ALLEKELAAA DYQPRMIAIG TNTDPYQPIE REHKIMRGVL EVLEKTGHPV GIVTKSALVT RDIDILARMA KRQLAKVALS VTSLDPKLAR TMEPRASAPE KRLEALKRLS EAGIPTTVMV APVIPALNDV EIERILDAAA HAGVKEASYV MLRLPLEVRD LFREWLMANY PDRYRHVFTL IRDMRGGRDY DSQWGTRMKG TGPIAWMIGR RFETACARLG LNKRRSKLTT DHFEKPERAG QQLSLF
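
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid map shared by NCBI translation tables 1 and 11, strips the 10-base spacing used in this record, and translates up to the first stop codon. It does not handle the alternative start codons permitted by table 11 (this gene starts with ATG, so that does not matter here), and it raises `KeyError` on ambiguous bases such as N. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest: T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
print(translate("ATGAGCAGAG CATCCAATGC CCTCAAGCGC"))  # MSRASNALKR
print(translate("CAGCAGCTGA GTTTGTTCTA G"))           # QQLSLF
```

The two outputs match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row.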