Gene Information |
Locus tag | RPB_0619 |
Symbol | |
ID | 3908312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 698153 |
End bp | 699034 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882508 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_484241 |
Protein GI | 86747745 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
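The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the 882 bp reported here) and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (RPB_0619).
length = gene_length(698153, 699034)
print(length)                  # 882
print(protein_length(length))  # 293
```

The strand field does not affect this calculation: a gene on the minus strand spans the same coordinates, it is simply read as the reverse complement.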
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0236093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.724235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAC TGCCCGAAGT CGAAACCGTC CGCCTCGGGC TGCAGCCGGC CATGGAGGGA TTCCGGATCG ACCGCGCGAT GGCGAACCGC TGCGATCTGC GGTTTCCGTT TCAACCCGAC TTCGCCGCGC GGCTGACCGG GCAGACCATC ACCGGGCTGG GGCGCCGCGC CAAATATCTG TTGGCCGATC TGTCGAGCGG CGACGTGCTG CTGATGCATC TCGGCATGTC GGGCTCGTTC CGGGTGGTGA ACGGCGCCGG CGACGCGACG CCGGGCGAGT TTCACCATCC GCGCAGCGAG GACCGCACCC ACGATCACGT CGTGTTCGAG ATGTCGAGCG GCGCGCGGGT GATCTTCAAC GACCCGCGGC GATTCGGCTT CATGAAGATC TTTGCGCGCG CCGCGATCGA CGACGAGCCG CATCTCAAGG GGCTCGGGCC CGAGCCATTG GGCAACGCCT TCGACGCCGC GATGCTGGCG CGGGCCTGCG CCGGCAAGCA GACCAGCCTG AAGGCGGCGC TGCTCGACCA GCGCGTCGTC GCCGGGCTCG GCAACATTTA TGTCTGCGAG GCGCTGTGGC GCGCGCATCT GTCGCCGAAG AGGAAGGCGT CGACGCTGGC CGACCGCAAA GGCGCGCCGA CCGATCGCGC CGTGCGGCTG GTGGATGCAA TCCGCGCCGT GCTCGGCGAC GCCATCAAGG CCGGCGGCTC GTCACTGCGC GACCACCGCC AGACCTCCGG CGAACTCGGT TACTTCCAGC ATTCCTTCGC GGTGTACGAC CGCGAAGGCG AGCGCTGCCG CACGCCGGGA TGCAATGGGA CGGTGAAGCG GCTGGTGCAG AACGGGCGGT CGACGTTCTG GTGTTCGGGT TGCCAGACGT AG
Protein sequence | MPELPEVETV RLGLQPAMEG FRIDRAMANR CDLRFPFQPD FAARLTGQTI TGLGRRAKYL LADLSSGDVL LMHLGMSGSF RVVNGAGDAT PGEFHHPRSE DRTHDHVVFE MSSGARVIFN DPRRFGFMKI FARAAIDDEP HLKGLGPEPL GNAFDAAMLA RACAGKQTSL KAALLDQRVV AGLGNIYVCE ALWRAHLSPK RKASTLADRK GAPTDRAVRL VDAIRAVLGD AIKAGGSSLR DHRQTSGELG YFQHSFAVYD REGERCRTPG CNGTVKRLVQ NGRSTFWCSG CQT
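The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be normalized before any computation such as a GC content check. The following is a minimal sketch of that step; `normalize_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full 882 bp.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCCTGAAC TGCCCGAAGT"
seq = normalize_sequence(first_blocks)
print(len(seq))                    # 20
print(round(gc_fraction(seq), 2))  # 0.55
```

Running the same two functions on the full gene sequence would give a value to compare against the GC content field of the record; the 0.55 above applies only to the 20-base excerpt.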