Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0087 |
Symbol | |
ID | 6407730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 96707 |
End bp | 97582 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642709996 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001989125 |
Protein GI | 192288520 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAAC TCCCCGAAGT CGAAACCGTC CGCCGCGGGC TGCAGCCGGC GATGGAGGGC TTTCGGATCG ACCGCGCGGT GGCGCACCGG GAGAACCTGC GGTTTCCGCT GCAGAAGGAC TTCGCCGCCC GGCTCACCGG CCAGACCGTC ACCGGGCTCG GCCGGCGTGC CAAATATCTG CTGGCGGACC TGTCGAGCGG CGACGTGCTG CTGATGCATC TCGGGATGTC CGGCTCGTTC CGGGTGATCG GGGCCGATGG CGAGACCACG CCGGGCGAGT TTCACTATCC GCGCAGCGAG GACCGCACCC ACGATCACGT GGTGTTCGAA ATGGCGTCCG GCGCCCGCGT CGTGTTCAAC GATCCGCGCC GGTTCGGCTT CATGAAGGTG TTTCCGCGCA GCGAGATCGA AACCGAGCCG CACCTGAAGG GCCTCGGTCC CGAGCCGCTC GGCAACGCGT TCGACGCCAG CCTGCTGGCG AAGGCCTGCG CCGGCAAGCA AACCAGCCTC AAGGCCGCGC TGCTCGATCA GCGCGTGGTT GCCGGGCTCG GCAACATCTA TGTTTGCGAG GCGCTGTTTC GCGCGCACCT GTCACCAAAG CGCAAAGCCT CGACGCTGGC CAATCGCAAG GAAGAGCCGA CCGACCACGC GGTGCGGCTG ACCGAGGCGA TCCGCGAGGT GCTGGGCGAA GCGATCAAGG CCGGCGGCTC ATCGCTCCGC GACCACCGCC AGACCAGCGG TGAGCTCGGT TACTTCCAGC ACGCGTTCAA GGTGTACGAC CGCGAAGGCA AGCCGTGCCC GACCTGCGGC GGCACGGTGC AACGCTTCGT GCAGAACGGC CGGTCGACGT TCTGGTGCCC GAAGTGCCAG AAGTGA
|
Protein sequence | MPELPEVETV RRGLQPAMEG FRIDRAVAHR ENLRFPLQKD FAARLTGQTV TGLGRRAKYL LADLSSGDVL LMHLGMSGSF RVIGADGETT PGEFHYPRSE DRTHDHVVFE MASGARVVFN DPRRFGFMKV FPRSEIETEP HLKGLGPEPL GNAFDASLLA KACAGKQTSL KAALLDQRVV AGLGNIYVCE ALFRAHLSPK RKASTLANRK EEPTDHAVRL TEAIREVLGE AIKAGGSSLR DHRQTSGELG YFQHAFKVYD REGKPCPTCG GTVQRFVQNG RSTFWCPKCQ K
|
| |