Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1199 |
Symbol | |
ID | 4021675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1357166 |
End bp | 1358296 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637961391 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_568338 |
Protein GI | 91975679 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCCA TGAGTCTAGC AGGCGCGGCG CCGCGGCGGA AACAATCCCC GGTCATCACA GCAGGCGAAC GCACCGGCCC TCACGGCGGG GAGCGGCCGG CGGCGCTGCT CGCCTGGTAT GATCGCCACC GCCGCATTCT GCCCTGGCGG CCGCCCGCCG GCGTCCCGGC CGACCCCTAT GCGGTGTGGC TGTCGGAAAT CATGCTCCAG CAAACCACCG TCCGCGCGGT CGGGCCGTAT TTCGAGAAAT TCATGGCGCG CTGGCCGAGC GTGAAGGCGC TGGGCGAGGC CTCGCTCGAC GACGTGCTGC GGATGTGGGC CGGGCTTGGC TATTACTCGC GCGCCCGCAA TCTGCACGCC TGCGCGGTCG CGGTGACGCG CGACCATGGC GGCGCGTTTC CGGACACCGA ACAAGGCCTG CGCGCGCTGC CGGGCGTCGG TCCCTATACG GCGGCGGCGA TCGCGGCGAT CGCTTTCGGC CGCCAGACCA TGCCGGTCGA CGGCAATATC GAGCGCGTGG TGTCACGGCT CCATGCGGTC GAGGAGGAAT TGCCGAAGGC GAAGCCGCGC ATTCAAGAAC TCGCGGCGAC GCTGCTCGGA CCAGAGCGCG CCGGCGACAG CGCGCAGGCG CTGATGGATC TCGGCGCCAC CATCTGTACG CCGAAGAAGC CGGCCTGCGC GCTATGCCCT TTGAACGACG GCTGCGTCGC GCGGCTGCGC GGCGATGCCG AGACGTTTCC GCGAAAAGCG CCGAAGAAGA CCGGTGCGCT GCGCCGCGGC GCCGCCTTCG TGGTGACGCG CGGCGATCAG CTGCTGCTCC GCAGCCGCGC GGCGAAAGGC CTGCTCGGCG GCATGACCGA AGTGCCGAAT TCCGACTGGC GCGCCGATCA GGACGATGCT GTCGCGCGCG CGCAGGCGCC GGCTCTGAAA GGCGTCACGC GCTGGCAGCG CAAGCCGGGC GTCGTCACCC ACGTGTTCAC GCACTTCCCG CTGGAGTTGG TAGTCTACAC CGCGCAGGCG CCGGCCGGAA CGCGCGCGCC GGCGGGGATG CGTTGGGCAG AGGTCGCGAC GCTCGCCGAC GAAGCCCTGC CCAATCTGAT GCGCAAGGTG ATCGCCCATG CGCTGGACTA A
|
Protein sequence | MRAMSLAGAA PRRKQSPVIT AGERTGPHGG ERPAALLAWY DRHRRILPWR PPAGVPADPY AVWLSEIMLQ QTTVRAVGPY FEKFMARWPS VKALGEASLD DVLRMWAGLG YYSRARNLHA CAVAVTRDHG GAFPDTEQGL RALPGVGPYT AAAIAAIAFG RQTMPVDGNI ERVVSRLHAV EEELPKAKPR IQELAATLLG PERAGDSAQA LMDLGATICT PKKPACALCP LNDGCVARLR GDAETFPRKA PKKTGALRRG AAFVVTRGDQ LLLRSRAAKG LLGGMTEVPN SDWRADQDDA VARAQAPALK GVTRWQRKPG VVTHVFTHFP LELVVYTAQA PAGTRAPAGM RWAEVATLAD EALPNLMRKV IAHALD
|
| |