Gene Information |
Locus tag | YpAngola_A0151 |
Symbol | mutY |
ID | 5798615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 157248 |
End bp | 158366 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641338173 |
Product | adenine DNA glycosylase |
Protein accession | YP_001604780 |
Protein GI | 162418954 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
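The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(157248, 158366)
print(length)                     # 1119
print(protein_length_aa(length))  # 372
```

Both results match the Gene Length (1119 bp) and Protein Length (372 aa) rows.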
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.523345 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00000764149 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGCAAG CGCAACAATT CGCGCACGTG GTACTTGATT GGTACCAACA CTTTGGCCGC AAAACCCTGC CATGGCAGTT GGATAAGACC CCCTATCAAG TATGGCTGTC AGAAGTGATG TTGCAACAAA CTCAGGTTGC GACCGTCATC CCCTATTTTC AACGTTTTAT GCTGCGCTTC CCTGATATTC AGGCACTGGC GGCTGCGCCG TTGGATGATG TACTGCATTT ATGGACCGGT TTGGGTTACT ACGCCCGTGC CAGAAACCTG CATAAAGCGG CCCAAATGGT CGTGGAACAC CATCAAGGGG AGTTTCCCAC AACATTTGAC CAGATACTGG CATTGCCGGG TATCGGGCGC TCAACTGCCG GGGCTATTTT ATCGCTGTCT TTAGGCCAGC ATTTTCCTAT TTTGGATGGC AACATCAAAC GGGTGCTGGC CCGTTGCTAT GCCGTTGACG GCTGGCCGGG AAAAAAAGAG GTCGAAGGCC GCCTGTGGCA AATCAGCGAA GATGTCACAC CCGCCAACGG GGTGGGCCAG TTTAATCAGG CAATGATGGA TTTAGGCGCG ATGGTGTGTA CTCGCTCTAA ACCTAAATGT GAACTTTGCC CATTGAATAT CGGCTGTATG GCGTACGCTA ACCACAGTTG GGCGCGCTAT CCGGGCAAAA AACCTAAACA GACGTTGCCG GAAAAAACCG CCTGGTTCTT ATTAATGCAA AATGGATCGC AAGTGTGGCT CGAACAGCGC CCCCCAGTCG GCTTATGGGG CGGCTTATTC TGTTTCCCAC AATTTGCTGA ACAAGAAGAA CTCATTCACT GGCTGCAAAA ACAGGGTATT CCCGCCAATG AAACCCAGCA GTTAACCGCG TTTCGCCATA CGTTTAGTCA TTTCCATCTG GATATAGTCC CTATATGGCT AAATACGGCC TCAGTCCGAG GATGCATGGA TGATGGCGCA GGTCTCTGGT ATAACTTAGC CCAGCCACCT TCGGTAGGGT TAGCTGCTCC GGTTGAGCGT TTATTGCATC AGTTATTAAA AGATCCGTTG GCAAAAGATG AGTTAACGCA ACAACAACTC ACAAAGCAAT CGCCTACCCA ACCAGCTTTA TTTGACTAG
|
Protein sequence | MMQAQQFAHV VLDWYQHFGR KTLPWQLDKT PYQVWLSEVM LQQTQVATVI PYFQRFMLRF PDIQALAAAP LDDVLHLWTG LGYYARARNL HKAAQMVVEH HQGEFPTTFD QILALPGIGR STAGAILSLS LGQHFPILDG NIKRVLARCY AVDGWPGKKE VEGRLWQISE DVTPANGVGQ FNQAMMDLGA MVCTRSKPKC ELCPLNIGCM AYANHSWARY PGKKPKQTLP EKTAWFLLMQ NGSQVWLEQR PPVGLWGGLF CFPQFAEQEE LIHWLQKQGI PANETQQLTA FRHTFSHFHL DIVPIWLNTA SVRGCMDDGA GLWYNLAQPP SVGLAAPVER LLHQLLKDPL AKDELTQQQL TKQSPTQPAL FD
|
| |
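The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how such a record could be cleaned, translated, and checked for GC content follows. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted; this gene starts with ATG, so no special handling of alternative start codons is included. All function names are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), listed in
# T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGATGCAAG CGCAACAATT CGCGCACGTG"))  # MMQAQQFAHV
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to cross-check the two rows; `gc_percent` can be compared with the GC content field in the same manner.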