Gene YpAngola_A0151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0151 
SymbolmutY 
ID5798615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp157248 
End bp158366 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content50% 
IMG OID641338173 
Productadenine DNA glycosylase 
Protein accessionYP_001604780 
Protein GI162418954 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.523345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000764149 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGCAAG CGCAACAATT CGCGCACGTG GTACTTGATT GGTACCAACA CTTTGGCCGC 
AAAACCCTGC CATGGCAGTT GGATAAGACC CCCTATCAAG TATGGCTGTC AGAAGTGATG
TTGCAACAAA CTCAGGTTGC GACCGTCATC CCCTATTTTC AACGTTTTAT GCTGCGCTTC
CCTGATATTC AGGCACTGGC GGCTGCGCCG TTGGATGATG TACTGCATTT ATGGACCGGT
TTGGGTTACT ACGCCCGTGC CAGAAACCTG CATAAAGCGG CCCAAATGGT CGTGGAACAC
CATCAAGGGG AGTTTCCCAC AACATTTGAC CAGATACTGG CATTGCCGGG TATCGGGCGC
TCAACTGCCG GGGCTATTTT ATCGCTGTCT TTAGGCCAGC ATTTTCCTAT TTTGGATGGC
AACATCAAAC GGGTGCTGGC CCGTTGCTAT GCCGTTGACG GCTGGCCGGG AAAAAAAGAG
GTCGAAGGCC GCCTGTGGCA AATCAGCGAA GATGTCACAC CCGCCAACGG GGTGGGCCAG
TTTAATCAGG CAATGATGGA TTTAGGCGCG ATGGTGTGTA CTCGCTCTAA ACCTAAATGT
GAACTTTGCC CATTGAATAT CGGCTGTATG GCGTACGCTA ACCACAGTTG GGCGCGCTAT
CCGGGCAAAA AACCTAAACA GACGTTGCCG GAAAAAACCG CCTGGTTCTT ATTAATGCAA
AATGGATCGC AAGTGTGGCT CGAACAGCGC CCCCCAGTCG GCTTATGGGG CGGCTTATTC
TGTTTCCCAC AATTTGCTGA ACAAGAAGAA CTCATTCACT GGCTGCAAAA ACAGGGTATT
CCCGCCAATG AAACCCAGCA GTTAACCGCG TTTCGCCATA CGTTTAGTCA TTTCCATCTG
GATATAGTCC CTATATGGCT AAATACGGCC TCAGTCCGAG GATGCATGGA TGATGGCGCA
GGTCTCTGGT ATAACTTAGC CCAGCCACCT TCGGTAGGGT TAGCTGCTCC GGTTGAGCGT
TTATTGCATC AGTTATTAAA AGATCCGTTG GCAAAAGATG AGTTAACGCA ACAACAACTC
ACAAAGCAAT CGCCTACCCA ACCAGCTTTA TTTGACTAG
 
Protein sequence
MMQAQQFAHV VLDWYQHFGR KTLPWQLDKT PYQVWLSEVM LQQTQVATVI PYFQRFMLRF 
PDIQALAAAP LDDVLHLWTG LGYYARARNL HKAAQMVVEH HQGEFPTTFD QILALPGIGR
STAGAILSLS LGQHFPILDG NIKRVLARCY AVDGWPGKKE VEGRLWQISE DVTPANGVGQ
FNQAMMDLGA MVCTRSKPKC ELCPLNIGCM AYANHSWARY PGKKPKQTLP EKTAWFLLMQ
NGSQVWLEQR PPVGLWGGLF CFPQFAEQEE LIHWLQKQGI PANETQQLTA FRHTFSHFHL
DIVPIWLNTA SVRGCMDDGA GLWYNLAQPP SVGLAAPVER LLHQLLKDPL AKDELTQQQL
TKQSPTQPAL FD