Gene YPK_0820 details

Gene Information

Locus tag: YPK_0820
Symbol:
ID: 6090308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 936615
End bp: 937874
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 50%
IMG OID: 641595882
Product: adenine DNA glycosylase
Protein accession: YP_001719574
Protein GI: 170023069
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
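
The gene and protein lengths listed above follow from the start and end coordinates. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(936615, 937874)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both values agree with the Gene Length and Protein Length fields of this record.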


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.746728
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCGAG TAACGAGCGC CGCTAACGCC GCTGCGGCGT CAAGGACGAA GAGTATAAGG 
AAAGTTCCGG TTTACAGCAC AAGAACTCTA TGCTGCAATC CAGCTCTCAT TTATTTACCT
GTCGGATATC GATACCTGCT TATGATGCAA GCGCAACAAT TCGCGCACGT GGTACTTGAT
TGGTACCAAC ACTTTGGCCG CAAAACCCTG CCATGGCAGT TGGATAAGAC CCCCTATCAA
GTATGGCTGT CAGAAGTGAT GTTGCAACAA ACTCAGGTTG CGACCGTCAT CCCCTATTTT
CAACGTTTTA TGCTGCGCTT CCCTGATATT CAGGCACTGG CGGCTGCGCC GTTGGATGAA
GTACTGCATT TATGGACCGG TTTGGGTTAC TACGCCCGTG CCAGAAACCT GCATAAAGCG
GCCCAAATGG TCGTGGAACA CCATCAAGGG GAGTTTCCCA CAACATTTGA CCAGATACTG
GCATTGCCGG GTATCGGGCG CTCAACTGCC GGGGCTATTT TATCGCTGTC TTTAGGCCAG
CATTTTCCTA TTTTGGATGG CAACGTCAAA CGGGTGCTGG CCCGTTGCTA TGCCGTTGAC
GGCTGGCCGG GAAAAAAAGA GGTCGAAAGC CGTCTGTGGC AAATCAGCGA AGATGTCACG
CCCGCCAACC GGGTGGGCCA GTTTAATCAG GCAATGATGG ATTTAGGCGC GATGGTGTGT
ACTCGCTCTA AACCTAAATG TGAACTTTGC CCATTGAATA TCGGCTGTAT GGCGTACGCT
AACCACAGTT GGGCGCGCTA TCCGGGCAAA AAACCTAAAC AGACGTTGCC GGAAAAAACC
GCCTGGTTCT TATTAATGCA AAATGGATCG CAAGTGTGGC TCGAACAGCG CCCCCCAGTC
GGCTTATGGG GCGGCTTATT CTGTTTCCCA CAATTTGCTG AACAAGAAGA ACTCATTCAC
TGGCTGCAAA AACAGGGTAT TCCCGCCAAT GAAACCCAGC AGTTAACCGC GTTTCGCCAT
ACGTTTAGTC ATTTCCATCT GGATATAGTC CCTATATGGC TAAATACGGC CTCAGTCCGA
GGATGCATGG ATGATGGCGC AGGTCTCTGG TATAACTTAG CCCAGCCACC TTCTGTAGGG
TTAGCTGCTC CGGTTGAGCG TTTATTGCAT CAGTTATTAA AAGATCCGTT GGCAAAAGAT
GAGTTAACGC AACAACAACT CACAAAGCAA TCGCCTACCC AACCAGCTTT ATTTGACTAG
 
Protein sequence
MDRVTSAANA AAASRTKSIR KVPVYSTRTL CCNPALIYLP VGYRYLLMMQ AQQFAHVVLD 
WYQHFGRKTL PWQLDKTPYQ VWLSEVMLQQ TQVATVIPYF QRFMLRFPDI QALAAAPLDE
VLHLWTGLGY YARARNLHKA AQMVVEHHQG EFPTTFDQIL ALPGIGRSTA GAILSLSLGQ
HFPILDGNVK RVLARCYAVD GWPGKKEVES RLWQISEDVT PANRVGQFNQ AMMDLGAMVC
TRSKPKCELC PLNIGCMAYA NHSWARYPGK KPKQTLPEKT AWFLLMQNGS QVWLEQRPPV
GLWGGLFCFP QFAEQEELIH WLQKQGIPAN ETQQLTAFRH TFSHFHLDIV PIWLNTASVR
GCMDDGAGLW YNLAQPPSVG LAAPVERLLH QLLKDPLAKD ELTQQQLTKQ SPTQPALFD
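
The relationship between the gene sequence and the protein sequence can be checked by translating codons. The sketch below is illustrative only: it builds the standard codon table (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons, which this sketch ignores) and translates the first 60 bases of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_60 = "ATGGACCGAG TAACGAGCGC CGCTAACGCC GCTGCGGCGT CAAGGACGAA GAGTATAAGG"
print(translate(first_60))  # MDRVTSAANAAAASRTKSIR
```

The output matches the first 20 residues of the protein sequence listed above.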