Gene EcHS_A3249 details

Gene Information

Locus tag: EcHS_A3249
Symbol: mug
ID: 5592697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3258412
End bp: 3258918
Gene Length: 507 bp
Protein Length: 168 aa
Translation table: 11
GC content: 53%
IMG OID: 640922366
Product: G/U mismatch-specific DNA glycosylase
Protein accession: YP_001459862
Protein GI: 157162544
COG category: [L] Replication, recombination and repair
COG ID: [COG3663] G:T/U mismatch-specific DNA glycosylase
TIGRFAM ID: [TIGR00584] mismatch-specific thymine-DNA glycosylase (mug)
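
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for the example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3258412
end_bp = 3258918

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the Gene Length (507 bp) and Protein Length (168 aa) fields.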


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value: 0.472365
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGAGG ATATTTTGGC TCCAGGGTTA CGGGTCGTGT TTTGCGGTAT CAACCCTGGG 
CTTTCATCCG CCGGGACTGG TTTTCCCTTT GCTCATCCGG CAAATCGCTT CTGGAAGGTG
ATATATCAGG CCGGGTTTAC CGACCGTCAG TTGAAGCCGC AGGAGGCACA GCATCTGCTG
GATTATCGTT GTGGCGTCAC CAAACTGGTT GACAGGCCAA CGGTGCAAGC CAGTGAAGTT
TCAAAGCAGG AGTTGCACGC AGGTGGGCGT AAGTTGATTG AAAAAATTGA GGATTATCAG
CCGCAGGCGT TGGCGATTCT GGGCAAACAA GCATATGAAC AGGGATTCAG CCAGCGCGGT
GCACAGTGGG GGAAACAAAC GCTCACCATT GGTTCGACGC AGATTTGGGT GCTGCCAAAT
CCCAGCGGTT TAAGTCGCGT TTCACTAGAG AAACTGGTTG AAGCGTATCG CGAGCTGGAC
CAGGCGCTGG TAGTGCGTGG GCGATAA
 
Protein sequence
MVEDILAPGL RVVFCGINPG LSSAGTGFPF AHPANRFWKV IYQAGFTDRQ LKPQEAQHLL 
DYRCGVTKLV DRPTVQASEV SKQELHAGGR KLIEKIEDYQ PQALAILGKQ AYEQGFSQRG
AQWGKQTLTI GSTQIWVLPN PSGLSRVSLE KLVEAYRELD QALVVRGR
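
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the standard amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two tables differ only in which codons may serve as starts). The names CODON_TABLE, clean, and translate are hypothetical helpers introduced for this example.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each
# position. Translation table 11 uses the same assignments as table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Sequences copied from the record; the 10-character grouping is kept.
GENE = """
ATGGTTGAGG ATATTTTGGC TCCAGGGTTA CGGGTCGTGT TTTGCGGTAT CAACCCTGGG
CTTTCATCCG CCGGGACTGG TTTTCCCTTT GCTCATCCGG CAAATCGCTT CTGGAAGGTG
ATATATCAGG CCGGGTTTAC CGACCGTCAG TTGAAGCCGC AGGAGGCACA GCATCTGCTG
GATTATCGTT GTGGCGTCAC CAAACTGGTT GACAGGCCAA CGGTGCAAGC CAGTGAAGTT
TCAAAGCAGG AGTTGCACGC AGGTGGGCGT AAGTTGATTG AAAAAATTGA GGATTATCAG
CCGCAGGCGT TGGCGATTCT GGGCAAACAA GCATATGAAC AGGGATTCAG CCAGCGCGGT
GCACAGTGGG GGAAACAAAC GCTCACCATT GGTTCGACGC AGATTTGGGT GCTGCCAAAT
CCCAGCGGTT TAAGTCGCGT TTCACTAGAG AAACTGGTTG AAGCGTATCG CGAGCTGGAC
CAGGCGCTGG TAGTGCGTGG GCGATAA
"""

PROTEIN = """
MVEDILAPGL RVVFCGINPG LSSAGTGFPF AHPANRFWKV IYQAGFTDRQ LKPQEAQHLL
DYRCGVTKLV DRPTVQASEV SKQELHAGGR KLIEKIEDYQ PQALAILGKQ AYEQGFSQRG
AQWGKQTLTI GSTQIWVLPN PSGLSRVSLE KLVEAYRELD QALVVRGR
"""


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, dropping a final stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)
    )
    # Only a terminal stop codon ('*') is removed; internal stops are kept.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    print(len(clean(GENE)), "bp")
    print(len(clean(PROTEIN)), "aa")
    print("translation matches record:", translate(GENE) == clean(PROTEIN))
```

Translating the listed gene sequence this way reproduces the listed protein sequence, and the cleaned lengths agree with the Gene Length and Protein Length fields.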