Gene Mpe_A3235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3235 
Symbol 
ID4786514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3439641 
End bp3440747 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content73% 
IMG OID640091808 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_001022423 
Protein GI124268419 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGCCG CCGTGTCCCC ACCGGCCCCG AGCGGCAGGA CGCCGGCTTC CGCGCCCGAC 
ACCGGCGCAA CCGCCACCAC GGCCAGCCTG GCCGAGCGCG TGGTGACGTG GCAGCGCACC
CAGGGTCGCC ACGGACTGCC CTGGCAGCGC GAGCGCGATC CCTACCGCGT GTGGCTGTCG
GAGATCATGC TGCAGCAGAC CCAGGTCAGC ACCGTGCTGA CCTACTATGT CCGCTTTCTC
GAACGCTTTC CGGATGTGGC CGCGCTGGCG CGCGCGGCGC TCGACGACGT GCTGGCCGCC
TGGGCTGGCC TGGGCTACTA CAGCCGCGCG CGCAATCTGC ACCGCTGCGC CCAGGCGGTG
ATGGCCGAGC ACGGCGGCCG CTTCCCGGCC AGCGCCGAGC AGCTCGCCAC GCTGCCCGGC
ATCGGCCGAT CGACCGCCGC CGCCATCGCC GCGTTCTGCT TCGGCGAGCG GGCCGCGATC
CTCGACGGGA ACGTGAAGCG CGTGTTGACG CGCGTCCTGG GCTTCAGCGC CGACCTCGCC
GTCGCCCGCC ACGAGCGCGG CCTGTGGGCT CGGGCCTGCG AGCTGCTGCC TCCGGCGTCG
GCCGACATGC CGACCTACAC CCAGGGGTTG ATGGACCTGG GTGCCACCGT CTGCCTGGCC
CGCAAGCCGA ACTGCCTGCT CTGCCCGCTT CAGGGCGACT GCGTGGCGCG ACGTGAGGGC
CGGCCCGAGG CCTACCCGGT GAAGACGCGC AAGCTCAAGC GCACCCGCCG CGAACACTGG
TGGCTGTGGC TGGAGCACGC TGGCGCGGTG TGGCTGCAGA AACGCCCGGC GACCGGCGTG
TGGGCCGGAC TGTGGAGCCT GCCACTGCTC GACGACGAAG CCGCGCTCGG TGCGGTGGTG
CAGCGCTGGC AGGTGCCGGT GGAGCCGCAG CCGCTGATCG AGCACGCACT GACCCACTTC
GACTGGACGC TGCACCCGCG GCGCGCGGTG CTGGACAGCG CAGAGGGTGT CGAGGCCGCG
CTGGGCCCCG GCCGCTGGAT CGCGCTCGAC GCCCTCGATA CCGTGGGGCT GCCGGCGCCG
CTGAAGAAGC TGCTCGCGGC GCGCTAA
 
Protein sequence
MSAAVSPPAP SGRTPASAPD TGATATTASL AERVVTWQRT QGRHGLPWQR ERDPYRVWLS 
EIMLQQTQVS TVLTYYVRFL ERFPDVAALA RAALDDVLAA WAGLGYYSRA RNLHRCAQAV
MAEHGGRFPA SAEQLATLPG IGRSTAAAIA AFCFGERAAI LDGNVKRVLT RVLGFSADLA
VARHERGLWA RACELLPPAS ADMPTYTQGL MDLGATVCLA RKPNCLLCPL QGDCVARREG
RPEAYPVKTR KLKRTRREHW WLWLEHAGAV WLQKRPATGV WAGLWSLPLL DDEAALGAVV
QRWQVPVEPQ PLIEHALTHF DWTLHPRRAV LDSAEGVEAA LGPGRWIALD ALDTVGLPAP
LKKLLAAR