Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3235 |
Symbol | |
ID | 4786514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3439641 |
End bp | 3440747 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640091808 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_001022423 |
Protein GI | 124268419 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGCCG CCGTGTCCCC ACCGGCCCCG AGCGGCAGGA CGCCGGCTTC CGCGCCCGAC ACCGGCGCAA CCGCCACCAC GGCCAGCCTG GCCGAGCGCG TGGTGACGTG GCAGCGCACC CAGGGTCGCC ACGGACTGCC CTGGCAGCGC GAGCGCGATC CCTACCGCGT GTGGCTGTCG GAGATCATGC TGCAGCAGAC CCAGGTCAGC ACCGTGCTGA CCTACTATGT CCGCTTTCTC GAACGCTTTC CGGATGTGGC CGCGCTGGCG CGCGCGGCGC TCGACGACGT GCTGGCCGCC TGGGCTGGCC TGGGCTACTA CAGCCGCGCG CGCAATCTGC ACCGCTGCGC CCAGGCGGTG ATGGCCGAGC ACGGCGGCCG CTTCCCGGCC AGCGCCGAGC AGCTCGCCAC GCTGCCCGGC ATCGGCCGAT CGACCGCCGC CGCCATCGCC GCGTTCTGCT TCGGCGAGCG GGCCGCGATC CTCGACGGGA ACGTGAAGCG CGTGTTGACG CGCGTCCTGG GCTTCAGCGC CGACCTCGCC GTCGCCCGCC ACGAGCGCGG CCTGTGGGCT CGGGCCTGCG AGCTGCTGCC TCCGGCGTCG GCCGACATGC CGACCTACAC CCAGGGGTTG ATGGACCTGG GTGCCACCGT CTGCCTGGCC CGCAAGCCGA ACTGCCTGCT CTGCCCGCTT CAGGGCGACT GCGTGGCGCG ACGTGAGGGC CGGCCCGAGG CCTACCCGGT GAAGACGCGC AAGCTCAAGC GCACCCGCCG CGAACACTGG TGGCTGTGGC TGGAGCACGC TGGCGCGGTG TGGCTGCAGA AACGCCCGGC GACCGGCGTG TGGGCCGGAC TGTGGAGCCT GCCACTGCTC GACGACGAAG CCGCGCTCGG TGCGGTGGTG CAGCGCTGGC AGGTGCCGGT GGAGCCGCAG CCGCTGATCG AGCACGCACT GACCCACTTC GACTGGACGC TGCACCCGCG GCGCGCGGTG CTGGACAGCG CAGAGGGTGT CGAGGCCGCG CTGGGCCCCG GCCGCTGGAT CGCGCTCGAC GCCCTCGATA CCGTGGGGCT GCCGGCGCCG CTGAAGAAGC TGCTCGCGGC GCGCTAA
|
Protein sequence | MSAAVSPPAP SGRTPASAPD TGATATTASL AERVVTWQRT QGRHGLPWQR ERDPYRVWLS EIMLQQTQVS TVLTYYVRFL ERFPDVAALA RAALDDVLAA WAGLGYYSRA RNLHRCAQAV MAEHGGRFPA SAEQLATLPG IGRSTAAAIA AFCFGERAAI LDGNVKRVLT RVLGFSADLA VARHERGLWA RACELLPPAS ADMPTYTQGL MDLGATVCLA RKPNCLLCPL QGDCVARREG RPEAYPVKTR KLKRTRREHW WLWLEHAGAV WLQKRPATGV WAGLWSLPLL DDEAALGAVV QRWQVPVEPQ PLIEHALTHF DWTLHPRRAV LDSAEGVEAA LGPGRWIALD ALDTVGLPAP LKKLLAAR
|
| |