Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0133 |
Symbol | alkA |
ID | 4899976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 127526 |
End bp | 128440 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640133363 |
Product | DNA-3-methyladenine glycosylase 2 |
Protein accession | YP_001064418 |
Protein GI | 126452165 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.221432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGGCCCG ACAGCGTCGT GCTCGAGCTG CCGTTCAAAT CGCCGTACGA TTGGCCGCGC GTGCTGCGCT TCTTCGCGGG GCGCGCGATT CCGGGCGTCG AGGCGGTCGG GGACGGCGCG TATCGGCGCA CGGTCGACCA TCACGGCGCG ATCGGCACGT TGACGGTGCG CAAGCATCCG CGCAAGCGCT GTCTTGTCGC GCTCGTCGAG GGCGATGCGG CGCGGCATGC GGACGCTGCA TTCGCCGCGC GGCTTTCGAC GATGTTCGAT TTGCAGGCCG ATCCGGCCGC GATCGGCGCG CATCTCGCGC GCGACGCGTG GTTCGCGCCG CTCGTCGGCG CCGCGCCCGG CCTGCGCGTG CCGGGCGCGT GGTCGGGCTT CGAGCTGATC GTGCGCGCGA TCGTCGGCCA GCAGGTGAGC GTGAAGGCCG CGACGACGAT CGTCGGGCGG CTCGTCGAGC GCGCGGGCGA GCGGCTTGCC CCGCACGCGC CGGGCGCGAT TGGCTGGCGG TTTCCGGAGC CCGCCGCGCT CGCCGCGTGC GACGTGTCGC GCATCGGGAT GCCGGGCAAG CGTGCCGCGG CGCTGCAGGG CGTCGCGCGC GCGGTCGCCT CGGGCGACGT GCCGCTCGAT GCGTACGCGA GCGATCCGGC CGGCGTGCGC GCCGCGCTGC TCGCGCTGCC GGGGATCGGC CCGTGGACCG TCGAATACGT CGCGATGCGC GCATGGCGCG ACGCCGACGC ATGGCCCGCG ACCGACCTCG TGCTGATGCA GGCGATCGTC GCGCGCGACC CGGCGCTAGA CCGGCCGGCA AGCCAGCGCC TGCGCGCCGA TGCGTGGCGG CCGTGGCGCG CGTATGCGGC GATGCATCTA TGGAACGAGA TCGCCGATCG CGCCGGTTCG GCGCGCGGCG GATAG
|
Protein sequence | MRPDSVVLEL PFKSPYDWPR VLRFFAGRAI PGVEAVGDGA YRRTVDHHGA IGTLTVRKHP RKRCLVALVE GDAARHADAA FAARLSTMFD LQADPAAIGA HLARDAWFAP LVGAAPGLRV PGAWSGFELI VRAIVGQQVS VKAATTIVGR LVERAGERLA PHAPGAIGWR FPEPAALAAC DVSRIGMPGK RAAALQGVAR AVASGDVPLD AYASDPAGVR AALLALPGIG PWTVEYVAMR AWRDADAWPA TDLVLMQAIV ARDPALDRPA SQRLRADAWR PWRAYAAMHL WNEIADRAGS ARGG
|
| |