Gene Information |
Locus tag | BURPS668_0117 |
Symbol | alkA |
ID | 4885075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 114211 |
End bp | 115152 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640126045 |
Product | DNA-3-methyladenine glycosylase II |
Protein accession | YP_001057172 |
Protein GI | 284159957 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | |
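
The length fields above follow from the coordinates by simple arithmetic. The sketch below is a minimal consistency check, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 114211
END_BP = 115152


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                   # 942
print(protein_length(gene_length(START_BP, END_BP)))   # 313
```

Both results agree with the Gene Length (942 bp) and Protein Length (313 aa) fields of the record.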
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000127704 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGCCG CCGTCGAACC GGCCGCCGTG CGGCCCGACA GCGTCGTGCT CGAGCTGCCG TTCAAATCGC CGTACGATTG GCCGCGCGTG CTGCGCTTCT TCGCGGGGCG CGCGATTCCG GGCGTCGAGG CGGTCGGGGA CGGCGCGTAT CGGCGCACGG TCGACCATCA CGGCGCGATC GGCACGTTGA CGGTGCGCAA GCATCCGCGC AAGCGCTGTC TTGTCGCGCT CGTCGAGGGC GATGCGGCGC GGCATGCGGA CGCTGCATTC GCCGCGCGGC TTTCGACGAT GTTCGATTTG CGGGCCGATC CGGCCGCGAT CGGTGCGCAT CTCGCGCGCG ACGCGTGGTT CGCGCCGCTC GTCGGCGCCG CGCCCGGCCT GCGCGTGCCG GGCGCGTGGT CGGGCTTCGA GCTGATCGTG CGCGCGATCG TCGGCCAGCA GGTGAGCGTG AAGGCCGCGA CGACGATCGT CGGGCGGCTC GTCGAGCGCG CGGGCGAGCG GCTTGCCCCG CACGCGCCGG GCGCGATCGG CTGGCGGTTT CCGGAGCCCG CCGCGCTCGC CGCGTGCGAC TTGTCTCGCA TCGGGATGCC GGGCAAGCGT GCCGCGGCGC TGCAGGGCGT CGCGCGCGCG GTCGCCGCGG GCGACGTGCC GCTCGATGCG TACGCGAGCG ATCCGGCCGG CGTGCGCGCC GCGCTGCTCG CGCTGCCGGG GATCGGCCCG TGGACCGTCG AATACGTCGC GATGCGCGCA TGGCGCGACG CCGACGCATG GCCCGCGACC GACCTCGTGT TGATGCAGGC GATCGTCGCG CGCGACCCGG CGCTCGACCG GCCGGCAAGC CAGCGCCTGC GCGCCGATGC GTGGCGGCCG TGGCGCGCGT ATGCGGCGAT GCATCTATGG AACGAGATCG CCGATCGCGC CGGTTCGGCG CGCGGCGGAT AG
|
Protein sequence | MSAAVEPAAV RPDSVVLELP FKSPYDWPRV LRFFAGRAIP GVEAVGDGAY RRTVDHHGAI GTLTVRKHPR KRCLVALVEG DAARHADAAF AARLSTMFDL RADPAAIGAH LARDAWFAPL VGAAPGLRVP GAWSGFELIV RAIVGQQVSV KAATTIVGRL VERAGERLAP HAPGAIGWRF PEPAALAACD LSRIGMPGKR AAALQGVARA VAAGDVPLDA YASDPAGVRA ALLALPGIGP WTVEYVAMRA WRDADAWPAT DLVLMQAIVA RDPALDRPAS QRLRADAWRP WRAYAAMHLW NEIADRAGSA RGG
|
| |
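
The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is consistent with translation table 11, which accepts TTG as an initiation codon. The sketch below shows one way to reproduce the Protein sequence and GC content fields from the Gene sequence field. It is a minimal illustration in plain Python, not the pipeline that produced this record; the helper names are hypothetical, and it assumes the first codon is always read as methionine when it is one of the table 11 start codons.

```python
# Standard amino-acid assignments (shared by NCBI tables 1 and 11),
# listed in TCAG order for the first, second and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces that separate the 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two 10-base blocks of the gene sequence above.
print(translate("TTGAGCGCCG CCGTCGAACC"))  # MSAAVE
```

Passing the full Gene sequence value to translate and gc_percent allows the Protein sequence and GC content fields to be checked against the record.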