Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1942 |
Symbol | |
ID | 3847637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2187894 |
End bp | 2188832 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637841611 |
Product | DNA-3-methyladenine glycosylase |
Protein accession | YP_442472 |
Protein GI | 83719392 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACGG CCAGAAAAGC GCCGGCTAAG CGCGCGGCGA CGCAGCCGCC GATTGCGGCC AAGCGGGCCG GCGCGCGTGC GCCGGCGGTG CACAAGGCGG GCGCGAAGCG CGCGGCGCCG AACGGCGCGG CCAATGGCGC CGCGCACAAG CCGAGCTTGC GCACGGCCGC CGCGAAGTCG GCGCGTGCGA AGGTCGATGG CGAGCACGTC GCCGACGCGT TGCCGCGCGA GGCCGGCGTT GCCGCGGCGG AGCCCGCCGC GCGCAAGGCG CGCGTATCGG AGGCCGCCGT GCCGGTGCAG CTCTCCGACG CCGAGACGGT CGCGCGTCCG CCGTACTGGG ACAAGGCGTG CGCCGATCTC GTCAAGCGCG ACCGGATTCT CAAGAAGCTG ATCCCGAAAT TCGGCCCGGC GCATCTCGTC AAGCGCGGCG ACTCGTTCGT CACGCTCGCG CGCTCGGTGG TCGGCCAGCA GATCTCGGTC GCCGCCGCGC AGTCGGTTTG GGTCAAGATC GAGACCGCGT GCCCGAAGCT CGCGCCGCCG CAGATCATCA AGCTCGGTCA GGAAAAACTG ATCGCGTGCG GCCTGTCGAA GCGCAAGTCC GAATATATCC TCGACCTCGC CCAGCACTTC GTGTCGGGCG CGCTGCACGT CGACAAATGG GCGTCGATGG ACGACGAGGA CGTGATCGCC GAGCTCACGC AGATTCGCGG GATCGGACGC TGGACGGCCG AGATGTTCCT GATCTTCAAT CTGTCGCGCC CGGACGTGCT GCCGCTCGAC GATCTGGGGC TCATTCGCGC GATCAGCGTC AATTATTTCA GCGGCGAGCC CGTCACACGC AGCGAGGCGC GTGAGGTGGC GGCGAACTGG GAGCCGTGGC GAACGGTCGC CACCTGGTAC ATGTGGCGCA GTCTCGATCC GCTGACCGCC AATAACTGA
|
Protein sequence | MATARKAPAK RAATQPPIAA KRAGARAPAV HKAGAKRAAP NGAANGAAHK PSLRTAAAKS ARAKVDGEHV ADALPREAGV AAAEPAARKA RVSEAAVPVQ LSDAETVARP PYWDKACADL VKRDRILKKL IPKFGPAHLV KRGDSFVTLA RSVVGQQISV AAAQSVWVKI ETACPKLAPP QIIKLGQEKL IACGLSKRKS EYILDLAQHF VSGALHVDKW ASMDDEDVIA ELTQIRGIGR WTAEMFLIFN LSRPDVLPLD DLGLIRAISV NYFSGEPVTR SEAREVAANW EPWRTVATWY MWRSLDPLTA NN
|
| |