Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1124 |
Symbol | |
ID | 3844799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1307326 |
End bp | 1308444 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637838427 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_439321 |
Protein GI | 83717638 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0470513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAACA TCGACAACCC GCAGCGCGAC CGGGAAGCCG GCTGCGCCGA CGCGACGCAG GACACGACGC GCATCGACGA CGTGCGCATC GGCGCGGTGC GCCCGCTCAT CTCGCCGGCG CTGCTGCAGG ACGAACTGCC CGTGCCGGGC GCCGTCCAGA CGCTCGTCGA AGCGAGCCGC GACGCGATCG GCGGCGTGCT GCACGGCCGC GACGATCGCC TGCTCGCGAT CGTCGGCCCG TGCTCGATCC ACGATCACGA CCAGGCGCTC GACTACGCGC GCCGGCTGAA GGCCGCCGCC GACGCGCTGC GCGACGACCT GCTGATCGTG ATGCGCGTGT ATTTCGAGAA GCCGCGCACG ACGGTCGGCT GGAAGGGCTA CATCAACGAT CCGCGCCTCG ACGGCAGCTT CCGCATCAAC GAAGGGCTGC GCGCCGCGCG CCGGCTGCTC ATCGACATCA ACGCGCTCGG CCTGCCCGCC GGCACCGAAT TCCTCGATCT GCTGAGCCCG CAGTACATCG CGGACCTGAT CGCGTGGGGC GCGATCGGCG CGCGCACGAC GGAGAGCCAG AGTCACCGGC AGCTCGCGTC GGGGCTGAGC TGCCCGATCG GCTTCAAGAA CGGCACCGAC GGCGGCGTGC AGGTCGCGGC CGACGCGATC GTCGCGGCGC GCGCGAGCCA CGCGTTCATG GGCATGACGA AGATGGGAAT GGCGGCGATC TTCGAGACGC GCGGCAACGA CACCGCGCAC GTGATCCTGC GCGGCGGCAA GAAGGGCCCG AACTACGATC GCGCGAGCAT CGACGAAGCG TGCGCGGCGC TGCGCGCGGC GGACCTGCGC GAACAGGTGA TGGTCGACTG CTCGCATGCG AATTCGAACA AGTCGCACCT GCGGCAGATC GACGTCGCCG AGGACCTCGC GCGGCAGTTG TCGGACGGCG AGCGGCGCAT TACCGGCGTG ATGGTCGAGA GCAACCTGGA GGCCGGCCGG CAGGATCTGA AGCCCGGCGC GCCGCTGCAA TACGGCGTGT CGATCACCGA CGCATGCCTG AGCTGGGCGC AGACCGAGCC CGTGCTCGAC ACGCTCGCGC AGGCGGTGCG GCGGCGGCGC ACCGCCTGA
|
Protein sequence | MQNIDNPQRD REAGCADATQ DTTRIDDVRI GAVRPLISPA LLQDELPVPG AVQTLVEASR DAIGGVLHGR DDRLLAIVGP CSIHDHDQAL DYARRLKAAA DALRDDLLIV MRVYFEKPRT TVGWKGYIND PRLDGSFRIN EGLRAARRLL IDINALGLPA GTEFLDLLSP QYIADLIAWG AIGARTTESQ SHRQLASGLS CPIGFKNGTD GGVQVAADAI VAARASHAFM GMTKMGMAAI FETRGNDTAH VILRGGKKGP NYDRASIDEA CAALRAADLR EQVMVDCSHA NSNKSHLRQI DVAEDLARQL SDGERRITGV MVESNLEAGR QDLKPGAPLQ YGVSITDACL SWAQTEPVLD TLAQAVRRRR TA
|
| |