Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_4683 |
Symbol | |
ID | 6279901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010676 |
Strand | - |
Start bp | 779105 |
End bp | 780223 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642615768 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001888421 |
Protein GI | 187919390 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.270793 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGCCA TCGACAATCC TTTGCACGAT CAGGAAGTCG GCTCGGCCGA CGCCACCCAG GACACCACGC GCATCGACGA CGTTCGGATC GGCGCGGTTC GTCCACTGAT TTCCCCTGCC CTGCTGCAGG ATGAGTTGCC CGTACCGCCT TCGGTGCAGA CGCTGGTCGA GAAGACCCGC GCGGAAATCG CCGACATTCT GCATGGCCGC GACGACAGGC TCGTGATGAT CGTCGGACCG TGTTCGATTC ACGACCACGA CCAGGCAATC GAATACGCGC ACAAACTGAA AGTCGCCGCC GACACCTACA AAGACGACTT GCTGATCGTC ATGCGAGTCT ACTTCGAGAA GCCGCGCACC ACGGTCGGCT GGAAAGGCTA TATCAACGAT CCGCGTCTGG ACGGCAGTTT CCGCATCAAC GAAGGTCTGC GTCTTGCGCG GCAACTGCTG CTCGACATCA ACGGTCTTGG CTTGGCCACG GCCACCGAGT TTCTCGATCT GCTGAGCCCG CAGTACATCG CCGATCTGAT CGCGTGGGGC GCCATCGGCG CGCGCACGAC GGAGAGCCAG AGCCATCGTC AGCTGGCATC GGGTCTGAGC TGCCCGATCG GTTTCAAGAA CGGCACGGAC GGCGGCGTGC AGATCGCCGC GGACGCGATC GTCGCCGCGC GCGCGAGCCA CGCGTTCATG GGCATGACGA AAATGGGCAT GGCCGCGATC TTCGAAACGC GCGGCAACGA CGACGCGCAC GTGATCCTGC GCGGCGGCAA GAAAGGGCCG AACTACGACA GCGCGTCGGT GGAAGCCACT TGCGAGGCGC TCAAATCAGC GGGTTTGCGT GAGCAGGTCA TGGTCGACTG TTCGCATGCA AACTCCGGCA AGTCGCATTT GCGTCAGCTG GAGGTGGTGC AGGATCTGAC GCAGCAACTG TCGCAGCGCG AGCGCCGCAT TATCGGCGTC ATGCTGGAAA GTCATCTGGA AGAAGGACGT CAGGATCTGA AGCCAGGTGT GCCGTTGCGT CACGGCGTGT CGATCACGGA TGCGTGCGTC AGCTGGACGC AGACCGAGCC GGCGCTCGAA ACACTCGCCG AGGCCACCCG CAAACGTCGC GCGGGCTGA
|
Protein sequence | MSAIDNPLHD QEVGSADATQ DTTRIDDVRI GAVRPLISPA LLQDELPVPP SVQTLVEKTR AEIADILHGR DDRLVMIVGP CSIHDHDQAI EYAHKLKVAA DTYKDDLLIV MRVYFEKPRT TVGWKGYIND PRLDGSFRIN EGLRLARQLL LDINGLGLAT ATEFLDLLSP QYIADLIAWG AIGARTTESQ SHRQLASGLS CPIGFKNGTD GGVQIAADAI VAARASHAFM GMTKMGMAAI FETRGNDDAH VILRGGKKGP NYDSASVEAT CEALKSAGLR EQVMVDCSHA NSGKSHLRQL EVVQDLTQQL SQRERRIIGV MLESHLEEGR QDLKPGVPLR HGVSITDACV SWTQTEPALE TLAEATRKRR AG
|
| |