Gene Information |
Locus tag | BTH_II1808 |
Symbol | |
ID | 3844733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 2183691 |
End bp | 2185481 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637839109 |
Product | short chain dehydrogenase |
Protein accession | YP_440002 |
Protein GI | 161723118 |
COG category | [I] Lipid transport and metabolism; [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities; [COG2267] Lysophospholipase |
TIGRFAM ID | |
|
|
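The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2183691
END_BP = 2185481


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1791, matches "Gene Length"
print(protein_length(length_bp))  # 596, matches "Protein Length"
```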
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.67175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCCC TTTCCGACGA AGCGCCGCTC GCGCTGTTCG AATCGGTTCA TACCGAAACC GCAGTCGCGG CCGGCGACGT CACGCTCGCC GCGAAGACCT GGGGCGACGC GTCGCGTCCC GCCGTCGTGC TCGTTCACGG CTATCCGGAC AGCAGCGAAG TCTGGCGCCG CGTCGCGCCC CTCCTCGCGA AGTCGTACTA CGTGATCGCG TACGACGTGC GCGGCGCGGG CCTGTCGACG AAGCCCGCGC GCACGGCCGA TTACCGGCTC GAGCGGCTCG TCGACGACTT CGTCGCGGTG ATCGACGCGC TCGCGCCGAA CCGCGCGGTA CACGTGATCG GCCACGACTG GGGCTCGATC CAGGGCTGGG AGTTCGTCAC CGAGCCGCGG CTCGCGGGGC GCATCCTGTC GTACACGTCG TGCTCCGGGC CGAATCTCGA CCACGTCGGC TACTGGCTGC GCCAGCAGCT CGCGCGGCCG TCGCCCGCGT CGATCAGGCG GCTCGCCGGC CAGCTCGTGC GCTCGTGGTA CGTGTACCTG TTTCACCTGC CGTTCATTCC CGAGCTCAAC TGGCGCCTGT GGCTCGGCCG CGCGTGGCCC GCGCTGATGC GCCGCATCGA GCGCACCGAC GTCGGCCCGC GCCCGACGCA GACCGAGGAC GGCGTGCACG GCGTGCGCCT GTATCGCGCG AACTTCATCC GCCGCGTGTT CGCGCCGCGC GAACGCTACG CGCACGCGCC CGTGCAGGTC GTCGTGCCGC TGCGCGACAA GTTCGTGAGC CCCGCGCTGT CGGCCGACAT CGCGCGCTGG GTGCCGACCT ATTACCGCCG CGAGGTGGCC GAGCGGCACT GGCTGCCGAT GTCGGACCCC GCACGCTTCG CGGCGCTCGC GCAGGAGTTG ATCGAAGCGG TCGAGACGGG CGTTCAGCCG CCCGCGCTCG CGCATGCGCG CCGCTTGAGC GGCACCGGTC CGTTCGTCGG CAAGCGCGTC GTGATCACCG GCGCGGGCAG CGGGATCGGC CGCTGCGCGG CCGTCGAATT CGCGAAGCAA GGCGCGTCGA TCGTCGCCGT CGACATCGAC GAGCAGGCGG CCGAGCGCAC CGCGCTCCTC GTGCGGCTGC TCGGCGCGCA GGCCGACGTG CGGCGCGTCG ACGTCGGTTC GGCCGACGAC ATGGAAGCGC TCGCGAACTG GGTCGGCGAG GAGCTGGGCG GCGCGGACGT CGTCGTCAAC AACGCCGGCA TCGGCATGGC GGGCGGCATC CTCGACACGT CGGCCGCGCA TTGGGAGCGC ATCCTGCGCG TGAACCTGTG GGGCGTGATC CACGGCTCGC GCCTGTTCGC CAAGCAGATG GCCGCGCGCG GCGCGGGCGG CCACATCGTC AACACCGCGT CGGCGGCCGC GTTCGGCCCG TCGCGCGACC TGCCCGCGTA CGCGACGACG AAGGCCGCGG TGCTGATGCT GAGCGAGTGC ATGCGCGCGG AGCTCGCGGA CCACGGGATC GGCGTGACGG CCGTCTGCCC CGGTTTCGCC GAGACGGGCA TCATGGCGTC GACCCAATAC GCGGCAGCAA AGAGCGCGCA GGACGAAGCG CGGCTGCGCA AGCGCGCGAC GAAGCTCTAC CAGATGCGCG GCCTGAAGCC GGAGAGCGTC GCGAAGGCGA TGGTCGACGG CGTGCTGCAG AACAAGCCGG TTGTCGCGAT CGGCGCGGAA GCGCACGCGA TGCGCTTCGT CGGGCGCTTC GCGCCGTGGC TCGGCCGGAT GATCGCCCGC GTCAGCATGG CGTCGCACTG A
|
Protein sequence | MQPLSDEAPL ALFESVHTET AVAAGDVTLA AKTWGDASRP AVVLVHGYPD SSEVWRRVAP LLAKSYYVIA YDVRGAGLST KPARTADYRL ERLVDDFVAV IDALAPNRAV HVIGHDWGSI QGWEFVTEPR LAGRILSYTS CSGPNLDHVG YWLRQQLARP SPASIRRLAG QLVRSWYVYL FHLPFIPELN WRLWLGRAWP ALMRRIERTD VGPRPTQTED GVHGVRLYRA NFIRRVFAPR ERYAHAPVQV VVPLRDKFVS PALSADIARW VPTYYRREVA ERHWLPMSDP ARFAALAQEL IEAVETGVQP PALAHARRLS GTGPFVGKRV VITGAGSGIG RCAAVEFAKQ GASIVAVDID EQAAERTALL VRLLGAQADV RRVDVGSADD MEALANWVGE ELGGADVVVN NAGIGMAGGI LDTSAAHWER ILRVNLWGVI HGSRLFAKQM AARGAGGHIV NTASAAAFGP SRDLPAYATT KAAVLMLSEC MRAELADHGI GVTAVCPGFA ETGIMASTQY AAAKSAQDEA RLRKRATKLY QMRGLKPESV AKAMVDGVLQ NKPVVAIGAE AHAMRFVGRF APWLGRMIAR VSMASH
|
| |
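The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute a GC fraction, and translate a CDS. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built in TCAG order; assignments are shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCAGCCCC TTTCCGACGA AGCGCCGCTC"
print(translate(first_blocks))  # MQPLSDEAPL, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the "GC content" and "Protein sequence" fields of the record.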