Gene Information |
Locus tag | BURPS1106A_A0820 |
Symbol | |
ID | 4904340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 815969 |
End bp | 817759 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640143926 |
Product | short chain dehydrogenase |
Protein accession | YP_001074856 |
Protein GI | 126455578 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities [COG2267] Lysophospholipase |
TIGRFAM ID | |
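The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative and not part of the record; it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    span = end_bp - start_bp + 1           # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                      # a CDS should be a whole number of codons
        return False
    translated_codons = span // 3 - 1      # drop the stop codon
    return translated_codons == protein_length_aa


# Values copied from the record for BURPS1106A_A0820.
consistent = check_cds_lengths(815969, 817759, 1791, 596)
print(consistent)
```

With the values listed here, the span is 817759 - 815969 + 1 = 1791 bp, which is 597 codons, or 596 amino acids once the stop codon is excluded.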
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCCC TTTCCGACGA AGCGCCGCTC GCGCTGTTCG AATCGGTTCA CACCGAAACC GCCGTCGCGG CCGGCGACGT CACGCTCGCC GCGAAGACCT GGGGCGACGC GTCGCGCCCC GCCGTCGTGC TCGTGCACGG CTATCCGGAC AACAGCGAAG TCTGGCGCCG CGTCGCGCCC CTGCTCGCGA AGTCGTACTA CGTGATCGCC TACGACGTGC GCGGCGCCGG GCTGTCGACC AAGCCCGCGC GCACGGCCGA CTATCGGCTC GAGCGGCTCG TCGACGATTT CGCGGCGGTG ATCGACGCGC TCGCGCCGAA TCGCGCGGTG CACGTGGTCG GCCACGACTG GGGCTCGATC CAGGGCTGGG AATTCGTCAC CGAGCCGCGC CTCGCCGGGC GCATCCTGTC GTACACGTCG TGCTCCGGAC CGAATCTCGA TCACGTCGGC TACTGGCTGC GCCAGCAGCT CGCGCGGCCC TCGCCCGCGT CGATCAAGCG CCTCGCCGGC CAGCTCGTGC GCTCATGGTA CGTGTACCTG TTTCACCTGC CGCTCATCCC CGAGCTCAAC TGGCGCCTGT GGCTCGGCCG CGCGTGGCCC GCGCTGATGC GCCGGCTCGA GCACACCGAC GTCGGCGTGC GCCCGACGCA GACCGAGGAC GGCGTGCACG GCGTGCGCCT GTATCGCGCG AACTTCCTCC GCCGCGTGCT CGCGCCGCGC GAGCGCTATG CGCACGCGCC CGTGCAAGTG GTCGTGCCGC TGCGCGACAA GTTCGTGAGC CCCGCGCTGT CGGCCGACAT CGCGCGTTGG GTGCCGACCT ATTACCGCCG CGAAGTGGCC GAGCGGCACT GGCTGCCGAT GTCGGAGCCG GCGCGCTTCG CCGCGCTCGC GCAGGAACTG ATCGAGGCGG TCGAGACGGG CGTGCAGCCG CCCGCGCTCG CGCATGCGCG CCGCAGGAGC GGCACGGGGC CGTTCGTCGG CAAACGCGTC GTGATCACCG GCGCGGGCAG CGGAATCGGC CGCTGCGCGG CCGTCGAATT CGCGAAGCAG GGCGCGTCGA TCGTCGCCGT CGACATCGAC GAGCAGGCGG CCGAGCGCAC CGCGCTGCTC GTGCGGCTGC TCGGCGCGCA GGCCGACGTG CGGCGCGTGG ACGTCGGCTC GGCCGACGAC ATGGAGGCGC TCGCGAACTG GGTGGGCGAC GAGCTGGGCG GCGCGGACGT CGTCGTCAAC AACGCGGGCA TCGGCATGGC GGGCGGCATC CTCGACACGT CGGCCGCGCA TTGGGAGCGC ATCCTGCGCG TGAACCTGTG GGGCGTGATC CACGGCTCGC GCCTGTTCGC CAAGCAGATG GCCGCGCGCG GCGCGGGCGG CCACATCGTC AACACCGCGT CGGCGGCCGC GTTCGGCCCG TCGCGCGACC TGCCCGCGTA CGCGACGACG AAGGCCGCGG TGCTGATGCT GAGCGAATGC ATGCGCGCGG AGCTCGCGGA CCACGGCATC GGCGTGACGG CGGTGTGCCC CGGCTTCGCG GAGACCGGCA TCATGGCGTC GACCCAATAC GCGGGCGCGA AGAGCGACCA GGACGAAGCG CGGCTGCGCA AGCGCGCGAC GAAGCTTTAC CAGATGCGCG GCCTGAAGCC GGAGACCGTC GCGAAGGCGA TGGTCGACGG CGTGCTGCAG AACCAACCCG TCGTCGCGAT CGGCGCGGAA GCGCATGCGA TGCGCTTCGT CGGGCGCTTC GCGCCGTGGC TCGGCCGGCT GATCGCCCGC GTCAGCATGG CGTCGCACTG A
|
Protein sequence | MQPLSDEAPL ALFESVHTET AVAAGDVTLA AKTWGDASRP AVVLVHGYPD NSEVWRRVAP LLAKSYYVIA YDVRGAGLST KPARTADYRL ERLVDDFAAV IDALAPNRAV HVVGHDWGSI QGWEFVTEPR LAGRILSYTS CSGPNLDHVG YWLRQQLARP SPASIKRLAG QLVRSWYVYL FHLPLIPELN WRLWLGRAWP ALMRRLEHTD VGVRPTQTED GVHGVRLYRA NFLRRVLAPR ERYAHAPVQV VVPLRDKFVS PALSADIARW VPTYYRREVA ERHWLPMSEP ARFAALAQEL IEAVETGVQP PALAHARRRS GTGPFVGKRV VITGAGSGIG RCAAVEFAKQ GASIVAVDID EQAAERTALL VRLLGAQADV RRVDVGSADD MEALANWVGD ELGGADVVVN NAGIGMAGGI LDTSAAHWER ILRVNLWGVI HGSRLFAKQM AARGAGGHIV NTASAAAFGP SRDLPAYATT KAAVLMLSEC MRAELADHGI GVTAVCPGFA ETGIMASTQY AGAKSDQDEA RLRKRATKLY QMRGLKPETV AKAMVDGVLQ NQPVVAIGAE AHAMRFVGRF APWLGRLIAR VSMASH
|
| |
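As a quick sanity check that the gene and protein sequences above correspond, the first 30 bases of the gene can be translated and compared with the first 10 residues of the protein. This is a minimal sketch, not a full translator: the `CODONS` dictionary is a deliberately partial table holding only the codons that occur in this prefix (their assignments are the same in translation table 11 and the standard code), and the name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons found in the first 30 bases of the gene.
# A complete translation would need all 64 codons of translation table 11.
CODONS = {
    "ATG": "M", "CAG": "Q", "CCC": "P", "CTT": "L", "TCC": "S",
    "GAC": "D", "GAA": "E", "GCG": "A", "CCG": "P", "CTC": "L",
}


def translate_prefix(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring the spaces that
    separate the 10-base blocks in the record. Raises KeyError for any
    codon outside the partial table."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3       # ignore a trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence and the first
# 10-residue block of the protein sequence, copied from the record.
gene_prefix = "ATGCAGCCCC TTTCCGACGA AGCGCCGCTC"
protein_prefix = "MQPLSDEAPL"

matches = translate_prefix(gene_prefix) == protein_prefix
print(matches)
```

The gene sequence is shown in coding orientation (it begins with the ATG start codon), so no reverse complement is needed even though the gene lies on the minus strand of the replicon.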