Gene BURPS1106A_A0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0820 
Symbol 
ID4904340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp815969 
End bp817759 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content72% 
IMG OID640143926 
Productshort chain dehydrogenase 
Protein accessionYP_001074856 
Protein GI126455578 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities
[COG2267] Lysophospholipase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCCC TTTCCGACGA AGCGCCGCTC GCGCTGTTCG AATCGGTTCA CACCGAAACC 
GCCGTCGCGG CCGGCGACGT CACGCTCGCC GCGAAGACCT GGGGCGACGC GTCGCGCCCC
GCCGTCGTGC TCGTGCACGG CTATCCGGAC AACAGCGAAG TCTGGCGCCG CGTCGCGCCC
CTGCTCGCGA AGTCGTACTA CGTGATCGCC TACGACGTGC GCGGCGCCGG GCTGTCGACC
AAGCCCGCGC GCACGGCCGA CTATCGGCTC GAGCGGCTCG TCGACGATTT CGCGGCGGTG
ATCGACGCGC TCGCGCCGAA TCGCGCGGTG CACGTGGTCG GCCACGACTG GGGCTCGATC
CAGGGCTGGG AATTCGTCAC CGAGCCGCGC CTCGCCGGGC GCATCCTGTC GTACACGTCG
TGCTCCGGAC CGAATCTCGA TCACGTCGGC TACTGGCTGC GCCAGCAGCT CGCGCGGCCC
TCGCCCGCGT CGATCAAGCG CCTCGCCGGC CAGCTCGTGC GCTCATGGTA CGTGTACCTG
TTTCACCTGC CGCTCATCCC CGAGCTCAAC TGGCGCCTGT GGCTCGGCCG CGCGTGGCCC
GCGCTGATGC GCCGGCTCGA GCACACCGAC GTCGGCGTGC GCCCGACGCA GACCGAGGAC
GGCGTGCACG GCGTGCGCCT GTATCGCGCG AACTTCCTCC GCCGCGTGCT CGCGCCGCGC
GAGCGCTATG CGCACGCGCC CGTGCAAGTG GTCGTGCCGC TGCGCGACAA GTTCGTGAGC
CCCGCGCTGT CGGCCGACAT CGCGCGTTGG GTGCCGACCT ATTACCGCCG CGAAGTGGCC
GAGCGGCACT GGCTGCCGAT GTCGGAGCCG GCGCGCTTCG CCGCGCTCGC GCAGGAACTG
ATCGAGGCGG TCGAGACGGG CGTGCAGCCG CCCGCGCTCG CGCATGCGCG CCGCAGGAGC
GGCACGGGGC CGTTCGTCGG CAAACGCGTC GTGATCACCG GCGCGGGCAG CGGAATCGGC
CGCTGCGCGG CCGTCGAATT CGCGAAGCAG GGCGCGTCGA TCGTCGCCGT CGACATCGAC
GAGCAGGCGG CCGAGCGCAC CGCGCTGCTC GTGCGGCTGC TCGGCGCGCA GGCCGACGTG
CGGCGCGTGG ACGTCGGCTC GGCCGACGAC ATGGAGGCGC TCGCGAACTG GGTGGGCGAC
GAGCTGGGCG GCGCGGACGT CGTCGTCAAC AACGCGGGCA TCGGCATGGC GGGCGGCATC
CTCGACACGT CGGCCGCGCA TTGGGAGCGC ATCCTGCGCG TGAACCTGTG GGGCGTGATC
CACGGCTCGC GCCTGTTCGC CAAGCAGATG GCCGCGCGCG GCGCGGGCGG CCACATCGTC
AACACCGCGT CGGCGGCCGC GTTCGGCCCG TCGCGCGACC TGCCCGCGTA CGCGACGACG
AAGGCCGCGG TGCTGATGCT GAGCGAATGC ATGCGCGCGG AGCTCGCGGA CCACGGCATC
GGCGTGACGG CGGTGTGCCC CGGCTTCGCG GAGACCGGCA TCATGGCGTC GACCCAATAC
GCGGGCGCGA AGAGCGACCA GGACGAAGCG CGGCTGCGCA AGCGCGCGAC GAAGCTTTAC
CAGATGCGCG GCCTGAAGCC GGAGACCGTC GCGAAGGCGA TGGTCGACGG CGTGCTGCAG
AACCAACCCG TCGTCGCGAT CGGCGCGGAA GCGCATGCGA TGCGCTTCGT CGGGCGCTTC
GCGCCGTGGC TCGGCCGGCT GATCGCCCGC GTCAGCATGG CGTCGCACTG A
 
Protein sequence
MQPLSDEAPL ALFESVHTET AVAAGDVTLA AKTWGDASRP AVVLVHGYPD NSEVWRRVAP 
LLAKSYYVIA YDVRGAGLST KPARTADYRL ERLVDDFAAV IDALAPNRAV HVVGHDWGSI
QGWEFVTEPR LAGRILSYTS CSGPNLDHVG YWLRQQLARP SPASIKRLAG QLVRSWYVYL
FHLPLIPELN WRLWLGRAWP ALMRRLEHTD VGVRPTQTED GVHGVRLYRA NFLRRVLAPR
ERYAHAPVQV VVPLRDKFVS PALSADIARW VPTYYRREVA ERHWLPMSEP ARFAALAQEL
IEAVETGVQP PALAHARRRS GTGPFVGKRV VITGAGSGIG RCAAVEFAKQ GASIVAVDID
EQAAERTALL VRLLGAQADV RRVDVGSADD MEALANWVGD ELGGADVVVN NAGIGMAGGI
LDTSAAHWER ILRVNLWGVI HGSRLFAKQM AARGAGGHIV NTASAAAFGP SRDLPAYATT
KAAVLMLSEC MRAELADHGI GVTAVCPGFA ETGIMASTQY AGAKSDQDEA RLRKRATKLY
QMRGLKPETV AKAMVDGVLQ NQPVVAIGAE AHAMRFVGRF APWLGRLIAR VSMASH