Gene BURPS668_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3893 
Symbol 
ID4881991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3791232 
End bp3792218 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID640129821 
Product3-oxoadipate enol-lactone hydrolase family protein 
Protein accessionYP_001060887 
Protein GI126438367 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.28749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTTCGA GATACGTCGG AGATAATGTA GCGCGACGAT TCCATCGCTC CGCACGAGAA 
AGCGGCAATC ATGACGATTG CCGCCGATTG CTTCGGCATG CGCATCGATT CCGCGCGTTC
GCCCGAACTC CCTTAACCGA TTCATTGCCT AGCGACCTTA TCGACATGCC TTTCGTCACG
ATCGATGGCC AGCCCCTGCA CTATCAGATC AGGGGCGCCG GCGCGCCCGT CCTGTTCGGA
CACAGCTACC TGTGGGATTC GTCGATGTGG GAGCCGCAGC TCGACGCGCT CTCGAAGTCG
TACCGCGTAA TCGCGCCGGA CCTGTGGGGA CACGGCCGGT CCGGCCCGCT GCCCGACGGC
ACGCGCAGCC TCGACGATCT CGCGAGACAG ATGAGCGAGC TCCTCGATCA CCTCGGCATC
GACACCTGCT CGATCGTCGG GCTATCGGTG GGCGGCATGT GGGCGGTGCC GCTCGCGCAT
CGCGCGCCGC AACGCATCGA TCGTCTCGTG CTGATGGATA CCTACGTCGG CGTCGAGCCC
GACGCGACGC GCAACCAGTA TTTCCAGATG CTCGAGGCCA TCGACGCGCA AGGCGCGATT
CCGGCGCCGC TGCTCGACGC GATCGTGCCG ATCTTCTTCC GCCCCGGCAT CGATCCGGCG
AGCGAGCTGC CCACGGGCTT CCGGCGCGCG CTGCAGGCGT TCACGACCGA GCGGCTGCGC
GACTCGGTGA TACCGCTCGG CAAGATCACG TTCGGCCGCG AAGACGCGCG CGCGCAACTG
AGCGCGCTGC CGGCGGACCG CACGCTCGTG ATGTGCGGCG CGAACGACGT CGCGCGGCCG
CCCGAGGAAG CCGACGAAAT CGCGGCGCTC ATCGGCTGCG AAAAGGCGTT CGTGCCGAAT
GCCGGACATA TCTCGAATCT CGAGAATCCG GCATTCGTCA CGCAGGCGCT GAGCGACTGG
CTCGGGCGCG GCGCGGCCCG CGCGTGA
 
Protein sequence
MISRYVGDNV ARRFHRSARE SGNHDDCRRL LRHAHRFRAF ARTPLTDSLP SDLIDMPFVT 
IDGQPLHYQI RGAGAPVLFG HSYLWDSSMW EPQLDALSKS YRVIAPDLWG HGRSGPLPDG
TRSLDDLARQ MSELLDHLGI DTCSIVGLSV GGMWAVPLAH RAPQRIDRLV LMDTYVGVEP
DATRNQYFQM LEAIDAQGAI PAPLLDAIVP IFFRPGIDPA SELPTGFRRA LQAFTTERLR
DSVIPLGKIT FGREDARAQL SALPADRTLV MCGANDVARP PEEADEIAAL IGCEKAFVPN
AGHISNLENP AFVTQALSDW LGRGAARA