Gene BURPS1106A_3835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3835 
Symbol 
ID4900789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3735596 
End bp3736873 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content70% 
IMG OID640137061 
ProductGDSL-like lipase/acylhydrolase domain-containing protein 
Protein accessionYP_001068056 
Protein GI126454674 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00947114 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCCC GACGCTGGCT TTCCGCTCTT TCGTGCTGTC TCGCTTTGGC TGCATCGCAA 
CCGGGGAGTG CCGCGCAAGC CGATGCGCCG GCGCGCTGGG TCGCGTCCTG GGCAACCGCA
CTGCAGCCGA TCCCGGATCT TGCCGCGCTG CCGCCGTTGT ATCGCGCGCC GGAGGTCGCG
GGGCGCACGG TCCGCCAGAT CGTCTATCCG ACGCTCGCGG GCAAGGCGGT TCGCATCCGC
GTAAGCAATG CGTACGGCAA GACGCCGCTC GCGATCGGCG AGATGAACAT CGGCCGGTCG
GCGGGCGGTG CGGCGGTTGC TGCGGGCAGC TCGACGGCGG TGACTTTCGG CGGCCGTCGC
GAAACGGAAG TGCCGCCGGG GCAGGAGCGG GACAGCGATC CCGTCGCGTA CGACGTGAGG
GCCGGCGAGC CGTACGCGCT CAGCCTGTAC CTGGGAAGCC GCCAGACGAT GACGGTCTGG
CACCGCGTAT CGAATCAGGT CAATTACGTG TCGGCGCCGG GTAACCACAC GGGCGACGCC
TCACCCGACG CGTTTCGCAC GCGCTTCACG CAATCCGCCT GGATCGCCGA GTTGGCGGTG
GCGGCGCGGC AGCCGGGCGC GGCGGCGATC GCGGCCGTCG GCGATTCGAT CACCGATGGC
CTGCGCTCGA GCCTGAACCG CAATCGCCGC TGGCCGGATG CGCTGGCGGC CCGGCTCGAG
CGCGCGGGCG CAGGCGACAT CGGCGTGGCG AATCTCGGCA TCAGCGGCAA TCGGCTGCTG
AGCGACTCGC GCTGCTACGG CATCGCGCTT GAGCGCCGCT TCGAGCGTGA CGTGCTGACG
CGCGCGGGCG TGAAGGTCGC GGTGCTGCTG ATTGGCATCA ACGACATCAA TTTCGCTGCG
ATGCCCGCCC GGTCCGGGCT CGACTGTGAT GCGCCGCATA CGCGGGTCGA CGCGCAAGCG
TTGATCGCGG GCTACCGCCG CGTGATCGCG GCTGCGCACG CGCGAGGCGT TGCGGTATTC
GGCGCGACGC TGACGCCGGC GTCGCTGCCG CCGGCGCGCG AAGCGATCCG TCGCGAAGTC
AACGAATGGA TTCGAACCTC GGGCGCCTTC GACGGCGTCG TGGATTTCGA CGCCGCGCTG
CGCGATCCGG CTAAGCCGTC GACATTGCTG CGTCGCTATA ACAGTGGCGA CGACATCCAC
CCGAGCGACG CCGGCTATGC GGCGATGGCC GAGGCGGTGC CGCTGGAGCG ACTGGCGGCG
GCGGCCGGGC GCCGCTGA
 
Protein sequence
MTSRRWLSAL SCCLALAASQ PGSAAQADAP ARWVASWATA LQPIPDLAAL PPLYRAPEVA 
GRTVRQIVYP TLAGKAVRIR VSNAYGKTPL AIGEMNIGRS AGGAAVAAGS STAVTFGGRR
ETEVPPGQER DSDPVAYDVR AGEPYALSLY LGSRQTMTVW HRVSNQVNYV SAPGNHTGDA
SPDAFRTRFT QSAWIAELAV AARQPGAAAI AAVGDSITDG LRSSLNRNRR WPDALAARLE
RAGAGDIGVA NLGISGNRLL SDSRCYGIAL ERRFERDVLT RAGVKVAVLL IGINDINFAA
MPARSGLDCD APHTRVDAQA LIAGYRRVIA AAHARGVAVF GATLTPASLP PAREAIRREV
NEWIRTSGAF DGVVDFDAAL RDPAKPSTLL RRYNSGDDIH PSDAGYAAMA EAVPLERLAA
AAGRR