Gene BURPS1106A_0438 details


Gene Information

Locus tag: BURPS1106A_0438
Symbol:
ID: 4902442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 403591
End bp: 404583
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 69%
IMG OID: 640133668
Product: fumarylacetoacetate hydrolase family protein
Protein accession: YP_001064721
Protein GI: 126454589
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID:
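
The coordinate and length fields above are consistent with each other. A minimal Python sketch of that arithmetic follows. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
START_BP = 403591
END_BP = 404583


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 993 330
```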


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.221432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTTG CTTCGCTCAA GGACGGCACG CGCGACGGCC AACTGATCGT CGTCTCGCGC 
GACCTGCACA CGGCGGCGAT CGCCGACGCG ATCGCGCCGA CGCTGCAGCG CGTGCTCGAC
GACTGGGCGT TCTACGCGCC GCAGCTGCGC GACCTGTACG ACGCACTGAA CCACGGCCGC
GCGCGCAACG CGTTCGCGTT CGAGCCCGCC GATTGCATGG CGCCGCTGCC GCGCGCGTTC
CAGTGGGCGG ACGGCTCCGC GTACGTGAAC CACGTCGAGC TCGTGCGCCG CGCGCGCGGC
GCCGAGATGC CGCCCGAGTT CTGGACCGAT CCGCTGATGT ACCAGGGCGG CAGCGACGAT
TTCCTCGGCC CGCGCGACGA CATCGTCTGC GCATCGGAGG CGTGGGGCAT CGATTTCGAG
GCGGAAGTCG CGGTGATCAC GGCCGACGTG CCGATGGGCG CCGCGCCCGA CGAGGCGCTG
AAAGCGGTGC GGCTCGTCAC GCTCGTGAAC GACGTGTCGC TGCGCAACCT GATTCCCGCC
GAGCTCGCGA AGGGCTTCGG CTTCTTCCAG AGCAAGCCGG CGAGCGCGTT CGCGCCGGTG
GCCGTGACGC CCGACGAGCT CGGCGAGCAC TGGCGCGAAG GCCGCCTGCA TCGCCCGATG
CTCGTCCACT GGAACGGCAA GAAGGTCGGT CAGCCGGATG CGGGCGTCGA CATGGTGTTT
CACTTCGGTC AACTGATCGC GCACGCGGCG AAGACGCGCA ACGTGCGCGC GGGCTCGATC
GTCGGCTCGG GCACGGTGTC GAACAAGGAT GCGAAGCGCG GCTACTGCTG CATCGCCGAG
AAGCGCTGCC TCGAGACGAT CGAGCACGGC GCGCCGCAGA CCGAGTTCAT GCGCTACGGC
GACAGGGTGA AGATCGAGAT GGTCGACGAG GCGGGGAAGT CGATCTTCGG CGCGATCGAG
CAGGCGGTCG CGCCGCTGGA CGCCGCCGCT TGA
 
Protein sequence
MKLASLKDGT RDGQLIVVSR DLHTAAIADA IAPTLQRVLD DWAFYAPQLR DLYDALNHGR 
ARNAFAFEPA DCMAPLPRAF QWADGSAYVN HVELVRRARG AEMPPEFWTD PLMYQGGSDD
FLGPRDDIVC ASEAWGIDFE AEVAVITADV PMGAAPDEAL KAVRLVTLVN DVSLRNLIPA
ELAKGFGFFQ SKPASAFAPV AVTPDELGEH WREGRLHRPM LVHWNGKKVG QPDAGVDMVF
HFGQLIAHAA KTRNVRAGSI VGSGTVSNKD AKRGYCCIAE KRCLETIEHG APQTEFMRYG
DRVKIEMVDE AGKSIFGAIE QAVAPLDAAA
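
The gene and protein sequences above can be checked against each other, and against the reported GC content, with a few lines of Python. This is a rough sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names `clean`, `gc_percent` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGAAACTTG CTTCGCTCAA GGACGGCACG"))  # prints: MKLASLKDGT
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the listed protein sequence and the reported GC content.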