Gene BURPS668_3173 details

Gene Information

Locus tag: BURPS668_3173
Symbol: fahA
ID: 4883431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3113276
End bp: 3114625
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 70%
IMG OID: 640129101
Product: fumarylacetoacetase
Protein accession: YP_001060185
Protein GI: 126441604
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR01266] fumarylacetoacetase
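
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(3113276, 3114625)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the record's values this gives 1350 bp and 449 aa, matching the Gene Length and Protein Length fields.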


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.96749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGCAA GGCCTCAAGA AACACTTCAA TCCGGAGCAA CGATGAGCGC TATTCCCGAC 
ACGCTGCGCG CGAGCCTCGA TCCGTCCCGC AAGAGCTGGC TCGATACGGC GAACGCGGCC
GCGTGCGACT TCCCGATCCA GAACCTGCCG TTCGGCATCT TCAGCGACGC GCGTGACGCG
TCGCGCCGCG CGGGCGTCGC GCTTGGCGAT CAGATCATCG ATCTCGCCGC GCTCGCGCGC
GCGGGGCTGC TGACGGTCGA CGGCGGGGCG GCCGTGTTCG CGCGGCCGGC GCTCAACGAT
TTCATCTCGC TCGGCCGCGA CGCATGGCGC AGCGTGCGCG CCCAGCTGAG CGCGCTTTTC
GAGCGCGGCG AAGCGCGGCT GCGCGACGAC GCGGCGTTGC GCGCGAAGGT GCTCGTCGCG
CAGCGCGACG CGGCGCTTCA TCTGCCCGTC GACATTCCCG GCTATACCGA TTTCTATTCG
TCGAAGGAGC ACGCGACGAA CGTCGGCTCG ATGTTTCGCG ATCCGAAGAA CGCGCTGCTG
CCGAACTGGT CGGAGATGCC GATCGGCTAC AACGGCCGCG CGTCGTCGGT CGTCGTGAGC
GGCACGCCGG TGCGCCGGCC GAACGGCCAG CTGAAGCTGC CCGACAGCGA GCGCCCGGTG
TTCGGCGCGT GCCGCAAGCT CGACATCGAG CTCGAGACGG GCTTCATCGT CGGCCGCGGC
AACGCGCTCG GCGAGCCGAT CGCGTGCGAG GATGCGGAGT CGCACATCTT CGGGATGGTG
CTGCTCAACG ACTGGAGCGC GCGCGACATC CAGCAATGGG AATACGTGCC GCTCGGGCCG
TTCAACGCGA AGACGTTCGC GACGTCGATC TCGCCGTGGA TCGTCACGCT CGACGCGCTC
GAGCCGTTTC GCACCGCGCA GCCGAGGCAG GAGCCGGAGC CGCTCGCGTA TCTGCGCCAC
GGCGGCGCGC ATGCGTTCGA CATCGAGCTC GAAGTGCGGC TGAGGCCGGA GGGCGCCGCC
GACGCGACGA CGATCGCGCA CACGAACTTC AGGCACATGT ACTGGACGAT GGCGCAGCAG
CTCGCGCACC ACACGGTGTC GGGCTGCAAC ACGCGGGTCG GCGACCTGAT GGGCTCGGGC
ACGATCAGCG GGCCGGCGAA GCAGGCGTTC GGCAGCCTGC TCGAGCTGAC GTGGAACGGC
AAGGAGCCCG TCTCGCTCGC GGGCGGCGGC ACGCGCGCGT TCATCGAGGA CGGCGACGAG
CTGACGCTGG CGGGCTGGTG CCAGGGCGAC GGGTATCGCG TCGGCTTCGG CACGTGCGTC
GGGAAGATTC TGCCGGCGCG GGGCTGGTGA
 
Protein sequence
MLARPQETLQ SGATMSAIPD TLRASLDPSR KSWLDTANAA ACDFPIQNLP FGIFSDARDA 
SRRAGVALGD QIIDLAALAR AGLLTVDGGA AVFARPALND FISLGRDAWR SVRAQLSALF
ERGEARLRDD AALRAKVLVA QRDAALHLPV DIPGYTDFYS SKEHATNVGS MFRDPKNALL
PNWSEMPIGY NGRASSVVVS GTPVRRPNGQ LKLPDSERPV FGACRKLDIE LETGFIVGRG
NALGEPIACE DAESHIFGMV LLNDWSARDI QQWEYVPLGP FNAKTFATSI SPWIVTLDAL
EPFRTAQPRQ EPEPLAYLRH GGAHAFDIEL EVRLRPEGAA DATTIAHTNF RHMYWTMAQQ
LAHHTVSGCN TRVGDLMGSG TISGPAKQAF GSLLELTWNG KEPVSLAGGG TRAFIEDGDE
LTLAGWCQGD GYRVGFGTCV GKILPARGW
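
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. The sketch below is self-contained and uses the standard codon-to-amino-acid assignments. Translation table 11 uses the same assignments and differs only in which codons may act as start codons, and this gene starts with ATG in any case. The `translate` helper is written for this illustration and is not taken from any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above. Each full line holds
# 60 bases (20 codons), so the last line also starts in frame.
first_line = "ATGCTGGCAA GGCCTCAAGA AACACTTCAA TCCGGAGCAA CGATGAGCGC TATTCCCGAC"
last_line = "GGGAAGATTC TGCCGGCGCG GGGCTGGTGA"

print(translate(first_line))
print(translate(last_line))
```

The two results correspond to the first 20 residues (MLARPQETLQ SGATMSAIPD) and the last 9 residues (GKILPARGW) of the protein sequence, with the terminal TGA read as the stop codon.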