Gene BMA10247_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_1920 
SymbolfahA 
ID4892723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009080 
Strand
Start bp1900804 
End bp1902111 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID640150575 
Productfumarylacetoacetase 
Protein accessionYP_001081457 
Protein GI126451307 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.882274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTA TTCCCGACAC GCTGCGCGCG AGCCTCGATC CGTCCCGCAA GAGCTGGCTC 
GATACGGCGA ACGCGGCCGC GTGCGACTTC CCGATCCAGA ACCTGCCGTT CGGCATCTTC
AGCGACGCGC GCGACGCGTC GCGCCGCGCG GGCGTCGCCC TTGGCGATCA GATCATCGAT
CTCGCCGCGC TCGCGCGCGC GGGGCTGCTG ACGGTCGACG GCGGGGCGGC CGTGTTCGCG
CGGCCGGCGC TCAACGATTT CATCTCGCTC GGCCGCGACG CATGGCGCAG CGTGCGCGCC
CAGCTGAGCG CGCTTTTCGA GCGCGGCGAC GCGCGGCTGC GCGACGACGC GGCGTTGCGC
GCGAAGGTGC TCGTCGCGCA GCGCGACGCG GCGCTTCATC TGCCCGTCGA CATTCCCGGC
TATACCGATT TCTATTCGTC GAAGGAGCAC GCGACGAACG TCGGCTCGAT GTTTCGCGAT
CCGAAGAACG CGCTGCTGCC GAACTGGTCG GAGATGCCGA TCGGCTACAA CGGCCGCGCG
TCGTCGGTCG TCGTGAGCGG CACGCCGGTG CGCCGGCCGA ACGGCCAGCT GAAGCTGCCC
GACAGCGAGC GCCCGGTGTT CGGCGCGTGC CGCAAGCTCG ACATCGAGCT CGAGACGGGC
TTCATCGTCG GCCGCGGCAA CGCGCTCGGC GAGCCGATCG CGTGCGAGGA TGCGGAGTCG
CACATCTTCG GGATGGTGCT GCTCAACGAC TGGAGCGCGC GCGACATCCA GCAATGGGAA
TACGTGCCGC TCGGGCCGTT CAACGCGAAG ACGTTCGCGA CGTCGATCTC GCCGTGGATC
GTCACGCTCG ATGCGCTCGA GCCGTTTCGC ACCGCGCAGC CGAGGCAGGA GCCGGAGCCG
CTCGCGTATC TGCGCCACGG CGGCGCGCAT GCGTTCGACA TCGAGCTCGA AGTGCGGCTG
AGGCCGGAGG GCGCCGCCGA CGCGACGACG ATCGCGCGCA CGAACTTCAG GCACATGTAC
TGGACGATGG CGCAGCAGCT CGCGCACCAC ACGGTGTCGG GCTGCAACAC GCGGGTCGGC
GACCTGATGG GCTCGGGCAC GATCAGCGGG CCGGCGAAGC AGGCGTTCGG CAGCCTGCTC
GAGCTGACGT GGAACGGCAA GGAGCCCGTC TCGCTCGCGG GCGGCGGCAC GCGCGCGTTC
ATCGAGGACG GCGACGAGCT GACGCTGGCG GGCTGGTGCC AGGGCGACGG GTATCGCGTC
GGCTTCGGCA CGTGCGTCGG GGAGATTCTG CCGGCGCGGG GCCGGTGA
 
Protein sequence
MSAIPDTLRA SLDPSRKSWL DTANAAACDF PIQNLPFGIF SDARDASRRA GVALGDQIID 
LAALARAGLL TVDGGAAVFA RPALNDFISL GRDAWRSVRA QLSALFERGD ARLRDDAALR
AKVLVAQRDA ALHLPVDIPG YTDFYSSKEH ATNVGSMFRD PKNALLPNWS EMPIGYNGRA
SSVVVSGTPV RRPNGQLKLP DSERPVFGAC RKLDIELETG FIVGRGNALG EPIACEDAES
HIFGMVLLND WSARDIQQWE YVPLGPFNAK TFATSISPWI VTLDALEPFR TAQPRQEPEP
LAYLRHGGAH AFDIELEVRL RPEGAADATT IARTNFRHMY WTMAQQLAHH TVSGCNTRVG
DLMGSGTISG PAKQAFGSLL ELTWNGKEPV SLAGGGTRAF IEDGDELTLA GWCQGDGYRV
GFGTCVGEIL PARGR