Gene BURPS1106A_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3212 
SymbolfahA 
ID4900949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3129481 
End bp3130788 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID640136438 
Productfumarylacetoacetase 
Protein accessionYP_001067450 
Protein GI126454358 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTA TTCCCGACAC GCTGCGCGCG AGCCTCGATC CGTCCCGCAA GAGCTGGCTC 
GATACGGCGA ACGCGGCCGC GTGCGACTTC CCGATCCAGA ACCTGCCGTT CGGCGTCTTC
AGCGACGCGC GCGACGCGTC GCGCCGCGCG GGCGTCGCGC TTGGCGATCA GATCATCGAT
CTCGCCGCGC TCGCGCGCGC GGGGCTGCTG ACGGTCGACG GCGGGGCGGC CGTGTTCGCG
CGGCCGGCGC TCAACGATTT CATCTCGCTC GGCCGCGACG CATGGCGCAG CGTGCGCGTC
CAGCTGAGCG CGCTTTTCGA GCGCGGCGAC GCGCGGCTGC GCGACGACGC GGCGTTGCGC
GCGAAGGTGC TCGTCGCGCA GCGCGACGCG GCGCTTCATC TGCCCGTCGA CATTCCCGGC
TATACCGATT TCTATTCGTC GAAGGAGCAC GCGACGAACG TCGGCTCGAT GTTTCGCGAT
CCGAAGAACG CGCTGCTGCC GAACTGGTCG GAGATGCCGA TCGGCTACAA CGGCCGCGCG
TCGTCGGTCG TCGTGAGCGG CACGCCGGTG CGCCGGCCGA ACGGCCAGCT GAAGCTGCCC
GACAGCGAGC GCCCGGTGTT CGGCGCGTGC CGCAAGCTCG ACATCGAGCT CGAGACGGGC
TTCATCGTCG GCCGCGGCAA CGCGCTCGGC GAGCCGATCG CGTGCGAGGA TGCGGAGTCG
CACATCTTCG GGATGGTGCT GCTCAACGAC TGGAGCGCGC GCGACATCCA GCAATGGGAA
TACGTGCCGC TCGGGCCGTT CAACGCGAAG ACGTTCGCGA CGTCGATCTC GCCGTGGATC
GTCACGCTCG ATGCGCTCGA GCCGTTTCGC ACCGCGCAGC CGAGGCAGGA GCCAGAGCCG
CTCGCGTATC TGCGCCACGG CGGCGCGCAT GCGTTCGACA TCGAGCTCGA AGTGCGGCTG
AGGCCGGAGG GCGCCGCCGA CGCGACGACG ATCGCGCGCA CGAACTTCAG GCACATGTAC
TGGACGATGG CGCAGCAGCT CGCGCACCAC ACGGTGTCGG GCTGCAACAC GCGGGTCGGC
GACCTGATGG GCTCGGGCAC GATCAGCGGG CCGGCGAAGC AGGCGTTCGG CAGCCTGCTC
GAGCTGACGT GGAACGGCAA GGAGCCCGTC TCGCTCGCGG GCGGCGGCAC GCGCGCGTTC
ATCGAGGACG GCGACGAGCT GACGCTGGCG GGCTGGTGCC AGGGCGACGG GTATCGCGTC
GGCTTCGGCA CGTGCGTCGG GGAGATTCTG CCGGCGCGGG GCCGGTGA
 
Protein sequence
MSAIPDTLRA SLDPSRKSWL DTANAAACDF PIQNLPFGVF SDARDASRRA GVALGDQIID 
LAALARAGLL TVDGGAAVFA RPALNDFISL GRDAWRSVRV QLSALFERGD ARLRDDAALR
AKVLVAQRDA ALHLPVDIPG YTDFYSSKEH ATNVGSMFRD PKNALLPNWS EMPIGYNGRA
SSVVVSGTPV RRPNGQLKLP DSERPVFGAC RKLDIELETG FIVGRGNALG EPIACEDAES
HIFGMVLLND WSARDIQQWE YVPLGPFNAK TFATSISPWI VTLDALEPFR TAQPRQEPEP
LAYLRHGGAH AFDIELEVRL RPEGAADATT IARTNFRHMY WTMAQQLAHH TVSGCNTRVG
DLMGSGTISG PAKQAFGSLL ELTWNGKEPV SLAGGGTRAF IEDGDELTLA GWCQGDGYRV
GFGTCVGEIL PARGR