Gene BMA10229_A2688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A2688 
SymbolhmgA 
ID4792002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp2715390 
End bp2716742 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content67% 
IMG OID 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001028640 
Protein GI124386559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.515143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGAA CGACAATCAT GACATTGGAT TTTTCGAAAC CGGGCGAAGC CGGCTATCAG 
AGCGGCTTCG CGAACGAATT CGCGACCGAG GCGCTGCCGG GCGCGTTGCC GCACGCGCGC
AACTCGCCGC AGCGCGCGCC GTACGGGCTC TACGCGGAGC AGTTCTCCGG CACCGCGTTC
ACCGCGCCGC GCGGCCACAA CCGCCGCTCG TGGCTGTACC GGATCCGGCC CGCCGCCGTG
CATCGGCCGT TCGAGCTCGT GTCGGGCGAG CGCCGGATCG TCGCCGAGTT CGGCGATTCG
GACGACGTGC CGCCGACGCC GCCGAACCAG TTGCGCTGGG ATCCGCTGCC GATGCCCGCG
CAGCCGACCG ATTTCGTCGA CGGCTGGGTG ACGATGGCGG GCAACGGCTC GGCCGCCGCG
ATGAGCGGCT GCGCGATCCA CCTGTACGCG GCGAACCGCT CGATGCGCGA GCGCTTCTTC
TACAGCGCGG ACGGCGAGCT GCTGATCGTG CCGCAGGAAG GGCGCCTCTT CATCATGACG
GAGCTCGGCC GGCTCGATGT CGAGCCGTTC GAGATCGCGG TGATCCCGCG CGGCGTGCGC
TTCGCGGTCG CGCTGCCGGA CGGGCGCGCG CGCGGCTATG TGTGCGAGAA CTTCGGTGCG
CTGCTCAGGC TGCCGGACCT CGGGCCGATC GGCTCGAACG GCCTCGCGAA TCCGCGCGAC
TTCCTCACGC CGCACGCGTC GTACGAGGAT CGCGAAGGCG CGTTCGAGCT CGTCGCGAAG
CTGAATGGCC GGCTCTGGCG CGCGGACATC GATCATTCGC CGTTCGACGT CGTCGCGTGG
CACGGCAACT ACGCGCCGTA CAAGTACGAC CTGCGCCACT TCAACACGAT CGGCTCGATC
AGCTACGATC ATCCGGACCC GTCGATCTTC CTCGTGCTGC AGTCGCAAAG CGATACGCCG
GGCGTCGACG CGATCGACTT CGTGATCTTC CCGCCGCGCT GGCTCGCGGC CGAGGATACG
TTCCGCCCGC CTTGGTTCCA CCGCAACGTC GCGAGCGAAT TCATGGGGCT CGTGCACGGC
GTCTACGACG CGAAGGCCGA AGGCTTCGTG CCGGGCGGCG CGAGCCTGCA CAACTGCATG
TCCGGCCACG GGCCCGACGC GGACACGTTC GAGAAGGCTT CTTCGATCGA CACGTCGAAG
CCGAACAAGG TCGGCGACAC GATGGCGTTC ATGTTCGAGA CCCGCACGCT GATCCGGCCG
ACGCGCTTCG CGCTCGACAC CGCGCAACTG CAGGCGAACT ACTTCGAATG CTGGCAAGGC
CTCAAGAAAC ACTTCAATCC GGAGCAACGA TGA
 
Protein sequence
MERTTIMTLD FSKPGEAGYQ SGFANEFATE ALPGALPHAR NSPQRAPYGL YAEQFSGTAF 
TAPRGHNRRS WLYRIRPAAV HRPFELVSGE RRIVAEFGDS DDVPPTPPNQ LRWDPLPMPA
QPTDFVDGWV TMAGNGSAAA MSGCAIHLYA ANRSMRERFF YSADGELLIV PQEGRLFIMT
ELGRLDVEPF EIAVIPRGVR FAVALPDGRA RGYVCENFGA LLRLPDLGPI GSNGLANPRD
FLTPHASYED REGAFELVAK LNGRLWRADI DHSPFDVVAW HGNYAPYKYD LRHFNTIGSI
SYDHPDPSIF LVLQSQSDTP GVDAIDFVIF PPRWLAAEDT FRPPWFHRNV ASEFMGLVHG
VYDAKAEGFV PGGASLHNCM SGHGPDADTF EKASSIDTSK PNKVGDTMAF MFETRTLIRP
TRFALDTAQL QANYFECWQG LKKHFNPEQR