Gene BURPS1710b_3228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3228 
SymbolhmgA 
ID3688297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3539426 
End bp3540778 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content67% 
IMG OID637729683 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_334600 
Protein GI76808659 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGAA CGACAATCAT GACATTGGAT TTTTCGAAAC CGGGCGAAGC CGGCTATCAG 
AGCGGCTTCG CGAACGAATT CGCGACCGAG GCGCTGCCGG GCGCGTTGCC GCACGCGCGC
AACTCGCCGC AGCGCGCGCC GTACGGGCTC TACGCGGAGC AGTTCTCCGG CACCGCGTTC
ACCGCGCCGC GCGGCCACAA CCGCCGCTCG TGGCTGTACC GGATCCGGCC CGCCGCCGTG
CATCGGCCGT TCGAGCTCGT GTCGGGCGAG CGCCGGATCG TCGCCGAGTT CGGCGATTCG
GACGACGTGC CGCCGACGCC GCCGAACCAG TTGCGCTGGG ATCCGCTGCC GATGCCCGCG
CAGCCGACCG ATTTCGTCGA CGGCTGGGTG ACGATGGCGG GCAACGGCTC GGCCGCCGCG
ATGAGCGGCT GCGCGATCCA CCTGTACGCG GCGAACCGCT CGATGCGCGA GCGCTTCTTC
TACAGCGCGG ACGGCGAGCT GCTGATCGTG CCGCAGGAAG GGCGCCTCTT CATCATGACG
GAGCTCGGCC GGCTCGACGT CGAGCCGTTC GAGATCGCGG TGATCCCGCG CGGCGTGCGC
TTCGCGGTCG CGCTGCCGGA CGGGCGCGCG CGCGGCTATG TATGCGAGAA CTTCGGTGCG
CTGCTCAGGC TGCCGGACCT CGGGCCGATC GGCTCGAACG GCCTCGCGAA TCCGCGCGAC
TTCCTCACGC CGCACGCGTC GTACGAGGAT CGCGAAGGCG CGTTCGAGCT CGTCGCGAAG
CTGAATGGCC GGCTCTGGCG CGCGGACATC GATCATTCGC CGTTCGACGT CGTCGCGTGG
CACGGCAACT ACGCGCCGTA CAAGTACGAC CTGCGTCACT TCAACACGAT CGGCTCGATC
AGCTACGATC ATCCGGACCC GTCGATCTTC CTCGTGCTGC AGTCGCAAAG CGATACGCCG
GGCGTCGACG CGATCGACTT CGTGATCTTC CCCCCGCGCT GGCTCGCGGC CGAGGATACG
TTCCGCCCGC CTTGGTTCCA CCGCAACGTC GCGAGCGAAT TCATGGGGCT CGTGCACGGC
GTCTACGACG CGAAGGCCGA AGGCTTCGTG CCGGGCGGCG CGAGCCTGCA CAACTGCATG
TCCGGCCACG GGCCCGACGC GGACACGTTC GAGAAGGCTT CTTCGATCGA CACGTCGAAG
CCGAACAAGG TCGGCGACAC GATGGCGTTC ATGTTCGAGA CCCGCACGCT GATCCGGCCG
ACGCGCTTCG CGCTCGACAC CGCGCAACTG CAGGCGAACT ACTTCGAATG CTGGCAAGGC
CTCAAGAAAC ACTTCAATCC GGAGCAACGA TGA
 
Protein sequence
MERTTIMTLD FSKPGEAGYQ SGFANEFATE ALPGALPHAR NSPQRAPYGL YAEQFSGTAF 
TAPRGHNRRS WLYRIRPAAV HRPFELVSGE RRIVAEFGDS DDVPPTPPNQ LRWDPLPMPA
QPTDFVDGWV TMAGNGSAAA MSGCAIHLYA ANRSMRERFF YSADGELLIV PQEGRLFIMT
ELGRLDVEPF EIAVIPRGVR FAVALPDGRA RGYVCENFGA LLRLPDLGPI GSNGLANPRD
FLTPHASYED REGAFELVAK LNGRLWRADI DHSPFDVVAW HGNYAPYKYD LRHFNTIGSI
SYDHPDPSIF LVLQSQSDTP GVDAIDFVIF PPRWLAAEDT FRPPWFHRNV ASEFMGLVHG
VYDAKAEGFV PGGASLHNCM SGHGPDADTF EKASSIDTSK PNKVGDTMAF MFETRTLIRP
TRFALDTAQL QANYFECWQG LKKHFNPEQR