Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_3228 |
Symbol | hmgA |
ID | 3688297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 3539426 |
End bp | 3540778 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637729683 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_334600 |
Protein GI | 76808659 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGAA CGACAATCAT GACATTGGAT TTTTCGAAAC CGGGCGAAGC CGGCTATCAG AGCGGCTTCG CGAACGAATT CGCGACCGAG GCGCTGCCGG GCGCGTTGCC GCACGCGCGC AACTCGCCGC AGCGCGCGCC GTACGGGCTC TACGCGGAGC AGTTCTCCGG CACCGCGTTC ACCGCGCCGC GCGGCCACAA CCGCCGCTCG TGGCTGTACC GGATCCGGCC CGCCGCCGTG CATCGGCCGT TCGAGCTCGT GTCGGGCGAG CGCCGGATCG TCGCCGAGTT CGGCGATTCG GACGACGTGC CGCCGACGCC GCCGAACCAG TTGCGCTGGG ATCCGCTGCC GATGCCCGCG CAGCCGACCG ATTTCGTCGA CGGCTGGGTG ACGATGGCGG GCAACGGCTC GGCCGCCGCG ATGAGCGGCT GCGCGATCCA CCTGTACGCG GCGAACCGCT CGATGCGCGA GCGCTTCTTC TACAGCGCGG ACGGCGAGCT GCTGATCGTG CCGCAGGAAG GGCGCCTCTT CATCATGACG GAGCTCGGCC GGCTCGACGT CGAGCCGTTC GAGATCGCGG TGATCCCGCG CGGCGTGCGC TTCGCGGTCG CGCTGCCGGA CGGGCGCGCG CGCGGCTATG TATGCGAGAA CTTCGGTGCG CTGCTCAGGC TGCCGGACCT CGGGCCGATC GGCTCGAACG GCCTCGCGAA TCCGCGCGAC TTCCTCACGC CGCACGCGTC GTACGAGGAT CGCGAAGGCG CGTTCGAGCT CGTCGCGAAG CTGAATGGCC GGCTCTGGCG CGCGGACATC GATCATTCGC CGTTCGACGT CGTCGCGTGG CACGGCAACT ACGCGCCGTA CAAGTACGAC CTGCGTCACT TCAACACGAT CGGCTCGATC AGCTACGATC ATCCGGACCC GTCGATCTTC CTCGTGCTGC AGTCGCAAAG CGATACGCCG GGCGTCGACG CGATCGACTT CGTGATCTTC CCCCCGCGCT GGCTCGCGGC CGAGGATACG TTCCGCCCGC CTTGGTTCCA CCGCAACGTC GCGAGCGAAT TCATGGGGCT CGTGCACGGC GTCTACGACG CGAAGGCCGA AGGCTTCGTG CCGGGCGGCG CGAGCCTGCA CAACTGCATG TCCGGCCACG GGCCCGACGC GGACACGTTC GAGAAGGCTT CTTCGATCGA CACGTCGAAG CCGAACAAGG TCGGCGACAC GATGGCGTTC ATGTTCGAGA CCCGCACGCT GATCCGGCCG ACGCGCTTCG CGCTCGACAC CGCGCAACTG CAGGCGAACT ACTTCGAATG CTGGCAAGGC CTCAAGAAAC ACTTCAATCC GGAGCAACGA TGA
|
Protein sequence | MERTTIMTLD FSKPGEAGYQ SGFANEFATE ALPGALPHAR NSPQRAPYGL YAEQFSGTAF TAPRGHNRRS WLYRIRPAAV HRPFELVSGE RRIVAEFGDS DDVPPTPPNQ LRWDPLPMPA QPTDFVDGWV TMAGNGSAAA MSGCAIHLYA ANRSMRERFF YSADGELLIV PQEGRLFIMT ELGRLDVEPF EIAVIPRGVR FAVALPDGRA RGYVCENFGA LLRLPDLGPI GSNGLANPRD FLTPHASYED REGAFELVAK LNGRLWRADI DHSPFDVVAW HGNYAPYKYD LRHFNTIGSI SYDHPDPSIF LVLQSQSDTP GVDAIDFVIF PPRWLAAEDT FRPPWFHRNV ASEFMGLVHG VYDAKAEGFV PGGASLHNCM SGHGPDADTF EKASSIDTSK PNKVGDTMAF MFETRTLIRP TRFALDTAQL QANYFECWQG LKKHFNPEQR
|
| |