Gene Information |
Locus tag | BURPS668_3174 |
Symbol | hmgA |
ID | 4883711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 3114580 |
End bp | 3115914 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640129102 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001060186 |
Protein GI | 126440702 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
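The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(3114580, 3115914)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```

Both results agree with the Gene Length and Protein Length rows. The strand does not affect this check, since only the span of the interval matters.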
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.666135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTGG ATTTTTCGAA ACCGGGCGAA GCCGGCTATC AGAGCGGCTT CGCGAACGAA TTCGCGACCG AGGCGCTGCC GGGCGCGTTG CCGCACGCGC GCAACTCGCC GCAGCGCGCG CCGTACGGGC TCTACGCGGA GCAGTTCTCC GGCACCGCGT TCACTGCGCC GCGCGGCCAC AACCGCCGCT CGTGGCTGTA CCGGATCCGG CCCGCCGCCG TGCATCGGCC GTTCGAGCTC GTGTCGGGCG AGCGCCGGAT CGTCGCCGAG TTCGGCGATT CGGACGACGT GCCGCCGACG CCGCCGAACC AGTTGCGCTG GGATCCGCTG CCGATGCCCG CGCAGCCGAC CGATTTCGTC GACGGCTGGG TGACGATGGC GGGCAACGGC TCGGCTGCCG CGATGAGCGG CTGCGCGATC CACCTGTACG CGGCGAACCG CTCGATGCGC GAGCGCTTCT TCTACAGCGC GGACGGCGAG CTGCTGATCG TGCCGCAGGA AGGGCGCCTC TTCATCATGA CGGAGCTCGG CCGGCTCGAC GTCGAGCCGT TCGAGATCGC GGTGATCCCG CGCGGCGTGC GCTTCGCGGT CGCGCTGCCG GACGGGCGCG CGCGCGGCTA TGTATGCGAG AACTTCGGTG CGCTGCTCAG GCTGCCGGAC CTCGGGCCGA TCGGCTCGAA CGGCCTCGCG AATCCGCGCG ACTTCCTCAC GCCGCACGCG TCGTACGAGG ATCGCGAAGG CGCGTTCGAG CTCGTCGCGA AGCTGAATGG CCGGCTCTGG CGCGCGGACA TCGATCATTC GCCGTTCGAC GTCGTCGCGT GGCACGGCAA CTACGCGCCG TACAAGTACG ACCTGCGCCA CTTCAACACG ATCGGCTCGA TCAGCTACGA TCATCCGGAC CCGTCGATCT TCCTCGTGCT GCAGTCGCAA AGCGATACGC CGGGCGTCGA CGCGATCGAC TTCGTGATCT TCCCGCCGCG CTGGCTCGCG GCCGAGGATA CGTTCCGCCC GCCTTGGTTC CACCGCAACG TCGCGAGCGA ATTCATGGGG CTCGTGCACG GCGTCTACGA CGCGAAGGCC GAAGGCTTCG TGCCGGGCGG CGCGAGCCTG CACAACTGCA TGTCCGGCCA CGGGCCCGAC GCGGACACGT TCGAGAAGGC TTCTTCGATC GACACGTCGA AGCCGAACAA GGTCGGCGAC ACGATGGCGT TCATGTTCGA GACCCGCACG CTGATCCGGC CGACGCGCTT CGCGCTCGAC ACCGCGCAAC TGCAGGCGAA CTACTTCGAA TGCTGGCAAG GCCTCAAGAA ACACTTCAAT CCGGAGCAAC GATGA
|
Protein sequence | MTLDFSKPGE AGYQSGFANE FATEALPGAL PHARNSPQRA PYGLYAEQFS GTAFTAPRGH NRRSWLYRIR PAAVHRPFEL VSGERRIVAE FGDSDDVPPT PPNQLRWDPL PMPAQPTDFV DGWVTMAGNG SAAAMSGCAI HLYAANRSMR ERFFYSADGE LLIVPQEGRL FIMTELGRLD VEPFEIAVIP RGVRFAVALP DGRARGYVCE NFGALLRLPD LGPIGSNGLA NPRDFLTPHA SYEDREGAFE LVAKLNGRLW RADIDHSPFD VVAWHGNYAP YKYDLRHFNT IGSISYDHPD PSIFLVLQSQ SDTPGVDAID FVIFPPRWLA AEDTFRPPWF HRNVASEFMG LVHGVYDAKA EGFVPGGASL HNCMSGHGPD ADTFEKASSI DTSKPNKVGD TMAFMFETRT LIRPTRFALD TAQLQANYFE CWQGLKKHFN PEQR
|
| |
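The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table for translation table 11 in the usual NCBI base order (T, C, A, G), ignores the table's alternative start codons (this gene starts with ATG, so that does not matter here), and uses the hypothetical helper names `clean`, `translate`, and `gc_content`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, listed in NCBI codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 20 and last 15 bases of the gene sequence in this record.
print(translate("ATGACATTGG ATTTTTCGAA"))  # MTLDFS
print(translate("CCGGAGCAAC GATGA"))       # PEQR
```

The two fragments translate to the first six and last four residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` should allow the Protein sequence and GC content rows to be checked in the same way.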