Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1397 |
Symbol | hmgA |
ID | 3846832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 1571201 |
End bp | 1572553 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637841069 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_441943 |
Protein GI | 83719016 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.762264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGAA CGACAATCAT GACACTGGAT TTTTCGAAAC CGGGCGAAGC CGGCTATCAG AGCGGCTTCG CGAACGAATT CGCGACCGAG GCGCTGCCGG GCGCGCTGCC GCACGCGCGC AACTCGCCGC AGCGCGCGCC GTACGGGCTC TACGCGGAGC AGTTGTCCGG CACCGCGTTC ACCGCGCCGC GCGGCCATAA CCGCCGCTCG TGGCTGTACC GCATCCGGCC CGCCGCCGTG CATCGGCCGT TCGAGCTCGT GTCCGGCGAG CGCCGGATCG TCGCCGATTT CGGCGATTCG GGCGACGTGC CGCCGACGCC GCCGAACCAG TTGCGCTGGG ACCCGCTGCC GATGCCCGCG CAGCCGACCG ATTTCGTCGA CGGCTGGGTG ACGATGGCGG GCAACGGCTC GGCCGCCGCG ATGAGCGGCT GCGCGATCCA TCTGTACGCG GCGAACCGCT CGATGCGCGA GCGCTTCTTC TACAGCGCGG ACGGCGAACT GCTGATCGTG CCGCAGGAAG GGCGGCTTTT CATCATGACG GAGCTCGGAC GGCTCGACGT CGAGCCGTTC GAGATCGCGG TGATTCCGCG CGGCGTGCGC TTCGCGGTCG CGCTGCCCGA CGGGCGCGCG CGCGGCTATG TCTGCGAGAA CTTCGGCGCG CTGCTCAGGC TGCCGGATCT CGGGCCGATC GGCTCGAACG GCCTGGCGAA TCCGCGCGAC TTCCTGACGC CGAACGCGTC GTACGAGGAT CGCGAAGGCG CGTTCGAGCT CGTCGCGAAG TTGAACGGCC GGCTCTGGCG CGCGGACATC GACCATTCGC CGTTCGACGT CGTCGCATGG CACGGCAACT ACGCGCCGTA CAAGTACGAC CTGCGTCACT TCAACACGAT CGGCTCGATC AGCTACGATC ATCCGGACCC GTCGATCTTC CTCGTGCTGC AGTCGCAAAG CGATACGCCG GGCGTCGACG CGATCGACTT CGTGATCTTC CCGCCGCGCT GGCTCGCGGC CGAGGATACG TTCCGCCCGC CGTGGTTCCA CCGCAACGTC GCGAGCGAGT TCATGGGGCT CGTGCACGGC GTCTACGACG CGAAGGCGGA AGGCTTCGTG CCGGGCGGCG CGAGCCTGCA CAACTGCATG TCGGGCCACG GGCCGGATGC GGACACGTTC GAGAAGGCGT CGGCGATCGA TACGTCGAGG CCGAACAAGG TCGGCGACAC GATGGCGTTC ATGTTCGAGA CCCGCACGCT GATCCGGCCG ACGCGCTTCG CGCTCGATAC CGCGCAACTG CAGGCGAACT ACTTCGAATG CTGGCAAGGC CTCAAGAAAC ATTTCAATCC GGAGCAACGA TGA
|
Protein sequence | MERTTIMTLD FSKPGEAGYQ SGFANEFATE ALPGALPHAR NSPQRAPYGL YAEQLSGTAF TAPRGHNRRS WLYRIRPAAV HRPFELVSGE RRIVADFGDS GDVPPTPPNQ LRWDPLPMPA QPTDFVDGWV TMAGNGSAAA MSGCAIHLYA ANRSMRERFF YSADGELLIV PQEGRLFIMT ELGRLDVEPF EIAVIPRGVR FAVALPDGRA RGYVCENFGA LLRLPDLGPI GSNGLANPRD FLTPNASYED REGAFELVAK LNGRLWRADI DHSPFDVVAW HGNYAPYKYD LRHFNTIGSI SYDHPDPSIF LVLQSQSDTP GVDAIDFVIF PPRWLAAEDT FRPPWFHRNV ASEFMGLVHG VYDAKAEGFV PGGASLHNCM SGHGPDADTF EKASAIDTSR PNKVGDTMAF MFETRTLIRP TRFALDTAQL QANYFECWQG LKKHFNPEQR
|
| |