Gene BTH_I1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I1397 
SymbolhmgA 
ID3846832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp1571201 
End bp1572553 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content67% 
IMG OID637841069 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_441943 
Protein GI83719016 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.762264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAGAA CGACAATCAT GACACTGGAT TTTTCGAAAC CGGGCGAAGC CGGCTATCAG 
AGCGGCTTCG CGAACGAATT CGCGACCGAG GCGCTGCCGG GCGCGCTGCC GCACGCGCGC
AACTCGCCGC AGCGCGCGCC GTACGGGCTC TACGCGGAGC AGTTGTCCGG CACCGCGTTC
ACCGCGCCGC GCGGCCATAA CCGCCGCTCG TGGCTGTACC GCATCCGGCC CGCCGCCGTG
CATCGGCCGT TCGAGCTCGT GTCCGGCGAG CGCCGGATCG TCGCCGATTT CGGCGATTCG
GGCGACGTGC CGCCGACGCC GCCGAACCAG TTGCGCTGGG ACCCGCTGCC GATGCCCGCG
CAGCCGACCG ATTTCGTCGA CGGCTGGGTG ACGATGGCGG GCAACGGCTC GGCCGCCGCG
ATGAGCGGCT GCGCGATCCA TCTGTACGCG GCGAACCGCT CGATGCGCGA GCGCTTCTTC
TACAGCGCGG ACGGCGAACT GCTGATCGTG CCGCAGGAAG GGCGGCTTTT CATCATGACG
GAGCTCGGAC GGCTCGACGT CGAGCCGTTC GAGATCGCGG TGATTCCGCG CGGCGTGCGC
TTCGCGGTCG CGCTGCCCGA CGGGCGCGCG CGCGGCTATG TCTGCGAGAA CTTCGGCGCG
CTGCTCAGGC TGCCGGATCT CGGGCCGATC GGCTCGAACG GCCTGGCGAA TCCGCGCGAC
TTCCTGACGC CGAACGCGTC GTACGAGGAT CGCGAAGGCG CGTTCGAGCT CGTCGCGAAG
TTGAACGGCC GGCTCTGGCG CGCGGACATC GACCATTCGC CGTTCGACGT CGTCGCATGG
CACGGCAACT ACGCGCCGTA CAAGTACGAC CTGCGTCACT TCAACACGAT CGGCTCGATC
AGCTACGATC ATCCGGACCC GTCGATCTTC CTCGTGCTGC AGTCGCAAAG CGATACGCCG
GGCGTCGACG CGATCGACTT CGTGATCTTC CCGCCGCGCT GGCTCGCGGC CGAGGATACG
TTCCGCCCGC CGTGGTTCCA CCGCAACGTC GCGAGCGAGT TCATGGGGCT CGTGCACGGC
GTCTACGACG CGAAGGCGGA AGGCTTCGTG CCGGGCGGCG CGAGCCTGCA CAACTGCATG
TCGGGCCACG GGCCGGATGC GGACACGTTC GAGAAGGCGT CGGCGATCGA TACGTCGAGG
CCGAACAAGG TCGGCGACAC GATGGCGTTC ATGTTCGAGA CCCGCACGCT GATCCGGCCG
ACGCGCTTCG CGCTCGATAC CGCGCAACTG CAGGCGAACT ACTTCGAATG CTGGCAAGGC
CTCAAGAAAC ATTTCAATCC GGAGCAACGA TGA
 
Protein sequence
MERTTIMTLD FSKPGEAGYQ SGFANEFATE ALPGALPHAR NSPQRAPYGL YAEQLSGTAF 
TAPRGHNRRS WLYRIRPAAV HRPFELVSGE RRIVADFGDS GDVPPTPPNQ LRWDPLPMPA
QPTDFVDGWV TMAGNGSAAA MSGCAIHLYA ANRSMRERFF YSADGELLIV PQEGRLFIMT
ELGRLDVEPF EIAVIPRGVR FAVALPDGRA RGYVCENFGA LLRLPDLGPI GSNGLANPRD
FLTPNASYED REGAFELVAK LNGRLWRADI DHSPFDVVAW HGNYAPYKYD LRHFNTIGSI
SYDHPDPSIF LVLQSQSDTP GVDAIDFVIF PPRWLAAEDT FRPPWFHRNV ASEFMGLVHG
VYDAKAEGFV PGGASLHNCM SGHGPDADTF EKASAIDTSR PNKVGDTMAF MFETRTLIRP
TRFALDTAQL QANYFECWQG LKKHFNPEQR