Gene BMA0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA0649 
SymbolhutI 
ID3089468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006348 
Strand
Start bp675742 
End bp676965 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content72% 
IMG OID637561466 
Productimidazolonepropionase 
Protein accessionYP_102427 
Protein GI53725645 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGA TTCTCTGGCA CAACCTGAAG CTGTGCGCGC ACGGCGACCC GAACGACACG 
ATCGCGGATG CGGCGATCGC GGTGAACGGC GACGGCACGA TCGCCTGGAC CGGGCGCGCG
AGCGACGTGC CGGCCGGCTA CGTGCACTGG CCGCGCGAGG ACCTGCGCGG CGCATGGGTG
ACGCCCGGCC TCGTCGATTG CCACACGCAC CTCGTCTACG GCGGCCAGCG CGCGGACGAG
TTCGCGCAGC GCCTGGCGGG GGCGAGCTAC GAGGAGATCG CGCAGCGCGG CGGCGGCATC
GTATCGACCG TGCGCGCGAC GCGCGACGCG AGCGAGGCGG CGCTGTTCGA GCAGGCGTGC
GCGCGGCTGC GGCCGCTCCT TGCCGAGGGC GTGACCGCGA TCGAGATCAA GTCCGGCTAC
GGGCTCGAAC TCGCGAGCGA GCGGCGGATG CTGCGCGTCG CGCGGCAGCT CGGCGAGCGC
TTTCCGGTGA GCGTCTATAC GACGTTCCTC GGCGCGCACG CGCTGCCGCC CGAGTACGCG
GGCCGCGCGG ACGAATATAT CGACGAGGTT TGCGAACGGA TGCTGCCCGC GCTCGCCGAC
GAAGGGCTCG TCGACGCGGT CGACGTGTTT TGCGAGCGGA TCGGCTTCAC GCTCGCGCAG
AGCGAGCGCG TGTTCGAAGC GGCGGCGCGG CGCGGGCTGC CCGTCAAGAT GCACGCGGAG
CAGTTGTCGA ACGGCGGCGG CTCCGCGCTC GCCGCGCGCT ATCGCGCGCT GTCGGCCGAC
CACCTCGAAT ATCTGGACGC GGCGGGCGTT GCCGCGATGC GTGCATCGGG CACGACGGCC
GTGCTGCTGC CGGGCGCGTA CTACTTCATC CGCGAGACGA AGCTGCCGCC GATCGATCTG
CTGCGCCGCC ACGGCGTGCC GATCGCGCTC GCGACCGATC ACAATCCGGG CACCTCGCCG
CTCACGTCGC TGCTGCTCAC GATGAACATG GGCTGCACGG TGTTCAAGCT GACCGTGCAG
GAGGCGCTCC TCGGCGTCAC GCGCCACGCG GCGGCGGCGC TCGGCGCGAG CGACCGGCAC
GGCTCGCTCG CGCCCGGGCG GCAGGCGGAT TTCGCGGTAT GGTCGGTCTC GACGCTCGCC
GAGCTCGCGT ACTGGTTCGG CCGGCCGCTG TGCGAGCGGG TCGTGAAGGG CGGCGTGACG
GTGTTCACGC GCGATGCGCG CTGA
 
Protein sequence
MKSILWHNLK LCAHGDPNDT IADAAIAVNG DGTIAWTGRA SDVPAGYVHW PREDLRGAWV 
TPGLVDCHTH LVYGGQRADE FAQRLAGASY EEIAQRGGGI VSTVRATRDA SEAALFEQAC
ARLRPLLAEG VTAIEIKSGY GLELASERRM LRVARQLGER FPVSVYTTFL GAHALPPEYA
GRADEYIDEV CERMLPALAD EGLVDAVDVF CERIGFTLAQ SERVFEAAAR RGLPVKMHAE
QLSNGGGSAL AARYRALSAD HLEYLDAAGV AAMRASGTTA VLLPGAYYFI RETKLPPIDL
LRRHGVPIAL ATDHNPGTSP LTSLLLTMNM GCTVFKLTVQ EALLGVTRHA AAALGASDRH
GSLAPGRQAD FAVWSVSTLA ELAYWFGRPL CERVVKGGVT VFTRDAR