Gene BURPS668_2663 details


Gene Information

Locus tag: BURPS668_2663
Symbol: hutI
ID: 4885491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2638405
End bp: 2639628
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 72%
IMG OID: 640128591
Product: imidazolonepropionase
Protein accession: YP_001059687
Protein GI: 126439919
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
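
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the database record.

```python
# Values copied from the record above.
start_bp = 2638405
end_bp = 2639628

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1224 bp) and Protein Length (407 aa) fields.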


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCGA TTCTCTGGCA CAACCTGAAG CTGTGCGCGC ACGGCGACCC GAACGACACG 
ATCGCGGATG CGGCGATCGC GGTGAACGGC GACGGCACGA TCGCCTGGAC CGGACGCGCG
AGCGACGTGC CGGCCGGCTA CGTGCACTGG CCGCGCGAGG ACCTGCGCGG CGCATGGGTG
ACGCCCGGCC TCGTCGATTG CCACACGCAC CTCGTCTACG GCGGCCAGCG CGCGGACGAG
TTCGCGCAGC GCCTGGCGGG GGCGAGCTAC GAGGAGATCG CGCGGCGCGG CGGCGGCATC
GTATCGACCG TGCGCGCGAC GCGCGACGCG AGCGAGGCGG CGCTGTTCGA GCAGGCATGC
GCGCGGCTGC GGCCGCTCCT TGCCGAGGGC GTGACCGCGA TCGAGATCAA GTCCGGCTAC
GGGCTCGAAC TCGCGAGCGA GCGGCGGATG CTGCGCGTCG CGCGGCAGCT CGGCGAGCGC
TTTCCGGTGA GCGTCTATAC GACGTTCCTC GGCGCGCACG CGCTGCCGCC CGAGTACGCG
GGCCGCGCGG ACGAATATAT CGACGAGGTT TGCGAACGGA TGCTGCCCGC GCTCGCCGAC
GAAGGGCTCG TCGACGCGGT CGACGTGTTT TGCGAGCGGA TCGGCTTCAC GCTCGCGCAG
AGCGAGCGCG TGTTCGAAGC GGCGGCGCGG CGCGGGCTGC CCGTCAAGAT GCACGCGGAG
CAACTGTCGA ACGGCGGCGG CTCCGCGCTC GCCGCGCGCT ATCGCGCGCT GTCGGCCGAC
CACCTCGAAT ATCTGGACGC GGCGGGCGTC GCCGCGATGC GTGCATCGGG CACGACGGCC
GTGCTGCTGC CGGGCGCGTA CTACTTCATC CGCGAGACGA AGCTGCCGCC GATCGATCTG
CTGCGCCGCC ACGGCGTGCC GATCGCGCTC GCGACCGATC ACAATCCGGG CACCTCGCCG
CTCACGTCGC TGCTGCTCAC GATGAACATG GGCTGCACGG TGTTCAAGCT GACCGTGCAG
GAGGCGCTCC TCGGCGTCAC GCGCCACGCG GCGGCGGCGC TCGGCGCGAG CGACCGGCAC
GGCTCGCTCG CGCCCGGGCG GCAGGCGGAT TTCGCGGTAT GGCCGGTCTC GACGCTCGCC
GAGCTCGCGT ACTGGTTCGG CCGGCCGCTG TGCGAGCGGG TCGTGAAGGG CGGCGTGACG
GTGTTCACGC GCGATGCGCG CTGA
 
Protein sequence
MKSILWHNLK LCAHGDPNDT IADAAIAVNG DGTIAWTGRA SDVPAGYVHW PREDLRGAWV 
TPGLVDCHTH LVYGGQRADE FAQRLAGASY EEIARRGGGI VSTVRATRDA SEAALFEQAC
ARLRPLLAEG VTAIEIKSGY GLELASERRM LRVARQLGER FPVSVYTTFL GAHALPPEYA
GRADEYIDEV CERMLPALAD EGLVDAVDVF CERIGFTLAQ SERVFEAAAR RGLPVKMHAE
QLSNGGGSAL AARYRALSAD HLEYLDAAGV AAMRASGTTA VLLPGAYYFI RETKLPPIDL
LRRHGVPIAL ATDHNPGTSP LTSLLLTMNM GCTVFKLTVQ EALLGVTRHA AAALGASDRH
GSLAPGRQAD FAVWPVSTLA ELAYWFGRPL CERVVKGGVT VFTRDAR
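
As a rough consistency check between the two sequences, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in which start codons it permits), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and trailing partial codons."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and final 24 nt of the gene sequence, copied from this record.
# Each full sequence line holds 60 nt, so the last line starts in frame.
first_block = "ATGAAATCGA TTCTCTGGCA CAACCTGAAG"
last_block = "GTGTTCACGC GCGATGCGCG CTGA"

n_terminus = translate(first_block)
c_terminus = translate(last_block)
print(n_terminus)
print(c_terminus)
```

The translated ends match the first ten and last seven residues of the protein sequence, with the final TGA codon read as a stop.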