Gene BURPS668_2667 details


Gene Information

Locus tag: BURPS668_2667
Symbol: hutH
ID: 4883057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2642715
End bp: 2644238
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 71%
IMG OID: 640128595
Product: histidine ammonia-lyase
Protein accession: YP_001059691
Protein GI: 126438543
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
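
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the 1524 bp include the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 2642715
END_BP = 2644238


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1524 507"
```

Under these assumptions the result matches the Gene Length and Protein Length fields of the record.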


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0314474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCACGC TGACCCCAGG CCGTCTGACT CTCCCGCAAC TGCGCCGGAT CGCCCGCGAG 
AACGTGCAGA TCGCGCTCGA TCCCGCGAGC TTCGCCGCGA TCGACCGGGG CGCGCAGGCC
GTCGCCGACA TCGCCGCGAA GGGCGAGCCG GCGTACGGCA TCAACACGGG CTTCGGGCGC
CTCGCGAGCA CGCACATTCC GCACGACCAG CTCGAGCTGC TGCAGAAGAA CCTCGTGCTG
TCGCACGCGG TGGGCGTCGG CGAGCCGATG GCGCGCCCCG TCGTGCGCCT GTTGATGGCG
CTCAAGCTCT CGAGCCTCGG CCGCGGCCAC TCGGGCATTC GTCGCGTCGT GATGGACGCG
CTCGTCGCGC TGTTCAACGC GGACGTGCTG CCGCTCATTC CGGTCAAGGG CTCGGTGGGC
GCGTCGGGCG ACCTTGCGCC GCTCGCGCAC ATGTCGGCCG TGCTGCTCGG CATCGGCGAC
GTGTTCATCC GCGGCGAGCG CGCGAGCGCG GCCGAAGGGC TGCGTGTCGC GGGCCTCGCG
CCGCTTACGC TCGAAGCGAA GGAGGGCCTC GCGCTGCTGA ACGGCACGCA GGCGTCGACC
GCGCTCGCGC TCGACAACCT GTTCGCGATC GAGGACCTGT ACCGCACGGC GCTCGTGTCG
GGCGCGCTGT CGGTCGACGC GGCGGCGGGC TCGGTGAAGC CGTTCGACGC GCGCATCCAC
GAGCTGCGCG GCCATCGCGG CCAGATCGAC GCGGCCGCCG CGTACCGGTC GCTGCTCGAC
GGCTCGGCGA TCAACGTGTC GCACCGCGAT TGCGACAAGG TGCAGGACCC GTACAGCCTG
CGCTGCCAGC CGCAGGTGAT GGGCGCGTGT CTCGACCAGA TCCGCCACGC GGCCGGCGTG
CTGCTCATCG AGGCGAACGC GGTGTCGGAC AACCCGCTGA TCTTCCCGGA CACGGGCGAG
GTGCTGTCGG GCGGCAATTT CCACGCGGAG CCCGTCGCGT TCGCGGCCGA CAATCTCGCG
ATCGCCGCGG CCGAGATCGG CGCGCTCGCC GAGCGCCGCA TCGCGCTGTT GATCGACGCG
ACGCTCTCCG GCCTGCCGCC TTTCCTCGTG AAGGACGGCG GCGTGAACTC GGGCTTCATG
ATCGCGCACG TGACGGCCGC CGCGCTCGCG TCGGAAAACA AGACGCTCGC GCATCCGGCG
TCGGTCGATT CGCTGCCGAC GTCGGCGAAC CAGGAAGACC ACGTGTCGAT GGCGACGTTC
GCCGCGCGCA AGCTCGCGGA CATCGCGGAG AACGTCGCGA ACATCCTCGC GATCGAGCTG
CTCGCCGCGG CGCAAGGCGT CGACCTGCGC GCGCCGCACG CAACGAGCCC GGCGCTGCAG
CACGCGATGA AGACGATTCG CGCGGACGTC GCGCACTACG ATCTCGACCA CTACTTCGCG
CCCGACATCG CGGTGGTCGC GCGGCGCGTG CGCGAGCGCG CGTTCGCGAC GCTGAGCCCG
CTGTCGTTCG AATCGGAACA ATAA
 
Protein sequence
MITLTPGRLT LPQLRRIARE NVQIALDPAS FAAIDRGAQA VADIAAKGEP AYGINTGFGR 
LASTHIPHDQ LELLQKNLVL SHAVGVGEPM ARPVVRLLMA LKLSSLGRGH SGIRRVVMDA
LVALFNADVL PLIPVKGSVG ASGDLAPLAH MSAVLLGIGD VFIRGERASA AEGLRVAGLA
PLTLEAKEGL ALLNGTQAST ALALDNLFAI EDLYRTALVS GALSVDAAAG SVKPFDARIH
ELRGHRGQID AAAAYRSLLD GSAINVSHRD CDKVQDPYSL RCQPQVMGAC LDQIRHAAGV
LLIEANAVSD NPLIFPDTGE VLSGGNFHAE PVAFAADNLA IAAAEIGALA ERRIALLIDA
TLSGLPPFLV KDGGVNSGFM IAHVTAAALA SENKTLAHPA SVDSLPTSAN QEDHVSMATF
AARKLADIAE NVANILAIEL LAAAQGVDLR APHATSPALQ HAMKTIRADV AHYDLDHYFA
PDIAVVARRV RERAFATLSP LSFESEQ
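
The relationship between the two sequence blocks above can be illustrated with a short translation sketch. It is a minimal example rather than a reference implementation: the helper names are hypothetical, and it assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). A small GC-fraction helper is included for the GC content field.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 nt of the gene sequence above.
    print(translate("ATGATCACGC TGACCCCAGG CCGTCTGACT"))  # prints "MITLTPGRLT"
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after the same whitespace cleanup) is a quick way to check that the two blocks belong together.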