Gene BTH_I0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0471 
SymbolhisC-1 
ID3848756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp524741 
End bp525808 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content68% 
IMG OID637840144 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_441029 
Protein GI83719412 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGTT ACTGGAGCGA CATCGTCCAT CAACTCGAGC CGTACGTGCC GGGCGAGCAG 
CCGGCGCTCG CGCATCCCGT CAAGCTGAAC ACGAACGAGA ACCCGTATCC GCCGTCGCCG
CGCGCGCTCG ACGCGATTCG GCATGAGCTC GGCGCCACGG GCGAGGCGCT GCGCCGCTAT
CCCGACCCCG TCGCGCGCAA GCTGCGCGAG ACGGTCGCCG CCCATCACGG AATCGCGCCC
GAGCAGGTGT TCGCCGGCAA CGGCTCCGAC GAAGTGCTCG CGCATGCGTT CCAGGCGCTC
CTGCAGCACG ACAGGCCGCT GCGCTTCCCG GACATCACGT ACAGCTTCTA CCCGACCTAC
GCACGGCTCT ATCGCGTCGC ATACGAGACG GTGCCGCTCG CCGACGATTT CTCGATCGTC
GTCGACGACT ATCTCGACGA CGCCGGCTGC GTGCTGTTCC CGAATCCGAA CGCGCCGACG
GGCCGTGCGC TGCCGCTTGC CGACATCGAG CGGATCGTCG CCGCGAATCC GAGCTCGGTC
GTCGTGATCG ACGAGGCGTA TGTCGACTTC GGCGCGGAGT CGGCCGTGTC GCTGATCTCG
CGCTACCCGA ACCTGCTCGT CGTGCATACC GCGTCGAAGG CGCGCTCGCT CGCGGGCATG
CGCGTCGGCT TCGCGTTCGG CGACGCCGCG CTGATCGATG CGCTCACACG CGTGAAGGAC
AGCTTCAACT CGTACCCGCT CGACCGTCTC GCGCAGGTGG CGACGCAGGC GTCGTACGAG
GACGACGCGT GGTTCGAAGC GACACGCAAG CAGGTGATCG CGAGCCGCGA GCGGCTTGTC
GCCGCGCTCG CCGCGCTCGG CTTCGACGTC GTGCCGTCGG CCGCGAATTT CGTGTTCGCG
CGCCATCCTC GCCACGATGC GGCGACGCTT GCCGCACGAC TGAAGCTACG GGAAATTTTC
GTGCGGCACT TCAAGCTGCC GCGAATCGAC CAGCATTTGC GCATCACGGT CGGCACCGAC
GCAGAATGCG ACGCGCTCGT CGCCGCGCTG CGCGAATTGC TCGCGTAA
 
Protein sequence
MSRYWSDIVH QLEPYVPGEQ PALAHPVKLN TNENPYPPSP RALDAIRHEL GATGEALRRY 
PDPVARKLRE TVAAHHGIAP EQVFAGNGSD EVLAHAFQAL LQHDRPLRFP DITYSFYPTY
ARLYRVAYET VPLADDFSIV VDDYLDDAGC VLFPNPNAPT GRALPLADIE RIVAANPSSV
VVIDEAYVDF GAESAVSLIS RYPNLLVVHT ASKARSLAGM RVGFAFGDAA LIDALTRVKD
SFNSYPLDRL AQVATQASYE DDAWFEATRK QVIASRERLV AALAALGFDV VPSAANFVFA
RHPRHDAATL AARLKLREIF VRHFKLPRID QHLRITVGTD AECDALVAAL RELLA