Gene BURPS1710b_0750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_0750 
SymbolhisC 
ID3690866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp762822 
End bp763892 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content68% 
IMG OID637727206 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_332164 
Protein GI76812095 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.331282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGTT ACTGGAGCGA CATCGTCCGT CAACTCGAGC CGTATGTGCC GGGCGAGCAG 
CCGGCGCTCG CGCATCCCGT CAAGCTGAAC ACGAACGAGA ATCCGTATCC GCCGTCGCCG
CGCGCGCTCG ACGCGATCCG GCGCGAGCTC GGCGACACGG GCGAAGCGCT GCGCCGCTAT
CCGGACCCGG TCGCGCGCAG GCTGCGCGAG ACGGTGGCGG CCTATCACGG CATCGCGCCC
GAGCAGGTGT TCGCCGGCAA CGGCTCCGAC GAAGTGCTCG CGCACGCGTT CCAGGCGCTC
CTGCAACACG ACAGGCCGCT GCGCTTCCCG GACATCACGT ACAGCTTCTA CCCGACCTAT
GCGCGGCTCT ATCGCGTCGC ATACGAGACG GTACCGCTCG CCGACGATTT CTCGATCGTC
GTCGACGACT ATCTCGACGA CGCCGGCTGC GTGCTGTTCC CGAACCCGAA CGCGCCGACG
GGCCGCGCGC TGCCGCTTGC CGACATCGAG CGGATCGTCG CCGCCAACCC GAGCTCGGTT
GTCGTGATCG ACGAGGCCTA TGTCGATTTC GGCGCGGAAT CGGCCGTCTC GCTGATCGCG
CGCTATCCGA ATCTGCTCGT CGTGCATACC GTGTCGAAGG CGCGCTCGCT CGCCGGCATG
CGCGTCGGCT TCGCGTTCGG CGACGCCGCG CTGATCGACG CGCTCACGCG CGTGAAGGAC
AGCTTCAACT CGTATCCGCT CGATCGTCTC GCGCAAGTCG CGACGCAAGC GTCGTACGAG
GACGAGGCGT GGTTCCAGGC GACGCGCAAG CAGGTGATCG CGAGCCGCGA GCGGCTCGTC
GGCGCGCTGG CGGCGCTCGG CTTCGACGTC GTGCCGTCGG CGGCGAATTT CGTGTTCGCG
CGCCCTCGTA GCCACGATGC GGCGACGCTC GCCGCGCAAC TGAAACAGCG GGAAATTTTC
GTGCGGCACT TCAAGCTGCC GCGGATCGAC CAGCACTTGC GCATCACGGT CGGCTCGGAC
GCCGAATGCG ACGCGCTCGT CGCGGCGCTG CGGGAGCTGC TCGCCGCTTA A
 
Protein sequence
MSRYWSDIVR QLEPYVPGEQ PALAHPVKLN TNENPYPPSP RALDAIRREL GDTGEALRRY 
PDPVARRLRE TVAAYHGIAP EQVFAGNGSD EVLAHAFQAL LQHDRPLRFP DITYSFYPTY
ARLYRVAYET VPLADDFSIV VDDYLDDAGC VLFPNPNAPT GRALPLADIE RIVAANPSSV
VVIDEAYVDF GAESAVSLIA RYPNLLVVHT VSKARSLAGM RVGFAFGDAA LIDALTRVKD
SFNSYPLDRL AQVATQASYE DEAWFQATRK QVIASRERLV GALAALGFDV VPSAANFVFA
RPRSHDAATL AAQLKQREIF VRHFKLPRID QHLRITVGSD AECDALVAAL RELLAA