Gene BURPS1710b_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2149 
SymbolhisC 
ID3689727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2352965 
End bp2353957 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content65% 
IMG OID637728605 
Productputative histidinol-phosphate aminotransferase 
Protein accessionYP_333544 
Protein GI76809745 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000106616 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACCG AAGTGCGGGC GGCGGCGCAA GCCGTCTGCC TGGCGTTCAA TGAAAACCCG 
GAAGCGGTGG AGCCGCGCGT GCAGGCCGCG ATTGCTGCCG CGGCCGCGCG GATCAATCGC
TACCCGTTTG ACGCCGAACC GCGCGTCATG CGCAAGCTCG CCGAGCATTT CAGCTGTCCC
GAGGACAACC TGATGCTGGT GCGCGGCATC GACGAATGCT TCGATCGAAT CAGCGCCGAA
TTTTCGTCGA TGCGCTTCGT TACCGCATGG CCGGGCTTCG ACGGCTATCG CGCACGCATC
GCCGTCAGCG GGCTGAGACA CTTCGAAATC GGCCTGACCG ACGATCTGCT GCTCGATCCG
AACGATCTCG CCCAAGTCTC GCGTGACGAT TGCGTCGTGC TCGCCAATCC TTCGAATCCG
ACCGGCCAGG CGCTGAGCGC GGGCGAGCTC GAGCAATTGA GGCAGCGCGC GGGCAAGTTG
CTGATCGACG AAACCTACGT CGATTATTCG TCGTTTCGCG CCCGCGGCCT GGCTTACGGC
GAGAACGAAC TGGTGTTTCG TTCGTTCTCG AAATCCTACG GCCTCGCCGG CTTGCGGCTC
GGCGCGCTGT TCGGGCCGAG CGAGCTGATT GCCGCGATGA AGCGCAAGCA GTGGTTCTGC
AACGTCGGCA CGCTCGATCT GCATGCGCTC GAAGCCGCGC TCGACAACGA TCGCGCACGT
GAGGCGCACA TCGCGAAGAC GCTCGCGCAG CGCCGCCGCG TCGCCGACGC GCTGCGCGGG
CTCGGCTACC GCGTCGCGTC GTCCGAGGCC AATTTCGTGC TCGTCGAAAA CGCCGCCGGC
GAGCGCACGC TGCGCTTCCT GCGCGAACGG GGCATTCAGG TGAAGGACGC CGGCCAGTTC
GGACTTCACC ACCACATCAG AATCAGCATC GGCCGTGAAG AGGACAACGA TCGGTTGCTC
GCGGCGCTGG CCGAATATTC CGACCACTCA TAA
 
Protein sequence
MDTEVRAAAQ AVCLAFNENP EAVEPRVQAA IAAAAARINR YPFDAEPRVM RKLAEHFSCP 
EDNLMLVRGI DECFDRISAE FSSMRFVTAW PGFDGYRARI AVSGLRHFEI GLTDDLLLDP
NDLAQVSRDD CVVLANPSNP TGQALSAGEL EQLRQRAGKL LIDETYVDYS SFRARGLAYG
ENELVFRSFS KSYGLAGLRL GALFGPSELI AAMKRKQWFC NVGTLDLHAL EAALDNDRAR
EAHIAKTLAQ RRRVADALRG LGYRVASSEA NFVLVENAAG ERTLRFLRER GIQVKDAGQF
GLHHHIRISI GREEDNDRLL AALAEYSDHS