Gene BURPS668_1993 details


Gene Information

Locus tag: BURPS668_1993
Symbol:
ID: 4882856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1970451
End bp: 1971461
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 64%
IMG OID: 640127921
Product: histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
Protein accession: YP_001059028
Protein GI: 126438425
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
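
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record's own values agree with both assumptions, but the record does not state them. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length = gene_length(1970451, 1971461)
print(length)                  # 1011
print(protein_length(length))  # 336
```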


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.156629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGTTG GCGAGGCAAT GGATACCGAA GTGCAGGCGG CGGCGCAAAC CGTCTGCCTG 
GCGTTCAATG AAAACCCGGA AGCGGTGGAG CCGCGCGTGC AGGCCGCGAT TGCTGCCGCG
GCCGCGCGGA TCAATCGCTA CCCGTTTGAT GCCGAACCGC GCGTCATGCG CAAGCTCGCC
GAGCATTTCA GCTGTCCCGA GGACAACCTG ATGCTGGTGC GCGGCATCGA CGAATGCTTC
GATCGAATCA GCGCCGAATT TTCGTCGATG CGCTTCGTTA CCGCATGGCC GGGCTTCGAC
GGCTATCGCG CACGCATCGC CGTCAGCGGG CTGAGACACT TCGAAATCGG CCTGACCGAC
GATCTGCTGC TCGATCCGAA CGATCTCGCC CAAGTCTCGC GTGACGATTG CGTCGTGCTC
GCCAATCCTT CGAATCCGAC CGGCCAGGCG CTGAGCGCGG GCGAGCTCGA TCAATTGAGG
CAGCGCGCGG GCAAGTTGCT GATCGACGAA ACCTACGTCG ATTATTCGTC GTTTCGCGCC
CGCGGCCTGG CTTACGGCAA GAACGAACTG GTGTTTCGTT CGTTCTCGAA ATCCTACGGC
CTTGCCGGCT TGCGGCTCGG CGCGCTGTTC GGGCCGAGCG AGCTGATTGC TGCGATGAAG
CGCAAGCAGT GGTTCTGCAA CGTCGGCACG CTCGATCTGC ATGCGCTCGA AGCCGCGCTC
GACAACGATC GCGCACGTGA GGCGCACATC GCGAAGACGC TCGCGCAGCG CCGCCGCGTC
GCCGACGCGC TGCGCGGGCT CGGCTACCGC GTCGCGTCGT CCGAGGCCAA TTTCGTGCTC
GTCGAAAACG CCGCCGGCGA GCGCACGCTG CGCTTCCTGC GCGAACGGGG CATTCAGGTG
AAGGACGCCG GCCAGTTCGG ACTTCACCAC CACATCAGAA TCAGCATCGG CCGTGAAGAG
GACAACGATC GGTTGCTCGC GGCGCTGGCC GAATATTCCG ACCACTCATA A
 
Protein sequence
MSVGEAMDTE VQAAAQTVCL AFNENPEAVE PRVQAAIAAA AARINRYPFD AEPRVMRKLA 
EHFSCPEDNL MLVRGIDECF DRISAEFSSM RFVTAWPGFD GYRARIAVSG LRHFEIGLTD
DLLLDPNDLA QVSRDDCVVL ANPSNPTGQA LSAGELDQLR QRAGKLLIDE TYVDYSSFRA
RGLAYGKNEL VFRSFSKSYG LAGLRLGALF GPSELIAAMK RKQWFCNVGT LDLHALEAAL
DNDRAREAHI AKTLAQRRRV ADALRGLGYR VASSEANFVL VENAAGERTL RFLRERGIQV
KDAGQFGLHH HIRISIGREE DNDRLLAALA EYSDHS
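
The protein sequence above can be reproduced from the gene sequence using translation table 11. The following is a minimal sketch in plain Python, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, and it treats the first codon as methionine when it is one of the table 11 start codons (here GTG). The helper names `clean` and `translate_cds` are hypothetical.

```python
# Codons are ordered TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11 (bacterial, archaeal and plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCGTTG GCGAGGCAAT GGATACCGAA"))  # MSVGEAMDTE
```

Passing the full gene sequence to `translate_cds` in place of the 30-base prefix should give the full protein for comparison with the listing above.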