Gene BURPS1106A_A2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2454 
Symbol 
ID4903937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2420445 
End bp2421488 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content67% 
IMG OID640145558 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_001076485 
Protein GI126457348 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACTGA TCAGCGATGC GACCTTGCGC GACGGCAACC ATGCGATTCG TCACCAACTG 
AGCGCCGCGC AGATACATGC GTATGCGCGC GCGGCCGACG AAGCCGGCAT CGATGTCGTC
GAAGTCGGCC ACGGCAATGG TCTCGGAGGC TCGTCTTGCC TGCTCGGGCA GACGCCGATC
GGCGATCGCC TGATGCTCGA GACCGCGCGC GCCGCGCTGC GCACGAGCCG GCTCGGCGTG
CATTTCATTC CGGGGCTCGG CAAGGCGGCG GACATCTCGC TTGCGCTCGA GATCGGCGTC
GATGTCGTGC GCGTCGCGAC GCATTGCACC GAGGCGAACG TGTCGGCGCG CTTCATCGAG
CAGACCCGGA CGGCCGGACG CACGGCGTTC GGCGTGCTGA TGATGTCGCA CATGGCGCCG
CCCGATACGC TGCTCGCGCA GGCGAAGCTG ATGGAGCGCT ACGGCGCGCA GGCAGTGGTG
CTGATGGACA GCGCCGGGTA TTCGACGCCG TCGCTCGTGC GCGCGAAGGT CGAGCGCCTC
GTCGACGGTC TCGACATCGA CGTCGGCTTT CACGCGCACA ACAACCTCGG GCTCGCGGTC
GCGAACAGCC TCGTCGCGCT CGAAGCGGGG GCGCGCATCG TCGACGCATG CGTGAAAGGC
TTCGGGGCCG GCGCGGGCAA TACGCAGCTC GAAACGCTCG TCGCCGCGAT GGAGCGCGAA
GGGCACGACA CGCGCACGAC GTTCGAGCGC GTGATGACGC TCGCGCGCGG CACGGAGACG
TTTCTCAATC CGAAGACGCC GCACATCCAG CCGGCGAACA TCGCGAGCGG GCTGTACGGC
CTTTTCTCCG GCTACGTGCC GCATATCCAG AAAGCCGCGC AGGAATTCGG CGTCAACGAA
TTCGAGCTGT ACAAGCGGCT TGCGGAGCGC AAGCTCGTCG CCGGGCAGGA GGACATCATC
ATCGAAGAGG CAAGCCGTCT CGCGCGCGAA CGGGATGTGC AGCGCGCAAC GGGCGGCGTG
CGCGTTCGCG AGCTGTCCGC GTGA
 
Protein sequence
MILISDATLR DGNHAIRHQL SAAQIHAYAR AADEAGIDVV EVGHGNGLGG SSCLLGQTPI 
GDRLMLETAR AALRTSRLGV HFIPGLGKAA DISLALEIGV DVVRVATHCT EANVSARFIE
QTRTAGRTAF GVLMMSHMAP PDTLLAQAKL MERYGAQAVV LMDSAGYSTP SLVRAKVERL
VDGLDIDVGF HAHNNLGLAV ANSLVALEAG ARIVDACVKG FGAGAGNTQL ETLVAAMERE
GHDTRTTFER VMTLARGTET FLNPKTPHIQ PANIASGLYG LFSGYVPHIQ KAAQEFGVNE
FELYKRLAER KLVAGQEDII IEEASRLARE RDVQRATGGV RVRELSA