Gene BMASAVP1_0386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_0386 
Symbol 
ID4676970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008784 
Strand
Start bp398101 
End bp399219 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content71% 
IMG OID639842913 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_989996 
Protein GI121597831 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.195218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAACA TCGACAATCC GCAACGCGAT CGGGAAACCG GCTTCGCCGA CGCGACGCAG 
GACACGACGC GCATCGACGA CGTGCGCATC GGCGCGGTGC GCCCGCTCAT CTCGCCCGCG
CTGCTGCAGG ACGAACTGCC GGTGCCGAGC GCCGTCCAGG CGCTCGTCGA AGCGAGCCGC
GACGCGATCG GCGACGTGCT GCACGGCCGC GACGACCGCC TGCTCGCGAT CGTCGGCCCG
TGCTCGATCC ACGATCACGA TCAGGCGCTC GACTACGCGC GCCGGCTGAA AAGCGCCGCC
GACGCGCTGC GCGACGACCT GCTGATCGTG ATGCGCGTGT ATTTCGAGAA GCCGCGCACG
ACGGTCGGCT GGAAGGGCTA CATCAACGAT CCGCGCCTCG ACGGCAGCTT CCGCATCAAC
GAAGGGCTGC GCGCCGCGCG CCGGCTGCTG ATCGACATCA ACGCGCTCGG CCTGCCCGCC
GGCACCGAAT TCCTCGATCT GCTGAGCCCG CAGTACATCG CGGATCTGAT CGCCTGGGGC
GCGATCGGCG CGCGCACGAC CGAGAGCCAG AGCCACCGGC AGCTCGCGTC GGGGCTGAGC
TGCCCGATCG GCTTCAAGAA CGGCACCGAC GGCGGCGTGC AGGTCGCGGC CGACGCGATC
GTCGCGGCGC GCGCGAGCCA CGCGTTCATG GGCATGACGA AGATGGGGAT GGCCGCGATT
TTCGAGACGC GCGGCAACGA CGCCGCGCAC GTGATCCTGC GCGGCGGCAA GCGGGGCCCG
AACTACGATC GCGCGAGCGT CGACGAGGCG TGCGCGGTGC TGCGCGCGGC GGGCCAGCGC
GAGCAGGTGA TGATCGACTG CTCGCACGCG AATTCGAACA AGTCGCACCT GCGGCAGGTC
GACGTCGCCG AGGACCTCGC GCGCCAGTTG TCGGACGGCG AGCGACGCAT CACCGGCGTG
ATGGTCGAGA GCAACCTGGA GGCCGGCCGG CAGGACCTGA AGCCCGGCGT GCCGCTGCAA
TACGGCGTGT CGATCACCGA CGCGTGCCTG AGCTGGGCGC AGACCGAGCC CGTGCTCGAC
ACGCTCGCGC AGGCGGTGCG GCGGCGGCGC GCCGCCTGA
 
Protein sequence
MQNIDNPQRD RETGFADATQ DTTRIDDVRI GAVRPLISPA LLQDELPVPS AVQALVEASR 
DAIGDVLHGR DDRLLAIVGP CSIHDHDQAL DYARRLKSAA DALRDDLLIV MRVYFEKPRT
TVGWKGYIND PRLDGSFRIN EGLRAARRLL IDINALGLPA GTEFLDLLSP QYIADLIAWG
AIGARTTESQ SHRQLASGLS CPIGFKNGTD GGVQVAADAI VAARASHAFM GMTKMGMAAI
FETRGNDAAH VILRGGKRGP NYDRASVDEA CAVLRAAGQR EQVMIDCSHA NSNKSHLRQV
DVAEDLARQL SDGERRITGV MVESNLEAGR QDLKPGVPLQ YGVSITDACL SWAQTEPVLD
TLAQAVRRRR AA