Gene BURPS668_3699 details

Gene Information

Locus tag: BURPS668_3699
Symbol: aroB
ID: 4881745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3622034
End bp: 3623113
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 68%
IMG OID: 640129627
Product: 3-dehydroquinate synthase
Protein accession: YP_001060703
Protein GI: 126441391
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
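
The start, end, gene length, and protein length fields in this record are mutually consistent, and a short sketch can show the arithmetic. It assumes 1-based inclusive coordinates (a common annotation convention, not stated in this record) and a CDS that ends in a single stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3622034, 3623113)
print(length_bp)                  # 1080
print(protein_length(length_bp))  # 359
```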


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00532697
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACCG TCAACGTCGA CCTGGGCGAG CGCGCCTATC CGATCCACAT CGGCGCCGAT 
CTGATCGGCC GCACCGAGCT TTTCGCGCCG CACATCGCGG GCGCATCCGT CACGATCGTC
ACGAACACCA CCGTCGAGCC GCTCTACGGC GACACGCTGC GCGCCGCGCT CGCGCCGCTC
GGCAAGCGCG TGTCGACCGT CGTCCTGCCC GACGGCGAAG CGTACAAGAA CTGGGAAACG
CTCAATCTGA TCTTCGATGG CCTGCTCGAG CAGCACGCCG ATCGCAAGAC GACGCTGATC
GCGCTCGGCG GCGGCGTGAT CGGCGACATG ACGGGCTTCG CGGCCGCATG CTATATGCGC
GGCGTGCCGT TCATCCAGGT GCCGACGACG CTCCTGTCGC AGGTTGATTC GTCGGTCGGC
GGCAAGACGG GCATCAACCA TCCGCTCGGC AAGAACATGA TCGGCGCGTT CTATCAGCCG
CAGGCGGTGA TCGCCGATAT CGGCGCGCTG TCGACGCTGC CCGATCGCGA GCTTGCCGCG
GGCGTCGCCG AGATCGTCAA GACGGGCGCG ATCGCCGATG CCGCGTTCTT CGACTGGATC
GAGGCGAACG TGGGCGCGCT CACTCGCCGC GATCCCGACG CGCTCGCGCA CGCGGTCAAG
CGCTCGTGCG AGATCAAGGC GGGCGTCGTC GCGGCGGACG AGCGCGAGGG CGGTCTGCGC
GCGATCCTTA ATTTTGGCCA TACGTTCGGG CACGCGATCG AAGCGGGGCT CGGCTACGGC
GAGTGGCTGC ACGGCGAGGC GGTGGGCTGC GGCATGGTGA TGGCGGCCGA CCTGTCGGTG
CGAACCGGCC ATCTCGACGA AGCGTCGCGC GCGCGGCTGT GCCGCGTCGT CGAGGCCGCG
CATCTGCCGA CGCGCGCGCC GGATCTCGGC GACGCGCGTT ATGTCGAGCT GATGCGCGTC
GACAAGAAGG CCGAGGCGGG CGCGATCAAG TTCATACTGC TCAAACGCTT CGGCGAAACG
ATCATCACTC CGGCGCCCGA CGACGCCGTT CTCGCGACAC TGGCGGCAAC CACCCGGTAA
 
Protein sequence
MITVNVDLGE RAYPIHIGAD LIGRTELFAP HIAGASVTIV TNTTVEPLYG DTLRAALAPL 
GKRVSTVVLP DGEAYKNWET LNLIFDGLLE QHADRKTTLI ALGGGVIGDM TGFAAACYMR
GVPFIQVPTT LLSQVDSSVG GKTGINHPLG KNMIGAFYQP QAVIADIGAL STLPDRELAA
GVAEIVKTGA IADAAFFDWI EANVGALTRR DPDALAHAVK RSCEIKAGVV AADEREGGLR
AILNFGHTFG HAIEAGLGYG EWLHGEAVGC GMVMAADLSV RTGHLDEASR ARLCRVVEAA
HLPTRAPDLG DARYVELMRV DKKAEAGAIK FILLKRFGET IITPAPDDAV LATLAATTR
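
The gene and protein sequences listed here can be cross-checked against each other, and against the GC content field, with a small script. What follows is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, which does not matter here because the gene starts with ATG), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence as displayed in this record.
print(translate("ATGATTACCG TCAACGTCGA CCTGGGCGAG"))  # MITVNVDLGE
```

Pasting the full gene sequence into `translate` and `gc_percent` allows a comparison with the listed protein sequence and the GC content field.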