Gene BURPS1106A_3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3757 
SymbolaroB 
ID4902987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3669227 
End bp3670306 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content68% 
IMG OID640136983 
Product3-dehydroquinate synthase 
Protein accessionYP_001067987 
Protein GI126455013 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACCG TCAACGTCGA CCTGGGCGAG CGCGCCTATC CGATCCACAT CGGCGCCGAT 
CTGATCGGCC GCACCGAGCT TTTCGCGCCG CACATCGCGG GCGCATCCGT CACGATCGTC
ACGAACACCA CCGTCGAGCC GCTCTACGGC GACACGCTGC GCGCCGCGCT CGCGCCGCTC
GGCAAGCGCG TGTCGACCGT CGTCCTGCCC GACGGCGAAG CGTACAAGAA CTGGGAAACG
CTCAATCTGA TCTTCGACGG CCTGCTCGAG CAGCACGCCG ATCGCAAGAC GACGCTGATC
GCGCTCGGCG GCGGCGTGAT CGGCGACATG ACGGGCTTCG CGGCCGCATG CTATATGCGC
GGCGTGCCGT TCATCCAGGT GCCGACGACG CTCCTGTCGC AGGTTGATTC GTCGGTCGGC
GGCAAGACGG GCATCAACCA TCCGCTCGGC AAGAACATGA TCGGCGCGTT CTATCAGCCG
CAGGCGGTGA TCGCCGATAT CGGCGCGCTG TCGACGCTGC CCGATCGCGA GCTTGCCGCG
GGCGTCGCCG AGATCGTCAA GACGGGCGCG ATCGCCGATG CCGCGTTCTT CGACTGGATC
GAGGCGAACG TGGGCGCGCT CACTCGCCGC GATCCCGACG CGCTCGCGCA CGCGGTCAAG
CGCTCGTGCG AGATCAAGGC GGGCGTCGTC GCGGCGGACG AGCGCGAGGG CGGTCTGCGC
GCGATCCTCA ATTTTGGCCA TACGTTCGGG CACGCGATCG AAGCGGGGCT CGGCTACGGC
GAGTGGCTGC ACGGCGAGGC GGTGGGCTGC GGCATGGTGA TGGCGGCCGA CCTGTCGGTG
CGAACCGGCC ATCTCGACGA AGCGTCGCGC GCGCGGCTGT GCCGCGTCGT CGAGGCCGCG
CATCTGCCGA CGCGCGCGCC GGATCTCGGC GACGCGCGTT ATGTCGAGCT GATGCGCGTC
GACAAGAAGG CCGAGGCGGG CGCGATCAAG TTCATACTGC TCAAACGCTT CGGCGAAACG
ATCATCACTC CGGCGCCCGA CGACGCCGTT CTCGCGACAC TGGCGGCAAC CACCCGGTAA
 
Protein sequence
MITVNVDLGE RAYPIHIGAD LIGRTELFAP HIAGASVTIV TNTTVEPLYG DTLRAALAPL 
GKRVSTVVLP DGEAYKNWET LNLIFDGLLE QHADRKTTLI ALGGGVIGDM TGFAAACYMR
GVPFIQVPTT LLSQVDSSVG GKTGINHPLG KNMIGAFYQP QAVIADIGAL STLPDRELAA
GVAEIVKTGA IADAAFFDWI EANVGALTRR DPDALAHAVK RSCEIKAGVV AADEREGGLR
AILNFGHTFG HAIEAGLGYG EWLHGEAVGC GMVMAADLSV RTGHLDEASR ARLCRVVEAA
HLPTRAPDLG DARYVELMRV DKKAEAGAIK FILLKRFGET IITPAPDDAV LATLAATTR