Gene BURPS668_3716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3716 
SymbolhemB 
ID4882188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3640199 
End bp3641197 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content66% 
IMG OID640129644 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001060720 
Protein GI126440613 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.874214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCC ATCCCCTGCA CCGCCCGCGC CGCATGCGCC GCGACGATTT CTCGCGACGC 
CTGATGCGCG AAAATATCCT CACCACGAAC GATCTGATCT ACCCGGTGTT CGTCGTCGAA
GGCACGAACG TGCGCCAGCC GGTGCCGTCG ATGCCGGGCG TCGAGCGCGT GTCGATCGAT
CTGCTGATGG GTGTCGCCGA GCAATGCGTC GAGCTCGGCG TGCCGGTCCT GTCGCTCTTT
CCGGCCATCG AGCCGTCGCT GAAGACGCCC GACGGCCGCG AAGCGGCCAA TCCCGAAGGG
CTGATCCCGC GTGCGGTACG CGAGCTGAAG CGCCGCTTCC CCGAACTCGG CGTGCTGACC
GACGTCGCGC TCGATCCGTA CACGAGCCAC GGCCAGGACG GCGTGCTCGA CGAGGCCGGC
TATGTGCTCA ACGACGAAAC GCTCGAGATT CTCGTCGAGC AGGCGCGCGC GCAGGCCGAA
GCGGGCGTCG ACATCGTCGC GCCGTCGGAC ATGATGGACG GGCGCATCGG CGCGGTGCGC
GAGATGCTCG AGCGTGAAGG CCACATCTAT ACGCGGATCA TGGCCTACTC GGCGAAGTAC
GCGTCGGCGT TCTACGGCCC GTTCCGCGAC GCGGTCGGCT CCGCGTCGAA TCTCGGCAAG
GGCAACAAGA TGACCTACCA GATGGACCCC GCGAACAGCG ACGAGGCGCT GCGCGAAGTG
CGCCTCGACA TCGACGAGGG CGCGGACATG GTCATGGTGA AGCCCGGCAT GCCGTATCTC
GACATCGTGC GCCGCGTGAA GGACGAATTC CGCTATCCGA CCTACGTCTA TCAGGTGAGC
GGCGAATACG CGATGCTGAA GGCCGCCGCG CAGAACGGCT GGCTCGATCA CGACAAGGTC
GTGCTCGAAT CGCTGCTCGC GTTCAAGCGC GCGGGCGCGG ACGGCATTCT CACGTACTTC
GCGCTCGACG CGGCGCGGCT GCTGCGCGCG CAGAAGTAA
 
Protein sequence
MSIHPLHRPR RMRRDDFSRR LMRENILTTN DLIYPVFVVE GTNVRQPVPS MPGVERVSID 
LLMGVAEQCV ELGVPVLSLF PAIEPSLKTP DGREAANPEG LIPRAVRELK RRFPELGVLT
DVALDPYTSH GQDGVLDEAG YVLNDETLEI LVEQARAQAE AGVDIVAPSD MMDGRIGAVR
EMLEREGHIY TRIMAYSAKY ASAFYGPFRD AVGSASNLGK GNKMTYQMDP ANSDEALREV
RLDIDEGADM VMVKPGMPYL DIVRRVKDEF RYPTYVYQVS GEYAMLKAAA QNGWLDHDKV
VLESLLAFKR AGADGILTYF ALDAARLLRA QK