Gene BURPS1106A_3774 details


Gene Information

Locus tag: BURPS1106A_3774
Symbol: hemB
ID: 4901481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3687318
End bp: 3688406
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 65%
IMG OID: 640137000
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_001068004
Protein GI: 126455399
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
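
The length fields in this record agree with one another if the start and end positions are read as 1-based, inclusive coordinates and the last codon of the CDS is a stop codon. Both readings are assumptions, not something the record states, and the helper names below are illustrative only. A minimal Python sketch of that check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3687318, 3688406)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Under these assumptions, the 1089 bp gene length corresponds to 363 codons: 362 residues plus one stop codon.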


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.342004
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTCGATC GCTATCATAT ACCATTGTCT CAAAAGTTTA CCGAAGGGAA CTGTAAGGTT 
TCGTCGGTTA CCTATCCCAG CCATTCCGCC ATGAGCATCC ATCCCCTGCA CCGCCCGCGC
CGCATGCGCC GCGACGATTT CTCGCGACGC CTGATGCGCG AAAATATCCT CACCACGAAC
GATCTGATCT ACCCGGTGTT CGTCGTCGAA GGCACGAACG TGCGCCAGCC GGTGCCGTCG
ATGCCGGGCG TCGAGCGCGT GTCGATCGAT CTGCTGATGG GTGTCGCCGA GCAATGCGTC
GAGCTCGGCG TGCCGGTCCT GTCGCTCTTT CCGGCCATCG AGCCGTCGCT GAAGACGCCC
GACGGCCGCG AAGCGGCCAA TCCCGAAGGG CTGATCCCGC GTGCGGTACG CGAGCTGAAG
CGCCGCTTCC CCGAACTCGG CGTGCTGACC GACGTCGCGC TCGATCCGTA CACGAGCCAC
GGCCAGGACG GCGTGCTCGA CGAGGCCGGC TATGTGCTCA ACGACGAAAC GCTCGAGATT
CTCGTCGAGC AGGCGCGCGC GCAGGCCGAA GCGGGTGTCG ACATCGTCGC GCCGTCGGAC
ATGATGGACG GGCGCATCGG CGCGGTGCGC GAGATGCTCG AGCGTGAAGG CCACATCTAC
ACGCGGATCA TGGCCTACTC GGCGAAGTAC GCGTCGGCGT TCTACGGCCC GTTCCGCGAC
GCGGTCGGCT CCGCGTCGAA TCTCGGCAAG GGCAACAAGA TGACCTACCA GATGGACCCC
GCGAACAGCG ACGAGGCGCT GCGCGAAGTG CGCCTCGACA TCGACGAGGG CGCGGACATG
GTCATGGTGA AGCCCGGCAT GCCGTATCTC GACATCGTGC GCCGCGTGAA GGACGAATTC
CGCTATCCGA CCTACGTCTA TCAGGTGAGC GGCGAATACG CGATGCTGAA GGCCGCCGCG
CAGAACGGCT GGCTCGATCA CGACAAAGTC GTGCTCGAAT CGCTGCTCGC GTTCAAGCGC
GCGGGCGCGG ACGGCATTCT CACGTACTTC GCGCTCGACG CGGCGCGGCT GCTGCGCGCG
CAGAAGTAA
 
Protein sequence
MLDRYHIPLS QKFTEGNCKV SSVTYPSHSA MSIHPLHRPR RMRRDDFSRR LMRENILTTN 
DLIYPVFVVE GTNVRQPVPS MPGVERVSID LLMGVAEQCV ELGVPVLSLF PAIEPSLKTP
DGREAANPEG LIPRAVRELK RRFPELGVLT DVALDPYTSH GQDGVLDEAG YVLNDETLEI
LVEQARAQAE AGVDIVAPSD MMDGRIGAVR EMLEREGHIY TRIMAYSAKY ASAFYGPFRD
AVGSASNLGK GNKMTYQMDP ANSDEALREV RLDIDEGADM VMVKPGMPYL DIVRRVKDEF
RYPTYVYQVS GEYAMLKAAA QNGWLDHDKV VLESLLAFKR AGADGILTYF ALDAARLLRA
QK
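
The gene sequence begins with TTG while the protein sequence begins with M. This fits translation table 11, where TTG is one of the alternative start codons and is read as methionine when it initiates translation. The Python sketch below illustrates this on the first 30 nucleotides of the gene. The function name, the hand-built codon table, and the start-codon set are assumptions made for illustration; they are not how the database derived the protein sequence.

```python
# Codon table built from the standard amino-acid string (bases in TCAG order).
# Translation table 11 uses the same amino-acid assignments as the standard
# code and differs in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons listed for translation table 11 (assumed here, not from the record).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiating TTG is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon: end of the protein
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in this record.
print(translate_cds("TTGCTCGATC GCTATCATAT ACCATTGTCT"))  # MLDRYHIPLS
```

The output matches the first ten residues of the protein sequence. The second TTG in this stretch (codon 9) is internal and is therefore read as leucine, not methionine.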