Gene BURPS1106A_1077 details


Gene Information

Locus tag: BURPS1106A_1077
Symbol: hemC
ID: 4902015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1058658
End bp: 1059647
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 73%
IMG OID: 640134307
Product: porphobilinogen deaminase
Protein accession: YP_001065357
Protein GI: 126455102
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
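
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1058658
END_BP = 1059647
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the listed values: the span is 990 bp, and 990 / 3 gives 330 codons, one of which is the stop.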


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.941716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTCCG AGACTCTTCC GGCCGAGCTG CCCGCGACGC TGACGATCGC GTCGCGCGAG 
AGCCGCCTCG CCATGTGGCA AGCCGAGCAT GTGCGTGATG CGCTGCGCAA ATTATATCCA
GCTTGTGACG TGAAAATCCT CGGGATGACG ACGCGCGGCG ATCAAATTCT CGATCGCACG
CTATCGAAGG TCGGCGGCAA GGGCCTGTTC GTCAAGGAGC TCGAGAGCGC GCTCGCCGAC
GGCCGCGCGG ATCTCGCCGT GCATTCGCTG AAGGACGTGC CGATGGCGCT GCCCGAAGGC
TTCGCGCTCG CGGCGGTGAT GAGCCGCGAG GATCCGCGCG ACGCGTTCGT GTCGAACGAT
TACGCGTCGC TCGACGCGCT GCCGGCGGGC GCCGTCGTCG GCACGTCGAG CCTGCGCCGC
GAGGCGATGC TGCGCGCGCG CCATCCGCGG CTCGACGTGC GGCCGCTGCG CGGCAATCTC
GACACGCGGC TCGCGAAGCT CGACCGCGGC GATTACGCGG CGATCATCCT CGCCGCCGCG
GGCCTCAAGC GTCTCGGCCT CGCCGCGCGG ATCCGCGCGC TGCTCGACGT CGACGACAGC
CTGCCCGCCG CGGGGCAGGG CGCGCTCGGC ATCGAGATCG CCGCGCGCCG CGCCGATGTC
GCCGCGTGGC TCGCGCCGCT GCATGATCAT GCGAGCGCGC TCGCGGTCGA GGCCGAACGC
GCGGTGTCGC GCGCGCTCGG CGGCAGTTGC GAGGTGCCGC TCGCCGCGCA CGCGGTGTGG
CGCGGCGGCG AGCTGCATCT GACGGGCAGC GTGTCGACGA CGGACGGCGC GCGCGTGCTC
GCCGCGCATG CGCACGCACG CGCGGCGACG GCCGCCGATG CGCTCGCGCT CGGCCGCAGG
GTGTCCGACG CGCTCGAGCG GCAAGGCGCG CGCGCGATCG TCGACGCGCT CGTCGCGGCG
AGCGCGCAAG CGCAAAAGGG CGGCGCGTGA
 
Protein sequence
MNSETLPAEL PATLTIASRE SRLAMWQAEH VRDALRKLYP ACDVKILGMT TRGDQILDRT 
LSKVGGKGLF VKELESALAD GRADLAVHSL KDVPMALPEG FALAAVMSRE DPRDAFVSND
YASLDALPAG AVVGTSSLRR EAMLRARHPR LDVRPLRGNL DTRLAKLDRG DYAAIILAAA
GLKRLGLAAR IRALLDVDDS LPAAGQGALG IEIAARRADV AAWLAPLHDH ASALAVEAER
AVSRALGGSC EVPLAAHAVW RGGELHLTGS VSTTDGARVL AAHAHARAAT AADALALGRR
VSDALERQGA RAIVDALVAA SAQAQKGGA
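
The gene and protein sequences above are related by translation, and the GC content field is the fraction of G and C bases in the gene sequence. The sketch below shows one way to reproduce both from the space-separated 10-base blocks. It is an illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), so alternative start codons are not handled here. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
# Codon assignments shared by the standard code and translation table 11,
# laid out in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "ATGAACTCCG AGACTCTTCC GGCCGAGCTG CCCGCGACGC TGACGATCGC GTCGCGCGAG"
    last_line = "AGCGCGCAAG CGCAAAAGGG CGGCGCGTGA"
    print(translate(first_line))  # MNSETLPAELPATLTIASRE
    print(translate(last_line))   # SAQAQKGGA
```

The two printed fragments match the first 20 and last 9 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.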