Gene BURPS668_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1071 
SymbolhemC 
ID4883458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1049698 
End bp1050687 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content73% 
IMG OID640126999 
Productporphobilinogen deaminase 
Protein accessionYP_001058121 
Protein GI126439305 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCCG AGACTCTTCC GGCCGAGCTG CCCGCGACGC TGACGATCGC GTCGCGCGAG 
AGCCGCCTCG CCATGTGGCA AGCCGAGCAT GTGCGTGATG CGCTGCGCAA ATTATATCCA
GCTTGTGACG TGAAAATCCT CGGGATGACG ACGCGCGGCG ATCAAATTCT CGATCGCACG
CTATCGAAGG TCGGCGGCAA GGGTCTGTTC GTCAAGGAGC TCGAGAGCGC GCTTGCCGAC
GGCCGCGCGG ATCTCGCCGT GCATTCGCTG AAGGACGTGC CGATGGTGCT GCCCGAAGGC
TTCGCGCTCG CGGCGGTGAT GAGCCGCGAG GATCCGCGCG ACGCGTTCGT GTCGAACGAT
TACGCGTCGC TCGACGCGCT GCCGGCGGGC GCCGTCGTCG GCACGTCGAG CCTGCGCCGC
GAGGCGATGC TGCGCGCGCG CCATCCGCGG CTCGACGTGC GGCCGCTGCG CGGCAATCTC
GACACGCGGC TCGCGAAGCT CGACCGCGGC GATTACGCGG CGATCATCCT CGCCGCCGCG
GGCCTCAAGC GTCTCGGCCT CGCCGCGCGG ATCCGCGCGC TGCTCGACGT CGACGACAGC
CTGCCCGCCG CGGGGCAGGG CGCGCTCGGC ATCGAGATCG CCGCGCGCCG CGCCGATGTC
GCCGCGTGGC TCGCGCCGCT GCACGATCAT GCGAGCGCGC TCGCGGTCGA GGCCGAACGC
GCGGTGTCGC GCGCGCTCGG CGGCAGTTGC GAGGTGCCGC TCGCCGCGCA CGCGGTGTGG
CGCGGCGGCG AGCTGCATCT GACGGGCAGC GTGTCGACGA CGGACGGCGC GCGCGTGCTC
GCCGCGCATG CGCACGCACG CGCGGCGACG GCCGCCGATG CGCTCGCGCT CGGCCGCAGG
GTGTCCGACG CGCTCGAGCG GCAAGGCGCG CGCGCGATCG TCGACGCGCT CGTCGCGGCG
AGCGCGCAAG CGCAAAAGGG CGGCGCGTGA
 
Protein sequence
MNSETLPAEL PATLTIASRE SRLAMWQAEH VRDALRKLYP ACDVKILGMT TRGDQILDRT 
LSKVGGKGLF VKELESALAD GRADLAVHSL KDVPMVLPEG FALAAVMSRE DPRDAFVSND
YASLDALPAG AVVGTSSLRR EAMLRARHPR LDVRPLRGNL DTRLAKLDRG DYAAIILAAA
GLKRLGLAAR IRALLDVDDS LPAAGQGALG IEIAARRADV AAWLAPLHDH ASALAVEAER
AVSRALGGSC EVPLAAHAVW RGGELHLTGS VSTTDGARVL AAHAHARAAT AADALALGRR
VSDALERQGA RAIVDALVAA SAQAQKGGA