Gene BURPS1106A_3326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3326 
Symbolhmp 
ID4901181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3249671 
End bp3250870 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID640136552 
Productflavohemoprotein 
Protein accessionYP_001067563 
Protein GI126453088 
COG category[C] Energy production and conversion 
COG ID[COG1017] Hemoglobin-like flavoprotein
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000871976 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCG AGCAAATGGC CCGAGTGAAA GCGACCGCCC CGGTTCTCGC GGAGCACGGC 
GCGACGATCA CGAAGCACTT TTATCAGCGG ATGTTCGGGC GCCACCCCGA GCTGAAGAAC
GTTTTCAACC AGACGCACCA GAAGACGGGC AGCCAGCCGG AGACGCTCGC GAAGGCGGTC
TACGCATACG CGGCGAACAT CGACAATCTC GGCGCGCTCG GCGGCGCCGT GTCGCGGATC
GCGCACAAGC ACGCGAGCCT GAACATCCGG CCGGAGCACT ACCCGATCGT CGGCGAGAAC
CTGCTCGCCT CGATCGTCGA GGTGCTCGGC GACGCGGTGG ACGCGGACAC GCTCGAAGCG
TGGCGCATCG CGTACGGCCA GCTCGCCGCG ATCCTGATCG GCGCGGAGGC GAACCTGTAC
GAGAACGCCG CATGGAGCGG CTTCCGCCCG TTCAAGGTCG CGAAGAAGGT GCGCGAGAGC
GACGAGATCA CGTCGTTCTA CCTGACGCCC GCCGACGGCG GCGCGGCGCC CGGGTTCGAG
CCGGGCCAGT ACATCTCGGT GAAGCGTTTC GTCGGCGACA TGGGCGTCGA TCAGCCGCGT
CAGTACAGCC TGTCCGACGC GCCGCACGGC AAGTGGCTGC GCATCTCGGT CAAGCGCGAG
GCGGGGCACA GCGAGGCGGT GCCGGCGGGC AAGGTGTCGA CGCTGATGCA CGACGGCGTC
GACGTCGATT CGGTCGTCGA AGTCACCGCG CCGATGGGCG ATTTCACGCT GAACCGCCAT
GCGGCGACGC CCGTCGTGCT GATTTCGGGC GGCGTCGGGA TCACGCCGAT GATGTCGATG
GCGTCGACGC TCGTCGCAGC GGGCAGCGAG CGCGAAGTGC GTTTCCTGCA CGCGTGCCGC
GCGGCGAACG TGCATGCGTT CCGCGACTGG CTGAACGACA CGACGGACGC GCATCCGAAC
GTGAAGCGCG CGGTGTTCTA CGAGGTGGTC GGCCCGAACG ACCGCGTGGG CGTCGATCAC
GACCACGAAG GCCGGATCAC GCCGGCTGCG CTCGAGCGCC ACGCGCTCGT GCCGGACGCC
GATTACTACA TCTGCGGGCC GATCGCGTTC ATGAAGCAGC AGCGCGACGC GCTCGTTGCG
CTCGGCGTCG CGCCGGAGCG CGTGCAGACG GAAATCTTCG GTTCGGGCGC GCTCGAATGA
 
Protein sequence
MTAEQMARVK ATAPVLAEHG ATITKHFYQR MFGRHPELKN VFNQTHQKTG SQPETLAKAV 
YAYAANIDNL GALGGAVSRI AHKHASLNIR PEHYPIVGEN LLASIVEVLG DAVDADTLEA
WRIAYGQLAA ILIGAEANLY ENAAWSGFRP FKVAKKVRES DEITSFYLTP ADGGAAPGFE
PGQYISVKRF VGDMGVDQPR QYSLSDAPHG KWLRISVKRE AGHSEAVPAG KVSTLMHDGV
DVDSVVEVTA PMGDFTLNRH AATPVVLISG GVGITPMMSM ASTLVAAGSE REVRFLHACR
AANVHAFRDW LNDTTDAHPN VKRAVFYEVV GPNDRVGVDH DHEGRITPAA LERHALVPDA
DYYICGPIAF MKQQRDALVA LGVAPERVQT EIFGSGALE