Gene BURPS1106A_A0550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0550 
Symbol 
ID4905976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp527278 
End bp528648 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content71% 
IMG OID640143656 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_001074586 
Protein GI126456121 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.77127 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCGCG CCGGCCGCGC GCGCCGTGTC ACCGCCCTTG CTCTCGCACT CGCGGCGCTT 
GCCGGTGCCG CCTCGCTCGC GGTATGTGCC GCGCCCGCTA GCACCACCAT GCTGCTGCCC
GGCGCGCCGC CCGCGCGTGT GGTCGATACG ATCGGCAACG GCACGCCGCA GGTCTCGTCC
AAGATCGATG CGTCCGCCGC GCGCTTCGTG CCGGACCCCA CGCTCGTCGC GCTCGGACGC
CGCATCTTCT TCGACACGCG CCTGTCCGAG CCGCGGGGGA TGTCGTGCGC CGGCTGCCAC
GATCCCGGCC GCGCGTTTGC GCCGACGCTG TCCGCGGCCG CGCTCGCGGG CCCCGGCGTG
CCGGAAGGCA GCCGGCCGGG ACGCTTCAGC CGGCGCAACG CGCCGTCGCT GCTCTACGTG
CGCTACGTGC CGCGCCGCCA CTTCTACCAG GATGACGACG CGCCCGCACC GTCGCCGTTC
GGCGGCCTGT TCAGCGATGG CCGCGCGGAC ACACTCGCCG AGCAGATCCG CGGGCCGCTG
TTCGACCCGA ACGAGATGAA CAACCGGTCG CCCGCCGCGC TGCTGCGCAA GGTCGACGCG
ACCGAACTCG CACCGGCGCT CGCCGCGCGC TTCGGCGACG GCGTGCGGCT CGACCCCGCA
CAGCTCGTGC GCGCGCTCGG CGCTTCGGTC GAGGCGTACC TGCAGAGCGA CGAGATGGCG
CCGTTCACAT CGCGCTTCGA CGCGTACCTG CGCCAGCGCA CGCCGCTTGA CGCACAGCAG
ATGCGCGGCC TCGCGCTGTT CAAGAATCCC GACAAAGGCA ACTGCATGAG CTGCCACACG
TTGTCGGATA CGTCGAGCCG CCCGGAACGG TCGCTGTTCA CCGATTTCGG TTACGACGCG
ATCGCCGTGC CGCGCAACCG CGCGCTGCCG GCCAATCGCG ACCCGCGCCA TTTCGACAAC
GGGCTGTGCG ACACCGCGCG CCGGCTGCGC TGGCCCGAAC CCGGCCAGTG GTGCGGCTAC
CTGCGCACGC CGAGCCTGCG CAACGTCGCG CTCAAGCAGA CCTTCATGCA CAACGGCGTG
TTCACGTCGC TGCGCGACGC GGTGGCGTTC TACAACACGC GCTCGACCGA TCCACGCCAC
TGGTATCACG GCGCCGCGAC GTTCGACGAC GTGCCGCCCG CGTACCGCGG CAACATCAAC
GTCAACTCGA CGCCGATGAA TCGCCGCCCC GGCACGCCGC CCGCGCTGAC CGAAGCGGAA
ATCGACGACC TCGTCGCGTT CCTCGGCACG CTGACCGACG CACGCTATGC CGCCGGCGCC
CCCCCTCATT TAAAGATTCA TGATTCGCAA GCCTTTACGA TTGCCCCATA A
 
Protein sequence
MSRAGRARRV TALALALAAL AGAASLAVCA APASTTMLLP GAPPARVVDT IGNGTPQVSS 
KIDASAARFV PDPTLVALGR RIFFDTRLSE PRGMSCAGCH DPGRAFAPTL SAAALAGPGV
PEGSRPGRFS RRNAPSLLYV RYVPRRHFYQ DDDAPAPSPF GGLFSDGRAD TLAEQIRGPL
FDPNEMNNRS PAALLRKVDA TELAPALAAR FGDGVRLDPA QLVRALGASV EAYLQSDEMA
PFTSRFDAYL RQRTPLDAQQ MRGLALFKNP DKGNCMSCHT LSDTSSRPER SLFTDFGYDA
IAVPRNRALP ANRDPRHFDN GLCDTARRLR WPEPGQWCGY LRTPSLRNVA LKQTFMHNGV
FTSLRDAVAF YNTRSTDPRH WYHGAATFDD VPPAYRGNIN VNSTPMNRRP GTPPALTEAE
IDDLVAFLGT LTDARYAAGA PPHLKIHDSQ AFTIAP