Gene BURPS1710b_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1805 
Symbol 
ID3691460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1968251 
End bp1969654 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content72% 
IMG OID637728261 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_333206 
Protein GI76812115 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGCT TGCCGCGATA CGCCCGCCAG CACCGTTCGT TCTTCGTCGC GCCGCGCGCG 
TTCGCGGCGG CCGCCGCGCT TGCCGCGGGC GTCGCCGCGT GTGACGCGAA CGGGCCGGGC
GCGAGCGCCG CCGCGGCCGT CGCGCCCGCT GCGCTCGCTG TCCCAGCCGC CTCCGCTGCC
TCCGCTGCGC GTCCCGCGCC GCTCGCGCAG CCGGCCGCGC CCGCCGTCGT CGACAGTCAG
CCGCAGACGC GCGCGCAGGT GTACGAGGCG GTCAAGCAGA TGACGGCGCT CGGCAGGCAG
TTGTTCTTCG ATCCTTCGCT GTCGGGCAGC GGCAAGCTCG CCTGCGCGTC GTGCCACAGC
CCGCAGCACG CGTTCGGGCC GCCGAACGCG CTGCCCGCGC AATTCGGCGG CGACGATCTG
CGCCAGCAGG GCTTTCGCGC CGTGCCGACG CTCAAATACC TGCAGAAGGT GCCCGCGTTC
AGCGAGCACT ATCACGAATC GGACGACGAG GGCGACGAGA GCGTCGACGC CGGCCCGACG
GGCGGGCTCA CGTGGGACGG CCGCGTGGAC AGCGGCGCCG AGCAGGCGCG CGCGCCGCTC
ACGTCGCCGT TCGAGATGAA CGGCACGCCC GAGAAGGTCG CGCGCGCGGT GCGGGCCGCG
CCGTACGCGC CCGCGTTTCG CGCGGCGTTC GGCGCGCGCG TGCTCGACGA CGACCGCGCG
ACGTTCGAGG CGGTGCTGCA GGCGCTCGGC ACGTTCGAGC AGGCGCCCGA CGTGTTCTAT
CCGTACACGA GCAAGTACGA CGCGTACCTG GCGGGCCGCG CGCGGTTGAC GCGCGCCGAG
CTGCACGGGC TGCAGGTCTT CAACGACGAG AAGAAGGGCA ACTGCGCGAG CTGCCACGTG
AGCCGGCGCG GGCTCGACGG CTCGCCGCCG CAGTTCAGCG ATTTCGGCCT GATCGCGCTC
GGCGTGCCGC GCAATCGCGC GCTCGCGGTG AATCGGAATC CGAATTTTTA CGACCTCGGC
GCATGCGGGC CCGAGCGCCG GGACCTGAAG GGGCGCGACG AGTTCTGCGG GCTGTTCCGC
ACGCCGACGC TGCGTAACGT CGCGCTGAAG AAGACGTTCT TCCACAACGG CGTCTATCAC
TCGCTCGACG ACGTGCTGCG CTTCTACGCC GAGCGCGACA CGCATCCGGA GAAGTTCTAT
CCGGTGAAGC GCGGCGTCGT TCAGAAGTTC GACGACTTGC CGAAGCGCTA CTGGAAGAAC
CTGAACGACG AGCCGCCGTT CGAGCGCAAG CGCGGCGATC CGCCCGCGAT GACCGATGCG
GAGATCCGGG ACGTGATCGC GTTCCTCGGC ACGCTCACCG ACGGCTACGA TCCGCGCGCG
AAGCCGGCAG GCGGCGCGCG CTGA
 
Protein sequence
MRRLPRYARQ HRSFFVAPRA FAAAAALAAG VAACDANGPG ASAAAAVAPA ALAVPAASAA 
SAARPAPLAQ PAAPAVVDSQ PQTRAQVYEA VKQMTALGRQ LFFDPSLSGS GKLACASCHS
PQHAFGPPNA LPAQFGGDDL RQQGFRAVPT LKYLQKVPAF SEHYHESDDE GDESVDAGPT
GGLTWDGRVD SGAEQARAPL TSPFEMNGTP EKVARAVRAA PYAPAFRAAF GARVLDDDRA
TFEAVLQALG TFEQAPDVFY PYTSKYDAYL AGRARLTRAE LHGLQVFNDE KKGNCASCHV
SRRGLDGSPP QFSDFGLIAL GVPRNRALAV NRNPNFYDLG ACGPERRDLK GRDEFCGLFR
TPTLRNVALK KTFFHNGVYH SLDDVLRFYA ERDTHPEKFY PVKRGVVQKF DDLPKRYWKN
LNDEPPFERK RGDPPAMTDA EIRDVIAFLG TLTDGYDPRA KPAGGAR