Gene BURPS1710b_A1948 details


Gene Information

Locus tag: BURPS1710b_A1948
Symbol:
ID: 3693457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2366047
End bp: 2367471
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 71%
IMG OID: 637732202
Product: di-haem cytochrome c peroxidase family protein
Protein accession: YP_337099
Protein GI: 76819738
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
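
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2366047
END_BP = 2367471
GENE_LENGTH_BP = 1425
PROTEIN_LENGTH_AA = 474


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the values listed in the record.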


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0165417
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCACGG GTTCAATTGA CCGCCGTGCG CTGCGCGCGC ATCGCGTACG CGGGGTGTCG 
CGCGCCGGCC GCGCGCGCCG TGTCACCGCC CTTGCTCTCG CACTCGCGGC GCTTGCCGGT
GCCGCCTCGC TCGCGGTACG TGCCGCGCCC GCCAGCACCA CCATGCTGCT GCCCGGCGCG
CCGCCCGCGC GCGTGGTCGA TACGATCGGC AACGGCACGC CGCAGGTCTC GTCCAAGATC
GATGCGTCCG CCGCGCGCTT CGTGCCGGAC CCCACGCTCG TCGCGCTCGG ACGCCGCATC
TTCTTCGACA CGCGCCTGTC CGAGCCGCGG GGGATGTCGT GCGCCGGCTG CCACGATCCC
GGCCGCGCGT TTGCGCCGAC GCTGTCCGCA GCCGCGCTCG CGGGCCCCGG CGTGCCGGAA
GGCAGCCGGC CGGGACGCTT CAGCCGGCGC AACGCGCCGT CGCTGCTCTA CGTGCGCTAC
GTGCCGCGCC GCCACTTCTA CCAGGATGAC GACGCGCCCG CACCGTCGCC GTTCGGCGGC
CTGTTCAGCG ATGGCCGCGC GGACACACTC GCCGAGCAGA TCCGCGGGCC GCTGTTCGAC
CCGAACGAGA TGAACAACCG GTCGCCCGCC GCGCTGCTGC GCAAGGTCGA CGCGACCGAA
CTCGCACCGG CGCTCGCCGC GCGCTTCGGC GACGGCGTGC GGCTCGACCC CGCACAGCTC
GTGCGCGCGC TCGGCGCTTC GGTCGAGGCG TACCTGCAGA GCGACGAGAT GGCGCCGTTC
ACATCGCGCT TCGACGCGTA CCTGCGCCAG CGCACGCCGC TTGACGCACA GCAGATGCGC
GGCCTCGCGC TGTTCAAGAA TCCCGACAAA GGCAACTGCA TGAGCTGCCA CACGTTGTCG
GATACGTCGA GCCGCCCGGA ACGGTCGCTG TTCACCGATT TCGGTTACGA CGCGATCGCC
GTGCCGCGCA ACCGCGCGCT GCCGGCCAAT CGCGACCCGC GCCATTTCGA CAACGGGCTG
TGCGACACCG CGCGCCGGCT GCGCTGGCCC GAACCCGGCC AGTGGTGCGG CTACCTGCGC
ACGCCGAGCC TGCGCAACGT CGCGCTCAAG CAGACCTTCA TGCACAACGG CGTGTTCACG
TCGCTGCGCG ACGCGGTGGC GTTCTACAAC ACGCGCTCGA CCGATCCACG CCACTGGTAT
CACGGCGCCG CGACGTTCGA CGACGTGCCG CCCGCGTACC GCGGCAACAT CAACGTCAAC
TCGACGCCGA TGAATCGCCG CCCCGGCACG CCGCCCGCGC TGACCGAAGC GGAAATCGAC
GACCTCGTCG CGTTCCTCGG CACGCTGACC GACGCACGCT ATGCCGCCGG CGTCCCCCCT
CATTTAAAGA TTCATGATTC GCAAGCCTTT ACGATTGCCC CATAA
 
Protein sequence
MSTGSIDRRA LRAHRVRGVS RAGRARRVTA LALALAALAG AASLAVRAAP ASTTMLLPGA 
PPARVVDTIG NGTPQVSSKI DASAARFVPD PTLVALGRRI FFDTRLSEPR GMSCAGCHDP
GRAFAPTLSA AALAGPGVPE GSRPGRFSRR NAPSLLYVRY VPRRHFYQDD DAPAPSPFGG
LFSDGRADTL AEQIRGPLFD PNEMNNRSPA ALLRKVDATE LAPALAARFG DGVRLDPAQL
VRALGASVEA YLQSDEMAPF TSRFDAYLRQ RTPLDAQQMR GLALFKNPDK GNCMSCHTLS
DTSSRPERSL FTDFGYDAIA VPRNRALPAN RDPRHFDNGL CDTARRLRWP EPGQWCGYLR
TPSLRNVALK QTFMHNGVFT SLRDAVAFYN TRSTDPRHWY HGAATFDDVP PAYRGNINVN
STPMNRRPGT PPALTEAEID DLVAFLGTLT DARYAAGVPP HLKIHDSQAF TIAP
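
The protein sequence follows from the gene sequence under translation table 11, in which the GTG at the start of this CDS is read as methionine. The following is a minimal sketch of that relationship, not the database's own method: `translate` and `START_CODONS` are hypothetical names, and the start codon set is a simplification that covers only ATG, GTG and TTG.

```python
# Codon assignments in NCBI order (first, second, third base over T, C, A, G).
# Table 11 uses the same assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Simplified set of initiation codons (assumption: only the common ones).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in this record) is ignored.
    If is_cds_start is True, a recognised start codon is read as 'M'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = "GTGAGCACGG GTTCAATTGA CCGCCGTGCG CTGCGCGCGC ATCGCGTACG CGGGGTGTCG"
    print(translate(first_line))
```

Passing the full gene sequence instead of its first line should give the complete protein sequence. Set `is_cds_start=False` when translating an internal fragment so that a leading GTG or TTG is not read as methionine.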