Gene BURPS1106A_1649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1649 
Symbol 
ID4899417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1601266 
End bp1602681 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content72% 
IMG OID640134879 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_001065920 
Protein GI126452479 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGCC GCTTGCCGCG ATACGCCCGC CAGCACCGTT CGTTCTTCGT CGCGCCGCGC 
GCGTTCGCGG CGGCCGCCGC GCTTGCCGCG GGCGTCGCCG CGTGTGACGC GAACGGGCCG
GGCGCGGGCG CCGCCGCGGC CGTCGCGCCC GCTGCGCTCG CTGTCCCAGC CGCCTCCGCT
GCCTCCGCTG CCTCCGCTGC GCGTCCCGCG CCTCTCGCGC AGCCGGCCGC GCCCGCCGTC
GTCGACAGTC AGCCGCAGAC GCGCGCGCAG GTGTACGAGG CGGTCAAGCA GATGACGGCG
CTCGGCAGGC AGTTGTTCTT CGATCCTTCG CTGTCGGGCA GCGGCAAGCT CGCCTGCGCG
TCGTGCCACA GCCCGCAGCA CGCGTTCGGG CCGCCGAACG CGTTGCCCGC GCAATTCGGC
GGCGACGATC TGCGCCAGCA GGGCTTTCGC GCCGTGCCGA CGCTCAAATA CCTGCAGAAG
GTGCCCGCGT TCAGCGAGCA CTATCACGAA TCGGACGACG AGGGCGACGA GAGCGTCGAC
GCCGGCCCGA CGGGCGGGCT CACGTGGGAC GGCCGCGCGG ACAGCGGCGC CGAGCAGGCG
CGCGCGCCGC TCACGTCGCC GTTCGAGATG AACGGCACGC CCGAGAAGGT CGCGCGCGCG
GTGCGGGCCG CGCCGTACGC GCCCGCGTTT CGCGCGGCGT TCGGCGCGCG CGTGCTCGAC
GACGACCGCG CGACGTTCGA GGCGGTGCTG CAGGCGCTCG GCACGTTCGA GCAGGTGCCC
GACGTGTTCT ATCCGTACAC GAGCAAGTAC GACGCGTACC TGGCGGGCCG CGCGCGGTTG
ACGCGCGCCG AGCTGCACGG GCTGCAGGTC TTCAACGACG AGAAGAAGGG CAACTGCGCG
AGCTGCCACG TGAGCCGGCG CGGGCTCGAC GGCTCGCCGC CGCAGTTCAG CGATTTCGGC
CTGATCGCGC TCGGCGTGCC GCGCAATCGC GCGCTCGCGG TGAATCGGAA TCCGAATTTT
TACGACCTCG GCGCATGCGG GCCCGAGCGC CGGGACCTGA AGGGGCGCGA CGAGTTCTGC
GGGCTGTTCC GCACGCCGAC GCTGCGTAAC GTCGCGCTGA AGAAGACGTT CTTCCACAAC
GGCGTCTATC ACTCGCTCGA CGACGTGCTG CGCTTCTACG CCGAGCGCGA CACGCATCCG
GAGAAGTTCT ATCCGGTGAA GCGCGGCGTC GTTCAGAAGT TCGACGACTT GCCGAAGCGC
TACTGGAAGA ACCTGAACGA CGAGCCGCCG TTCGAGCGCA AGCGCGGCGA TCCGCCCGCG
ATGACCGATG CGGAGATCCG GGACGTGATC GCGTTCCTCG GCACGCTCAC CGACGGCTAC
GATCCGCGCG CGAAGCCGGC AGGCGGCGCG CGCTGA
 
Protein sequence
MMRRLPRYAR QHRSFFVAPR AFAAAAALAA GVAACDANGP GAGAAAAVAP AALAVPAASA 
ASAASAARPA PLAQPAAPAV VDSQPQTRAQ VYEAVKQMTA LGRQLFFDPS LSGSGKLACA
SCHSPQHAFG PPNALPAQFG GDDLRQQGFR AVPTLKYLQK VPAFSEHYHE SDDEGDESVD
AGPTGGLTWD GRADSGAEQA RAPLTSPFEM NGTPEKVARA VRAAPYAPAF RAAFGARVLD
DDRATFEAVL QALGTFEQVP DVFYPYTSKY DAYLAGRARL TRAELHGLQV FNDEKKGNCA
SCHVSRRGLD GSPPQFSDFG LIALGVPRNR ALAVNRNPNF YDLGACGPER RDLKGRDEFC
GLFRTPTLRN VALKKTFFHN GVYHSLDDVL RFYAERDTHP EKFYPVKRGV VQKFDDLPKR
YWKNLNDEPP FERKRGDPPA MTDAEIRDVI AFLGTLTDGY DPRAKPAGGA R