Gene BURPS668_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1628 
Symbol 
ID4883421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1592742 
End bp1594148 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content72% 
IMG OID640127556 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_001058669 
Protein GI126441446 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.171293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGCC GCTTGCCGCG ATACGCCCGC CAGCACCGTT CGTTCTTCGT CGCGCCGCGC 
GCGTTCGCGG CGGCCGCCGC GCTTGCCGCG GGCGTCGCCG CGTGTGACGC GAACGGGCCG
GGCGCGAGCG CCGCCGCGGC CGTCGCGCCC GCTGCGCTCG CTGTCCCAGC CGCCTCCGCT
GCCTCCGCTG CGCGTCCCGC GCCGCTCGCG CAGCCGGCCG CGCCCGCCGT CGTCGACAGT
CAGCCGCAGA CGCGCGCGCA GGTGTACGAG GCGGTCAAGC AGATGACGGC GCTCGGCAGG
CAGTTGTTCT TCGATCCTTC GCTGTCGGGC AGCGGCAAGC TCGCCTGCGC GTCGTGCCAC
AGCCCGCAGC ACGCGTTCGG GCCGCCGAAC GCGTTGCCCG CGCAATTCGG CGGCGACGAT
CTGCGCCAGC AGGGCTTTCG CGCCGTGCCG ACGCTCAAGT ACCTGCAGAA GGTGCCCGCG
TTCAGCGAGC ACTATCACGA ATCGGACGAC GAGGGCGACG AGAGCGTCGA CGCCGGCCCG
ACGGGCGGGC TCACGTGGGA CGGCCGCGTG GACAGCGGCG CCGAGCAGGC GCGCGCGCCG
CTCACGTCGC CGTTCGAGAT GAACGGCACG CCCGAGAAGG TCGCGCGCGC GGTGCGGGCC
GCGCCGTACG CGCCCGCGTT TCGCGCGGCG TTCGGCGCGC GCGTGCTCGA CGACGACCGC
GCGACGTTCG AGGCGGTGCT GCAGGCGCTC GGCACGTTCG AGCAGGCGCC CGACGTGTTC
TATCCGTACA CGAGCAAGTA CGACGCGTAC CTGGCGGGCC GCGCGCGGTT GACGCGCGCC
GAGCTGCACG GGCTGCAGGT CTTCAACGAC GAGAAGAAGG GCAACTGCGC GAGCTGCCAC
GTGAGCCGGC GCGGGCTCGA CGGCTCGCCG CCGCAGTTCA GCGATTTCGG CCTGATCGCG
CTCGGCGTGC CGCGCAATCG CGCGCTCGCG GCGAATCGGA ATCCGAATTT TTACGACCTC
GGCGCATGCG GGCCCGAGCG CCGGGACCTG AAGGGACGCG ACGAGTTCTG CGGGCTGTTC
CGCACGCCGA CGCTGCGTAA CGTCGCGCTG AAAAAGACGT TCTTCCACAA CGGCGTCTAT
CACTCGCTCG ACGACGTGCT GCGCTTCTAC GCCGAGCGCG ACACGCATCC GGAGAAGTTC
TATCCGGTGA AGCGCGGCGT CGTTCAGAAG TTCGACGACT TGCCGAAGCG CTACTGGAAG
AACCTGAACG ACGAGCCGCC GTTCGGGCGC AGGCGCGGCG ATCCGCCCGC GATGACCGAT
GCGGAGATCC GGGACGTGAT CGCGTTCCTC GGCACGCTCA CCGACGGCTA CGATCCGCGC
GCGAAGCCGG CAGGCGGCGC GCGCTGA
 
Protein sequence
MMRRLPRYAR QHRSFFVAPR AFAAAAALAA GVAACDANGP GASAAAAVAP AALAVPAASA 
ASAARPAPLA QPAAPAVVDS QPQTRAQVYE AVKQMTALGR QLFFDPSLSG SGKLACASCH
SPQHAFGPPN ALPAQFGGDD LRQQGFRAVP TLKYLQKVPA FSEHYHESDD EGDESVDAGP
TGGLTWDGRV DSGAEQARAP LTSPFEMNGT PEKVARAVRA APYAPAFRAA FGARVLDDDR
ATFEAVLQAL GTFEQAPDVF YPYTSKYDAY LAGRARLTRA ELHGLQVFND EKKGNCASCH
VSRRGLDGSP PQFSDFGLIA LGVPRNRALA ANRNPNFYDL GACGPERRDL KGRDEFCGLF
RTPTLRNVAL KKTFFHNGVY HSLDDVLRFY AERDTHPEKF YPVKRGVVQK FDDLPKRYWK
NLNDEPPFGR RRGDPPAMTD AEIRDVIAFL GTLTDGYDPR AKPAGGAR