Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1628 |
Symbol | |
ID | 4883421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1592742 |
End bp | 1594148 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640127556 |
Product | di-haem cytochrome c peroxidase |
Protein accession | YP_001058669 |
Protein GI | 126441446 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.171293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGCC GCTTGCCGCG ATACGCCCGC CAGCACCGTT CGTTCTTCGT CGCGCCGCGC GCGTTCGCGG CGGCCGCCGC GCTTGCCGCG GGCGTCGCCG CGTGTGACGC GAACGGGCCG GGCGCGAGCG CCGCCGCGGC CGTCGCGCCC GCTGCGCTCG CTGTCCCAGC CGCCTCCGCT GCCTCCGCTG CGCGTCCCGC GCCGCTCGCG CAGCCGGCCG CGCCCGCCGT CGTCGACAGT CAGCCGCAGA CGCGCGCGCA GGTGTACGAG GCGGTCAAGC AGATGACGGC GCTCGGCAGG CAGTTGTTCT TCGATCCTTC GCTGTCGGGC AGCGGCAAGC TCGCCTGCGC GTCGTGCCAC AGCCCGCAGC ACGCGTTCGG GCCGCCGAAC GCGTTGCCCG CGCAATTCGG CGGCGACGAT CTGCGCCAGC AGGGCTTTCG CGCCGTGCCG ACGCTCAAGT ACCTGCAGAA GGTGCCCGCG TTCAGCGAGC ACTATCACGA ATCGGACGAC GAGGGCGACG AGAGCGTCGA CGCCGGCCCG ACGGGCGGGC TCACGTGGGA CGGCCGCGTG GACAGCGGCG CCGAGCAGGC GCGCGCGCCG CTCACGTCGC CGTTCGAGAT GAACGGCACG CCCGAGAAGG TCGCGCGCGC GGTGCGGGCC GCGCCGTACG CGCCCGCGTT TCGCGCGGCG TTCGGCGCGC GCGTGCTCGA CGACGACCGC GCGACGTTCG AGGCGGTGCT GCAGGCGCTC GGCACGTTCG AGCAGGCGCC CGACGTGTTC TATCCGTACA CGAGCAAGTA CGACGCGTAC CTGGCGGGCC GCGCGCGGTT GACGCGCGCC GAGCTGCACG GGCTGCAGGT CTTCAACGAC GAGAAGAAGG GCAACTGCGC GAGCTGCCAC GTGAGCCGGC GCGGGCTCGA CGGCTCGCCG CCGCAGTTCA GCGATTTCGG CCTGATCGCG CTCGGCGTGC CGCGCAATCG CGCGCTCGCG GCGAATCGGA ATCCGAATTT TTACGACCTC GGCGCATGCG GGCCCGAGCG CCGGGACCTG AAGGGACGCG ACGAGTTCTG CGGGCTGTTC CGCACGCCGA CGCTGCGTAA CGTCGCGCTG AAAAAGACGT TCTTCCACAA CGGCGTCTAT CACTCGCTCG ACGACGTGCT GCGCTTCTAC GCCGAGCGCG ACACGCATCC GGAGAAGTTC TATCCGGTGA AGCGCGGCGT CGTTCAGAAG TTCGACGACT TGCCGAAGCG CTACTGGAAG AACCTGAACG ACGAGCCGCC GTTCGGGCGC AGGCGCGGCG ATCCGCCCGC GATGACCGAT GCGGAGATCC GGGACGTGAT CGCGTTCCTC GGCACGCTCA CCGACGGCTA CGATCCGCGC GCGAAGCCGG CAGGCGGCGC GCGCTGA
|
Protein sequence | MMRRLPRYAR QHRSFFVAPR AFAAAAALAA GVAACDANGP GASAAAAVAP AALAVPAASA ASAARPAPLA QPAAPAVVDS QPQTRAQVYE AVKQMTALGR QLFFDPSLSG SGKLACASCH SPQHAFGPPN ALPAQFGGDD LRQQGFRAVP TLKYLQKVPA FSEHYHESDD EGDESVDAGP TGGLTWDGRV DSGAEQARAP LTSPFEMNGT PEKVARAVRA APYAPAFRAA FGARVLDDDR ATFEAVLQAL GTFEQAPDVF YPYTSKYDAY LAGRARLTRA ELHGLQVFND EKKGNCASCH VSRRGLDGSP PQFSDFGLIA LGVPRNRALA ANRNPNFYDL GACGPERRDL KGRDEFCGLF RTPTLRNVAL KKTFFHNGVY HSLDDVLRFY AERDTHPEKF YPVKRGVVQK FDDLPKRYWK NLNDEPPFGR RRGDPPAMTD AEIRDVIAFL GTLTDGYDPR AKPAGGAR
|
| |