Gene Information |
Locus tag | BURPS1106A_1649 |
Symbol | |
ID | 4899417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1601266 |
End bp | 1602681 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640134879 |
Product | di-haem cytochrome c peroxidase |
Protein accession | YP_001065920 |
Protein GI | 126452479 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.353955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGCC GCTTGCCGCG ATACGCCCGC CAGCACCGTT CGTTCTTCGT CGCGCCGCGC GCGTTCGCGG CGGCCGCCGC GCTTGCCGCG GGCGTCGCCG CGTGTGACGC GAACGGGCCG GGCGCGGGCG CCGCCGCGGC CGTCGCGCCC GCTGCGCTCG CTGTCCCAGC CGCCTCCGCT GCCTCCGCTG CCTCCGCTGC GCGTCCCGCG CCTCTCGCGC AGCCGGCCGC GCCCGCCGTC GTCGACAGTC AGCCGCAGAC GCGCGCGCAG GTGTACGAGG CGGTCAAGCA GATGACGGCG CTCGGCAGGC AGTTGTTCTT CGATCCTTCG CTGTCGGGCA GCGGCAAGCT CGCCTGCGCG TCGTGCCACA GCCCGCAGCA CGCGTTCGGG CCGCCGAACG CGTTGCCCGC GCAATTCGGC GGCGACGATC TGCGCCAGCA GGGCTTTCGC GCCGTGCCGA CGCTCAAATA CCTGCAGAAG GTGCCCGCGT TCAGCGAGCA CTATCACGAA TCGGACGACG AGGGCGACGA GAGCGTCGAC GCCGGCCCGA CGGGCGGGCT CACGTGGGAC GGCCGCGCGG ACAGCGGCGC CGAGCAGGCG CGCGCGCCGC TCACGTCGCC GTTCGAGATG AACGGCACGC CCGAGAAGGT CGCGCGCGCG GTGCGGGCCG CGCCGTACGC GCCCGCGTTT CGCGCGGCGT TCGGCGCGCG CGTGCTCGAC GACGACCGCG CGACGTTCGA GGCGGTGCTG CAGGCGCTCG GCACGTTCGA GCAGGTGCCC GACGTGTTCT ATCCGTACAC GAGCAAGTAC GACGCGTACC TGGCGGGCCG CGCGCGGTTG ACGCGCGCCG AGCTGCACGG GCTGCAGGTC TTCAACGACG AGAAGAAGGG CAACTGCGCG AGCTGCCACG TGAGCCGGCG CGGGCTCGAC GGCTCGCCGC CGCAGTTCAG CGATTTCGGC CTGATCGCGC TCGGCGTGCC GCGCAATCGC GCGCTCGCGG TGAATCGGAA TCCGAATTTT TACGACCTCG GCGCATGCGG GCCCGAGCGC CGGGACCTGA AGGGGCGCGA CGAGTTCTGC GGGCTGTTCC GCACGCCGAC GCTGCGTAAC GTCGCGCTGA AGAAGACGTT CTTCCACAAC GGCGTCTATC ACTCGCTCGA CGACGTGCTG CGCTTCTACG CCGAGCGCGA CACGCATCCG GAGAAGTTCT ATCCGGTGAA GCGCGGCGTC GTTCAGAAGT TCGACGACTT GCCGAAGCGC TACTGGAAGA ACCTGAACGA CGAGCCGCCG TTCGAGCGCA AGCGCGGCGA TCCGCCCGCG ATGACCGATG CGGAGATCCG GGACGTGATC GCGTTCCTCG GCACGCTCAC CGACGGCTAC GATCCGCGCG CGAAGCCGGC AGGCGGCGCG CGCTGA
Protein sequence | MMRRLPRYAR QHRSFFVAPR AFAAAAALAA GVAACDANGP GAGAAAAVAP AALAVPAASA ASAASAARPA PLAQPAAPAV VDSQPQTRAQ VYEAVKQMTA LGRQLFFDPS LSGSGKLACA SCHSPQHAFG PPNALPAQFG GDDLRQQGFR AVPTLKYLQK VPAFSEHYHE SDDEGDESVD AGPTGGLTWD GRADSGAEQA RAPLTSPFEM NGTPEKVARA VRAAPYAPAF RAAFGARVLD DDRATFEAVL QALGTFEQVP DVFYPYTSKY DAYLAGRARL TRAELHGLQV FNDEKKGNCA SCHVSRRGLD GSPPQFSDFG LIALGVPRNR ALAVNRNPNF YDLGACGPER RDLKGRDEFC GLFRTPTLRN VALKKTFFHN GVYHSLDDVL RFYAERDTHP EKFYPVKRGV VQKFDDLPKR YWKNLNDEPP FERKRGDPPA MTDAEIRDVI AFLGTLTDGY DPRAKPAGGA R
| |
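The coordinate, length, and sequence fields in this record can be cross-checked against one another. The following is a minimal sketch of such a check, not part of the database itself. It assumes 1-based inclusive coordinates and a single stop codon at the end of the CDS. It also assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The names `BASES`, `AMINO_ACIDS`, `CODON_TABLE`, and `translate` are illustrative.

```python
# Illustrative consistency check for the record above (hypothetical helper names).

# Codon table in the usual TCAG ordering; assumed valid for translation
# table 11, whose amino-acid assignments are taken to match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Values copied from the "Start bp" and "End bp" rows.
start_bp, end_bp = 1601266, 1602681

# Assumes 1-based inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumes one stop codon that is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1416, matching "Gene Length"
print(protein_length)  # 471, matching "Protein Length"

# First 30 bases of "Gene sequence" translate to the first 10 residues
# of "Protein sequence".
print(translate("ATGATGCGCC GCTTGCCGCG ATACGCCCGC"))  # MMRRLPRYAR
```

Passing the full "Gene sequence" value to `translate` and comparing the result with the "Protein sequence" value, after removing the spaces in both, extends the same check to the whole record.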