Gene Information |
Locus tag | BURPS668_0770 |
Symbol | |
ID | 4882530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 748084 |
End bp | 750972 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640126698 |
Product | cytochrome c oxidase, subunit I/subunit III |
Protein accession | YP_001057822 |
Protein GI | 126442084 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1; [COG1845] Heme/copper-type cytochrome/quinol oxidase, subunit 3 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
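The coordinate and length fields in this record are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(748084, 750972)
print(length, protein_length(length))  # prints: 2889 962
```

Both values match the Gene Length and Protein Length fields above.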
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0306921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CCGACACCGA TCGCCGCGCC CGCCTCGCCT ACCGCCGCGC GCCGGAGATC GGCGAAGCGG GCCCGGGCTC GGCCGCCGAG CGGCGGCTCG CGGCGCTGTG GGAAAGCCGC CCCGGCTGGC GGGGCTGGCT CGCGACGGTC GACCACAAGC GCATCGGGCT GCGCTACATC GTCACCGCGT TCGCGTTCCT GCTCGCGGGC GGCGCCGAGG CGCTCGTGAT GCGCATCCAG CTCGCGCAGC CCAACGGCAC GCTGCTGAAC CCCGAGCAAT ACAATCAGCT GTTCACGATG CACGGCGTGA CGATGATCTT CCTGTACGCG CTGCCCGTGC TGTCCGGCTT CGCGAACTAT CTGTGGCCGC TGATGCTCGG CTCGCGCGAC ATGGCGTTTC CGCGCCTGAA CGCGTTCTCG TACTGGGTGT TCGTCGCGGC GGGGGCGTTC CTCTACGCGA GCTTTCCGTT GGGCGAAGCG CCGAACGGCG GCTGGTTCAA CTACGTGCCG CTCACGACGC TCGACTACAG CCGCGGCCCG AACATCGACG TCTACGCGCT CGGCATGATC CTGCTCGGCG TCTCGACGAC GGTGGGCGCG GCGAACTTCG TCGTCACGCT GCTGCGCATG CGCGCGCCCG GCATGTCGAT CGACCGGCTG CCGATCATCG TCTGGGGCAC GCTCACCGCG TCGTTCGCGA ACCTGTTCGC GGTGCCCGCC GTGAGCCTCG CGTTCCTGCT GCTCTGGCTC GATCGCAACG TCGGCACGCA TTTCTTCGAC GTCGCGGCGG GCGGCCGCCC GCTGCTGTGG CAGCACCTGT TCTGGATGTT CGCGCACCCG TGGGTCTACG TGGTCGTGCT GCCCGCGATG GGCATCGTGT CCGACGCGAT GCCGACGTTC TGCCGGCGCC CGCTCGTCGC GTACGCGGCC GTCGCGGTAT CGACGGTCGC GACGATGCTG ATCGGCTTCG AGGTGTGGGT CCATCACATG TTCGCCACCG GCATCGCGCC GCTCGCGCTC GCGTTCTTCG GCGCGGCGAG CATGCTGATC TCGGTGCCGA GCGCGGTTGC GGTGTTCGCA TGGCTCGCGA CGATCTGGAC GGGCCGCCCG GTGTTCAAGA CGCCGTTCCT CTATTTCGCG GGCTTCGTGC TGATGTTCGT GATCGGCGGC GTGTCCGGCG TGATGACGGC CGCGGTGCCG CTCGACTGGC AATTGACCGA GACGTACTTC ATCGTCGCGC ATCTGCACTA CGTGCTGCTC GGCATCAACG TGTTTCCGGT GCTCGGCGGC ATCGCGTACT GGTTTCCGAA GTTCACCGGC CGGATGATGA ACGAGCGATT CGGCAAGCTC ACGTTCTTCG TGATCCTGAT CGGTTTCAAC GTGGGCTTCT TTCCGATGCA TCTGTCCGGC CTCTTCGGGA TGCCGCGGCG GATCTACACG TATCCGCCCG GCATGGGATG GGACACGACG AATCTCGTGA CGAGCCTCGG CTCGTTCGTG CTCGGCGCGG GCGTGCTGAT GTTCGTCGGC CACGCGCTGT GGAGCATGAA GCGCGGCGCG CGCGCGAGCG CCGATCCGTG GGGCGCGGCC GGCCTCGAAT GGTCGGTGAG CTCGCCCGCG CCCGCGTACA ACTTCGCCGC GCTGCCGATC GTCGCGTCGC GCCATCCGCT GTGGGAGGCG CAACTCGCGC CGCACGCGCG CCGCTCGAGC CTGCGGCGCG GCTATCTGCT CGCCGACGGA CGCGAGGCGC TCGGCGTCAC GCCGCTGTCC GGCAGGCCGG ACGTGATCCT GAAAATGCCC 
GACGACACGT CCTCGCCGCT CGCGCTCGCG CTCTTTGCGA CGCTCGCGTG CGCGGGGCTC GCGCTTCGCT CGCCGGGCAT CGTCGCGGCG GGCGCGCTCG GCTGCGCGGC CGCGATGCTC GCATGGCTAT GGCCGCGGCG CTCGCTCGGG CAGCGCGAGC CGCCGCTCGC TGCGGCGGCG CCGGCGCGCG CGCCGAACGG CACGGATGTC GCGCACACGG CGCACGCCGA CGGCGGGGAA CATACGGGAC GCGCGGCTGT CGCGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCGACGAACG CGACGAACGC GACGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCGACGAACG CGACGAACGC GACGAACGCG ACGAACGCCG CGAACGCCGC GAACGCCGCG GGCGCCACAT GCGCGGCGTA CGCGCGCGAG CTGCCCGTCG GCAGCGCGGG CGAACACGCG GGGGGCTGGT GGGGCATGGC CACGCTGATC GCGACCGAGG CGGCGCTGTT CGGCTATCTG ATCTTCAGCT ACTTCTACCT GCAAAGCCAG ACACCTGAGC GATGGCCGCC GGAAGGGCTG CCGAAGCTCG CGCTCGCGTC GTTCAACACC GCGGTGCTCG TGTCGAGCAG CGTGTTCGTC TGGCTCGCCG ACCGGCTCGT CGCGCGCCGG CGCCCGCGCG CGGCGAGCGT CGCGCTCGCC GTGGCGATCG CGCTCGGCGC CGCGTTCGCG CTGATCCAAT GGCACGAGTG GCGCGGCCAT CCGTACGGAA TGACCGCGCA TCTGTACGGC TCGCTGTATT TCACGATCAC GGGCTTTCAC CTCGCGCACG TCGTCGTGGG GCTCGCCGTG CTCGCGGCGC TCGCCTTCTG GACGATGCGC GGCTACTTCG ACGACAGGCG GCGCGCGGCG CTGTCGATCG GCGGGCTCTA CTGGCACTTC GTCGACATCG TCTGGCTCTT CATCTTCACG ACGCTCTACC TCACGCCGAT CTGGCTCAGG GGGCGATGA
|
Protein sequence | MSTADTDRRA RLAYRRAPEI GEAGPGSAAE RRLAALWESR PGWRGWLATV DHKRIGLRYI VTAFAFLLAG GAEALVMRIQ LAQPNGTLLN PEQYNQLFTM HGVTMIFLYA LPVLSGFANY LWPLMLGSRD MAFPRLNAFS YWVFVAAGAF LYASFPLGEA PNGGWFNYVP LTTLDYSRGP NIDVYALGMI LLGVSTTVGA ANFVVTLLRM RAPGMSIDRL PIIVWGTLTA SFANLFAVPA VSLAFLLLWL DRNVGTHFFD VAAGGRPLLW QHLFWMFAHP WVYVVVLPAM GIVSDAMPTF CRRPLVAYAA VAVSTVATML IGFEVWVHHM FATGIAPLAL AFFGAASMLI SVPSAVAVFA WLATIWTGRP VFKTPFLYFA GFVLMFVIGG VSGVMTAAVP LDWQLTETYF IVAHLHYVLL GINVFPVLGG IAYWFPKFTG RMMNERFGKL TFFVILIGFN VGFFPMHLSG LFGMPRRIYT YPPGMGWDTT NLVTSLGSFV LGAGVLMFVG HALWSMKRGA RASADPWGAA GLEWSVSSPA PAYNFAALPI VASRHPLWEA QLAPHARRSS LRRGYLLADG REALGVTPLS GRPDVILKMP DDTSSPLALA LFATLACAGL ALRSPGIVAA GALGCAAAML AWLWPRRSLG QREPPLAAAA PARAPNGTDV AHTAHADGGE HTGRAAVANA TNATNATNAT NATNATNATN ATNATNATNA TNATNATNAT NATNATNATN ATNATNATNA TNAANAANAA GATCAAYARE LPVGSAGEHA GGWWGMATLI ATEAALFGYL IFSYFYLQSQ TPERWPPEGL PKLALASFNT AVLVSSSVFV WLADRLVARR RPRAASVALA VAIALGAAFA LIQWHEWRGH PYGMTAHLYG SLYFTITGFH LAHVVVGLAV LAALAFWTMR GYFDDRRRAA LSIGGLYWHF VDIVWLFIFT TLYLTPIWLR GR
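The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own method, of how such a string could be cleaned, its GC content computed, and the coding sequence translated. It assumes the gene sequence is given 5' to 3' on the coding strand and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard 64-letter table is used. Alternative start codons, which table 11 permits, are not handled; the gene here starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises ValueError if a base other than A, C, G or T is encountered.
    Trailing bases that do not form a full codon are ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        a, b, c = (BASES.index(base) for base in s[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * a + 4 * b + c]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and last nine bases of the gene sequence above.
print(translate("ATGAGCACCG CCGACACCGA"))  # prints: MSTADT
print(translate("GGGCGATGA"))              # prints: GR
print(gc_content("ATGAGCACCG CCGACACCGA"))  # prints: 0.65
```

The translated fragments agree with the start (MSTADT...) and end (...GR) of the protein sequence listed above. Applying `gc_content` to the full gene sequence would be one way to check the reported GC content field.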
|