Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_0782 |
Symbol | |
ID | 4901426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 763262 |
End bp | 766186 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640134012 |
Product | cytochrome c oxidase, subunit III:cytochrome c oxidase, subunit I |
Protein accession | YP_001065064 |
Protein GI | 126454136 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 [COG1845] Heme/copper-type cytochrome/quinol oxidase, subunit 3 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CCGACACCGA TCGCCGCGCC CGCCTCGCCT ACCGCCGCGC GCCGGAGATC GGCGAAGCGG GCCCGGGCTC GGCCGCCGAG CGGCGGCTCG CGGCGCTGTG GGAAAGCCGC CCCGGCTGGC GGGGCTGGCT CGCGACGGTC GACCACAAGC GCATCGGGCT GCGCTACATC GTCACCGCGT TCGCGTTCCT GCTCGCGGGC GGCGCCGAGG CGCTCGTGAT GCGCATCCAG CTCGCGCAGC CCAACGGCAC GCTGCTGAAC CCCGAGCAAT ACAACCAGCT GTTCACGATG CACGGCGTGA CGATGATCTT CCTGTACGCG CTGCCCGTGC TGTCCGGCTT CGCGAACTAT CTGTGGCCGC TGATGCTCGG CTCGCGCGAC ATGGCGTTTC CGCGCCTGAA CGCGTTCTCG TACTGGGTGT TCGTCGCGGC GGGAGCGTTC CTCTACGCGA GCTTTCCGTT GGGCGAAGCG CCGAACGGCG GCTGGTTCAA CTACGTGCCG CTCACGACGC TCGACTACAG CCGCGGCCCG AACATCGACG TCTACGCGCT CGGCATGATC CTGCTCGGCG TCTCGACGAC GGTGGGCGCG GCGAACTTCG TCGTCACGCT GCTGCGCATG CGCGCGCCCG GCATGTCGAT CGACCAGCTG CCGATCATCG TCTGGGGCAC GCTCACCGCG TCGTTCGCGA ACCTGTTCGC GGTGCCCGCC GTGAGCCTCG CGTTCCTGCT GCTCTGGCTC GATCGCAACG TCGGCACGCA TTTCTTCGAC GTCGCGGCGG GCGGCCGCCC GCTGCTGTGG CAGCACCTGT TCTGGATGTT CGCGCACCCG TGGGTCTACG TGGTCGTGCT GCCCGCGATG GGCATCGTGT CCGACGCGAT GCCGACGTTC TGCCGGCGCC CGCTCGTCGC GTACGCGGCC GTCGCGGTAT CGACGGTCGC GACGATGCTG ATCGGCTTCG AGGTGTGGGT CCATCACATG TTCGCCACCG GCATCGCGCC GCTCGCGCTC GCGTTCTTCG GCGCGGCGAG CATGCTGATC TCGGTGCCGA GCGCGGTTGC GGTGTTCGCA TGGCTCGCGA CGATCTGGAC GGGCCGCCCG GTGTTCAAGA CGCCGTTCCT CTATTTCGCG GGCTTCGTGC TGATGTTCGT GATCGGCGGC GTGTCCGGCG TGATGACGGC CGCGGTGCCG CTCGACTGGC AATTGACCGA GACTTACTTC ATCGTCGCGC ACCTGCACTA CGTGCTGCTC GGCATCAACG TGTTTCCGGT GCTCGGCGGC ATCGCGTACT GGTTTCCGAA GTTCACCGGC CGGATGATGA ACGAGCGATT CGGCAAGCTC ACGTTCTTCG TGATCCTGAT CGGTTTCAAC GTGGGCTTCT TTCCGATGCA TCTGTCCGGC CTCTTCGGGA TGCCGCGGCG GATCTACACG TATCCGCCCG GCATGGGATG GGACACGACG AATCTCGTGA CGAGCCTCGG CTCGTTCGTG CTCGGCGCGG GCGTGCTGAT GTTCGTCGGC CACGCGCTGT GGAGCATGAA GCGCGGCGCG CGCGCGAGCG CCGATCCGTG GGGCGCGGCC GGCCTCGAAT GGTCGGTGAG CTCGCCCGCG CCCGCGTACA ACTTCGCCGC GCTGCCGATC GTCGCGTCGC GCCATCCGCT GTGGGAGGCG CAACTCGCGC CGCACGCGCG CCGCTCGAGC CTGCGGCGCG GCTATCTGCT CGCCGACGGA CGCGAGGCGC TCGGCGTCAC GCCGCTGTCC GGCAGGCCGG ACGTGATCCT GAAAATGCCC GACGACACGT CCTCGCCGCT CGCGCTCGCG CTCTTTGCGA CGCTCGCGTG CGCGGGGCTC GCGCTTCGCT CGCCGGGCAT CGTCGCGGCG GGCGCGCTCG GCTGCGCGGC CGCGATGCTC GCATGGCTAT GGCCGCGGCG CTCGCTCGGG CAGCGCGAGC CGCCGCTCGC TGCGGCGGTG CCAGCGCGCG CGCCGAACGG CACGGATGTC GCGCACACGG CGCACGCCGA CGGCGGGGAA CATACGGGAC GCGCGGCTGT CGCGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCGACGAACG CGACGAACGC GACGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCGACGAACG CGACGAACGC GACGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCCGCGGGCG CCACATGCGC GGCGTACGCG CGCGAGCTGC CTGTCGGCAG CGCGGGCGAA CACGCGGGGG GCTGGTGGGG CATGGCCACG CTGATCGCGA CCGAGGCGGC GCTGTTCGGC TATCTGATCT TCAGCTACTT CTACCTGCAA AGCCAGACAC CTGAGCGATG GCCGCCGGAA GGGCTGCCGA AGCTCGCGCT CGCGTCGTTC AACACCGCGG TGCTCGTGTC GAGCAGCGTG TTCGTCTGGC TCGCCGACCG GCTCGTCGCG CGCCGGCGCC CGCGCGCGGC GAGCGTCGCG CTCGCCGTGG CGATCGCGCT CGGCGCCGCG TTCGCGCTGA TCCAATGGCA CGAGTGGCGC GGCCATCCGT ACGGAATGAC CGCGCATCTG TACGGCTCGC TGTATTTCAC GATCACGGGC TTTCACCTCG CGCACGTCGT CGTGGGGCTC GCCGTGCTCG CGGCGCTCGC CTTCTGGACG ATGCGCGGCT ACTTCGACGA CAGGCGGCGC GCGGCGCTGT CGATCGGCGG GCTCTACTGG CACTTCGTCG ACATCGTCTG GCTCTTCATC TTCACGACGC TCTACCTCAC GCCGATCTGG CTCAGGGGGC GATGA
|
Protein sequence | MSTADTDRRA RLAYRRAPEI GEAGPGSAAE RRLAALWESR PGWRGWLATV DHKRIGLRYI VTAFAFLLAG GAEALVMRIQ LAQPNGTLLN PEQYNQLFTM HGVTMIFLYA LPVLSGFANY LWPLMLGSRD MAFPRLNAFS YWVFVAAGAF LYASFPLGEA PNGGWFNYVP LTTLDYSRGP NIDVYALGMI LLGVSTTVGA ANFVVTLLRM RAPGMSIDQL PIIVWGTLTA SFANLFAVPA VSLAFLLLWL DRNVGTHFFD VAAGGRPLLW QHLFWMFAHP WVYVVVLPAM GIVSDAMPTF CRRPLVAYAA VAVSTVATML IGFEVWVHHM FATGIAPLAL AFFGAASMLI SVPSAVAVFA WLATIWTGRP VFKTPFLYFA GFVLMFVIGG VSGVMTAAVP LDWQLTETYF IVAHLHYVLL GINVFPVLGG IAYWFPKFTG RMMNERFGKL TFFVILIGFN VGFFPMHLSG LFGMPRRIYT YPPGMGWDTT NLVTSLGSFV LGAGVLMFVG HALWSMKRGA RASADPWGAA GLEWSVSSPA PAYNFAALPI VASRHPLWEA QLAPHARRSS LRRGYLLADG REALGVTPLS GRPDVILKMP DDTSSPLALA LFATLACAGL ALRSPGIVAA GALGCAAAML AWLWPRRSLG QREPPLAAAV PARAPNGTDV AHTAHADGGE HTGRAAVANA TNATNATNAT NATNATNATN ATNATNATNA TNATNATNAT NATNATNATN ATNATNATNA TNATNATNAT NATNATNATN AAGATCAAYA RELPVGSAGE HAGGWWGMAT LIATEAALFG YLIFSYFYLQ SQTPERWPPE GLPKLALASF NTAVLVSSSV FVWLADRLVA RRRPRAASVA LAVAIALGAA FALIQWHEWR GHPYGMTAHL YGSLYFTITG FHLAHVVVGL AVLAALAFWT MRGYFDDRRR AALSIGGLYW HFVDIVWLFI FTTLYLTPIW LRGR
|
| |