Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_0944 |
Symbol | |
ID | 3691547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 985553 |
End bp | 988405 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637727400 |
Product | cytochrome C oxidase subunit I |
Protein accession | YP_332357 |
Protein GI | 76809005 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 [COG1845] Heme/copper-type cytochrome/quinol oxidase, subunit 3 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.358792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACCG CCGACACCGA TCGCCGCGCC CGCCTCGCCT ACCGCCGCGC GCCGGAGATC GGCGAAGCGG GCCCGGGCTC GGCCGCCGAG CGGCGGCTCG CGGCGCTGTG GGAAAGCCGC CCCGGCTGGC GGGGCTGGCT CGCGACGGTC GACCACAAGC GCATCGGGCT GCGCTACATC GTCACCGCGT TCGCGTTCCT GCTCGCGGGC GGCGCCGAGG CGCTCGTGAT GCGCATCCAG CTCGCGCAGC CCAACGGCAC GCTGCTGAAC CCCGAGCAAT ACAACCAGCT GTTCACGATG CACGGCGTGA CGATGATCTT CCTGTACGCG CTGCCCGTGC TGTCCGGCTT CGCGAACTAT CTGTGGCCGC TGATGCTCGG CTCGCGCGAC ATGGCGTTTC CGCGCCTGAA CGCGTTCTCG TACTGGGTGT TCGTCGCGGC GGGAGCGTTC CTCTACGCGA GCTTTCCGTT GGGCGAAGCG CCGAACGGCG GCTGGTTCAA CTACGTGCCG CTCACGACGC TCGACTACAG CCGCGGCCCG AACATCGACG TCTACGCGCT CGGCATGATC CTGCTCGGCG TCTCGACGAC GGTGGGCGCG GCGAACTTCG TCGTCACGCT GCTGCGCATG CGCGCGCCCG GCATGTCGAT CGACCGGCTG CCGATCATCG TCTGGGGCAC GCTCACCGCG TCGTTCGCGA ACCTGTTCGC GGTGCCCGCC GTGAGCCTCG CGTTCCTGCT GCTCTGGCTC GATCGCAACG TCGGCACGCA TTTCTTCGAC GTCGCGGCGG GCGGCCGCCC GCTGCTGTGG CAGCACCTGT TCTGGATGTT CGCGCACCCG TGGGTCTACG TGGTCGTGCT GCCCGCGATG GGCATCGTGT CCGACGCGAT GCCGACGTTC TGCCGGCGCC CGCTCGTCGC GTACGCGGCC GTCGCGGTAT CGACGGTCGC GACGATGCTG ATCGGCTTCG AGGTGTGGGT CCATCACATG TTCGCCACCG GCATCGCGCC GCTCGCGCTC GCGTTCTTCG GCGCGGCGAG CATGCTGATC TCGGTGCCGA GCGCGGTTGC GGTGTTCGCA TGGCTCGCGA CGATCTGGAC GGGCCGCCCG GTGTTCAAGA CGCCGTTCCT CTATTTCGCG GGCTTCGTGC TGATGTTCGT GATCGGCGGC GTGTCCGGCG TGATGACGGC CGCGGTGCCG CTCGACTGGC AATTGACCGA GACTTACTTC ATCGTCGCGC ACCTGCACTA CGTGCTGCTC GGCATCAACG TGTTTCCGGT GCTCGGCGGC ATCGCGTACT GGTTTCCGAA GTTCACCGGC CGGATGATGA ACGAGCGATT CGGCAAGCTC ACGTTCTTCG TGATCCTGAT CGGTTTCAAC GTGGGCTTCT TTCCGATGCA TCTGTCCGGC CTCTTCGGGA TGCCGCGGCG GATCTACACG TATCCGCCCG GCATGGGATG GGACACGACG AATCTCGTGA CGAGCCTCGG CTCGTTCGTG CTCGGCGCGG GCGTGCTGAT GTTCGTCGGC CACGCGCTGT GGAGCATGAA GCGCGGCGCG CGCGCGAGCG CCGATCCGTG GGGCGCGGCC GGCCTCGAAT GGTCGGTGAG CTCGCCCGCG CCCGCGTACA ACTTCGCCGC GCTGCCGATC GTCGCGTCGC GCCATCCGCT GTGGGAGGCG CAACTCGCGC CGCACGCGCG CCGCTCGAGC CTGCGGCGCG GCTATCTGCT CGCCGACGGA CGCGAGGCGC TCGGCGTCAC GCCGCTGTCC GGCAGGCCGG ACGTGATCCT GAAAATGCCC GACGACACGT CCTCGCCGCT CGCGCTCGCG CTCTTTGCGA CGCTCGCGTG CGCGGGGCTC GCGCTTCGCT CGCCGGGCAT CGTCGCGGCG GGCGCGCTCG GCTGCGCGGC CGCGATGCTC GCATGGCTAT GGCCGCGGCG CTCGCTCGGG CAGCGCGAGC CGCCGCTCGC TGCGGCGGCG CCGGCGCGCG CGCCGAACGG CACGGATGTC GCGCACACGG CGCACGCCGA CGGCGGGGAA CATACGGGAC GCGCGGCTGT CGCGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCGACGAACG CGACGAACGC GACGAACGCG ACGAACGCGA CGAACGCGAC GAACGCGACG AACGCGACGA ACGCGACGAA CGCGACGAAC GCCGCGAACG CCGCGAACGC CGCGGGCGCC ACATGCGCGG CGTACGCGCG CGAGCTGCCT GTCGGCAGCG CGGGCGAACA CGCGGGGGGC TGGTGGGGCA TGGCCACGCT GATCGCGACC GAGGCGGCGC TGTTCGGCTA TCTGATCTTC AGCTACTTCT ATCTGCAAAG CCAGACACCT GAGCGATGGC CGCCGGAAGG GCTGCCGAAG CTCGCGCTCG CGTCGTTCAA CACCGCGGTG CTCGTGTCGA GCAGCGTGTT CGTCTGGCTC GCCGACCGGC TCGTCGCGCG CCGGCGCCCG CGCGCGGCGA GCGTCGCGCT CGCCGTGGCG ATCGCGCTCG GCGCCGCGTT CGCGCTGATC CAATGGCACG AGTGGCGCGG CCATCTGTAC GGAATGACCG CACATCTGTA CGGCTCGCTG TATTTCACGA TCACGGGCTT TCACCTCGCG CACGTCGTCG TGGGGCTCGC CGTGCTCGCG GCGCTCGCCT TCTGGACGAT GCGCGGCTAC TTCGACGACA GGCGGCGCGC GGCGCTGTCG ATCGGCGGGC TCTACTGGCA CTTCGTCGAC ATCGTCTGGC TCTTCATCTT CACGACGCTC TACCTCACGC CGATCTGGCT CAGGGGGCGA TGA
|
Protein sequence | MSTADTDRRA RLAYRRAPEI GEAGPGSAAE RRLAALWESR PGWRGWLATV DHKRIGLRYI VTAFAFLLAG GAEALVMRIQ LAQPNGTLLN PEQYNQLFTM HGVTMIFLYA LPVLSGFANY LWPLMLGSRD MAFPRLNAFS YWVFVAAGAF LYASFPLGEA PNGGWFNYVP LTTLDYSRGP NIDVYALGMI LLGVSTTVGA ANFVVTLLRM RAPGMSIDRL PIIVWGTLTA SFANLFAVPA VSLAFLLLWL DRNVGTHFFD VAAGGRPLLW QHLFWMFAHP WVYVVVLPAM GIVSDAMPTF CRRPLVAYAA VAVSTVATML IGFEVWVHHM FATGIAPLAL AFFGAASMLI SVPSAVAVFA WLATIWTGRP VFKTPFLYFA GFVLMFVIGG VSGVMTAAVP LDWQLTETYF IVAHLHYVLL GINVFPVLGG IAYWFPKFTG RMMNERFGKL TFFVILIGFN VGFFPMHLSG LFGMPRRIYT YPPGMGWDTT NLVTSLGSFV LGAGVLMFVG HALWSMKRGA RASADPWGAA GLEWSVSSPA PAYNFAALPI VASRHPLWEA QLAPHARRSS LRRGYLLADG REALGVTPLS GRPDVILKMP DDTSSPLALA LFATLACAGL ALRSPGIVAA GALGCAAAML AWLWPRRSLG QREPPLAAAA PARAPNGTDV AHTAHADGGE HTGRAAVANA TNATNATNAT NATNATNATN ATNATNATNA TNATNATNAT NATNATNATN AANAANAAGA TCAAYARELP VGSAGEHAGG WWGMATLIAT EAALFGYLIF SYFYLQSQTP ERWPPEGLPK LALASFNTAV LVSSSVFVWL ADRLVARRRP RAASVALAVA IALGAAFALI QWHEWRGHLY GMTAHLYGSL YFTITGFHLA HVVVGLAVLA ALAFWTMRGY FDDRRRAALS IGGLYWHFVD IVWLFIFTTL YLTPIWLRGR
|
| |