Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_0929 |
Symbol | choS |
ID | 3689707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 966667 |
End bp | 968427 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637727385 |
Product | cholesterol oxidase |
Protein accession | YP_332342 |
Protein GI | 76810583 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TCCTCGGAGA TCCGGCGTCG CGTCGCGAGC GCCGCGCGTT TCTCGGCGAC GTCGCGCGTC TCGCGGGCGC GGGCATCATC GCCGGATGGA CGCCGATTCG ACCGATTGCC GCGCACGCGC AAGCCGCGGG CGCGGCGCCG CCGAATTTTC CCGCCGGCAT CGCGCTCTAC AAACAGGCGT TTCGCAACTG GAGCGGCGAG ATCGCGGTCG CGGACCTGTG GACCGCCGCG CCCGCGACGC CCGCCGACGT CGTCGCGATC GTCAACTGGG CGGCCGACAA CGGTTACCGC GCGCGGCCGC TCGGCCATAT GCACAACTGG TCGCCGCTCA CGGTCGCCGC GAACGGCGCG CACGAGCGCA CGCTGCTCGT CGACACGACG AGGCATCTGT GCGCCGTATC CGTCGACCCG TCGACGACGC CCGCGCGCGT CGTCGCACAA GCGGGCGTGT CGCTCGATAC GCTGCTCGCG ACGCTCGAGC AGCACGGGCT CGGGTTGACC GCGGCGCCCG CGCCCGGCGA CATCACGCTC GGCGGCGCGC TCGCGATCGG CGCGCACGGC ACCGCGCTGC CGGCGGCGAA CGAGACGCGG CCGCCCGGGC ATACGTACGG CTCGCTCAGC AACGCGGTGC TCGCGCTGAC CGCGGTGGTC TATGACGCGG CGTCGGGCCG CTACGCGTTG CGCACCTTCG ATCGCACCGA TCCGGACATC GGCCCGTTCC TCGCGCACGT CGGCCGCGCG TTCGTCGTCG AGGCGACGTT GCAGGTCGGC GCGAATCAAC GGCTGCAATG CGAGAGCTTC GTCGATATCC CCGCCGCCGA GCTATTCGCC GCGGCGGGCA CGCGGGGCCG CACCGTCGAA TCGTTCGTGC AGCGCTCGGG CCGCATCGAG GCGATCTGGT TTCCATTCAC CGACTATCCG TGGCTGAAGG TCTGGACCGT CCGGCCGAAC CGGCCGTCGG GCGCGCGTGT CGTCGAAGAG CCGTACAACT ACCCGTTTTC CGATTCGATA TCGCGAGAGC TGTCCGATCT GGTCAGCCGC ATCGTACTGA ACGGCGAAAT CCAGTTGGCA CCGCTCTTCG GAAAAACGCA GTACACGATT GCGTATCTCG GCCTCACCAA CATCTTCCGG CCACTGACGA ACCTGTGGGG CTGGTCGCGC TCGGTGCTGC ACTACGTGCG CCCGACGACG CTGCGCGTGA CCGCGAACGG CTACGCGGTG CTCACGCGTC GCGAGAACGT GCAGCGCGCG ATCAACGAGT TCGTCGGCGC GTATCGGCAG CGGGTCGCCG CCTATCGCGC GGCGGGCCGC TACCCGATGA ACGGCCCCAT CGAGATCCGC GTGACGGGCG TCGATACGCC CGACGACGTC GGCCGCGGCG CGGTGCCGCC GTCGTTATCC GCGATCCGCC CGCGTCCCGA CCATCCCGAA TGGAACGCCG CGATCTGGTT CGACATCCTG ACGATACCGG GCACGCCCGA TGCGAACCGC TTCTATCGCG AGATCGAGCA GTGGATGCTG TCGAACTACA GCGGCGATTA CGCGACCGTG CGCCCGGAAT GGTCGAAGGG CTGGGGCTAC GCGGACACCG CCGCATGGAG CGACGACGCG ATGCTCCGCA CGACGATCCC GGACCTGTTC CGCCAGGGGC TCTCGTCCGC CGACGACTGG GACGCCGCGC TGCGCACGCT CGAGCGCTAC GATCCGCGGC GCGTGTTCTC GTCGCCGCTG CTCGACCGGC TGATGGGCTG A
|
Protein sequence | MKKILGDPAS RRERRAFLGD VARLAGAGII AGWTPIRPIA AHAQAAGAAP PNFPAGIALY KQAFRNWSGE IAVADLWTAA PATPADVVAI VNWAADNGYR ARPLGHMHNW SPLTVAANGA HERTLLVDTT RHLCAVSVDP STTPARVVAQ AGVSLDTLLA TLEQHGLGLT AAPAPGDITL GGALAIGAHG TALPAANETR PPGHTYGSLS NAVLALTAVV YDAASGRYAL RTFDRTDPDI GPFLAHVGRA FVVEATLQVG ANQRLQCESF VDIPAAELFA AAGTRGRTVE SFVQRSGRIE AIWFPFTDYP WLKVWTVRPN RPSGARVVEE PYNYPFSDSI SRELSDLVSR IVLNGEIQLA PLFGKTQYTI AYLGLTNIFR PLTNLWGWSR SVLHYVRPTT LRVTANGYAV LTRRENVQRA INEFVGAYRQ RVAAYRAAGR YPMNGPIEIR VTGVDTPDDV GRGAVPPSLS AIRPRPDHPE WNAAIWFDIL TIPGTPDANR FYREIEQWML SNYSGDYATV RPEWSKGWGY ADTAAWSDDA MLRTTIPDLF RQGLSSADDW DAALRTLERY DPRRVFSSPL LDRLMG
|
| |