Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A3161 |
Symbol | shc |
ID | 4904514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 3081565 |
End bp | 3083538 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640146264 |
Product | squalene-hopene cyclase |
Protein accession | YP_001077190 |
Protein GI | 126456557 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA TGACCGAAAT GCATACGCTC GACGCAACCG CCGCGCCGGC CGGCCTCGAC GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA CGGCCACTGG GTCTACGAGC TCGAAGCCGA TTCGACGATC CCGGCCGAAT ACGTGCTGCT CGTCCACTAT CTCGGCGAGG CGCCGAATGT CGAGCTCGAG CAGAAGATCG CGCGCTATCT GCGCCGGATT CAGCAGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGTG CGCCGAACAT TAGCGCGAGC GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAGA TGTCGAACGT GTTCACGCGG ATTCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTGTCGA AGGTGTCGTA CTGGGCGCGC ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC GGCGTGCGCA TCGACGAGCT GTTCAAGGGC GCACCCGTCA GCACCGGCCT GCTGCCGAAG CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTCGACGG GGTGCTGCGT CTCGTCGACG GCCTCTTCCC GCGCTATACG CGCGAGCGCG CGATCCGCCA GGCGGTCGCG TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCGATGGCC AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAAG ATCATCCGAA CCGCGCGATC GCGCGCCGCT CGATCGAGAA GCTGCTCGTC GTCGGCGAGC AAGAGGCGTA TTGCCAGCCG TGCCTGTCGC CGGTATGGGA CACGTCGCTT GCCGCGCACG CGCTGCTCGA GACGGGCGAC GCGCGCGCGC GCGAAGCGGC GGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCCGGCGG CTGGGCGTTC CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGACACGG CGGTCGTCGC GATGGCGATG GACCGCGTCG CGAAGCTCGA CCGGACCGAC GCGTATCGCG AGTCGATCGC GCGCGCGCGC GAGTGGGTTG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC ACGCAGTACT ACCTGAACAA CATTCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG TCGAGCGAGC CCGCGCGCCG CGCGCTCGAC TACATGCTCA AGGAGCAGGA GCCGGACGGC AGCTGGTACG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG CTGAACGCGG CGGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAATGGCTG CTGTCGATCC AGAACGCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC CTGATGGCGG CGGGCGAAGT CGACAATCCC GCCGTCGCGC GCGGCGTCGA TTACCTGCTC GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TCACCGCGAC GGGCTTCCCG CGCGTGTTCT ATCTGCGCTA CCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC CGCTATCGCA ACCTGAAGCG CGCGAACGCG ACGCGCGTGA CGGTCGGGAT GTAA
|
Protein sequence | MNDMTEMHTL DATAAPAGLD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY LGEAPNVELE QKIARYLRRI QQPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR TVIVPLLVLN AKRPVAKNPR GVRIDELFKG APVSTGLLPK QPHQSAGWFA FFRAVDGVLR LVDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI ARRSIEKLLV VGEQEAYCQP CLSPVWDTSL AAHALLETGD ARAREAAVRG LDWLVPRQIL DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL LSIQNADGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDNP AVARGVDYLL GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA TRVTVGM
|
| |