Gene Information |
Locus tag | BURPS1710b_A1490 |
Symbol | shc |
ID | 3693826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1820163 |
End bp | 1822118 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637731744 |
Product | squalene-hopene cyclase |
Protein accession | YP_336647 |
Protein GI | 76818996 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.761952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACGC TCGACGCAAC CGCCGCGCCG GCCGGCCTCG ACGCCGCCGT CGCGCGCGCG ACCGACGCGC TGCTCGCCGC GCAGCAAGCG GACGGCCACT GGGTCTACGA GCTCGAAGCC GATTCGACGA TCCCGGCCGA ATACGTGCTG CTCGTCCACT ATCTCGGCGA GGCGCCGAAT GTCGAGCTCG AGCAGAAGAT CGCGCGCTAT CTGCGCCGGA TTCAGCAGCC GGACGGCGGC TGGCCGCTCT TCACCGACGG TGCGCCGAAC ATTAGCGCGA GCGTGAAGGC GTACTTCGCG CTGAAGGTGA TCGGCGACGA CGAGAACGCC GAGCACATGC AGCGCGCGCG CCGCGCGATC CACGCGATGG GCGGCGCGGA GATGTCGAAC GTGTTCACGC GGATTCAGCT CGCGCTGTAC GGCGTCGTGC CGTGGTACGC GGTGCCGATG ATGCCGGTCG AGATCATGCT GCTGCCGCAG TGGTTCCCGT TCCATCTGTC GAAGGTGTCG TACTGGGCGC GCACCGTGAT CGTGCCGCTG CTCGTGCTGA ACGCGAAGCG CCCGGTCGCG AAGAATCCGC GCGGCGTGCG CATCGACGAG CTGTTCAAGG GCGCACCCGT CAGCACCGGC CTGCTGCCGA AGCAGCCGCA CCAGAGCGCC GGCTGGTTTG CGTTCTTCCG CGCGGTCGAC GGGGTGCTGC GTCTCGTCGA CGGCCTCTTC CCGCGCTATA CGCGCGAGCG CGCGATCCGC CAGGCGGTCG CGTTCGTCGA CGAGCGCCTG AACGGCGAGG ACGGGCTCGG CGCGATCTAT CCCGCGATGG CCAACGCGGT GATGATGTAC GCGGCGCTCG GCTATCCCGA AGATCATCCG AACCGCGCGA TCGCGCGCCG CTCGATCGAG AAGCTGCTCG TCGTCGGCGA GCAAGAGGCG TATTGCCAGC CGTGCCTGTC GCCGGTATGG GACACGTCGC TTGCCGCGCA CGCGCTGCTC GAGACGGGCG ACGCGCGCGC GCGCGAAGCG GCGGTGCGCG GCCTCGACTG GCTCGTGCCG CGGCAGATCC TCGACGTGCG CGGCGACTGG ATCTCGCGCC GTCCGCACGT GCGCCCCGGC GGCTGGGCGT TCCAGTACGC GAATGCGCAC TATCCGGACG TCGACGACAC GGCGGTCGTC GCGATGGCGA TGGACCGCGT CGCGAAGCTC GACCGGACCG ACGCGTATCG CGAGTCGATC GCGCGCGCGC GCGAGTGGGT TGTCGGCATG CAGAGCAGCG ACGGCGGCTG GGGCGCGTTC GAGCCGGAAA ACACGCAGTA CTACCTGAAC AACATTCCGT TCTCCGATCA CGGCGCGCTG CTCGATCCGC CGACAGCCGA CGTGTCGGGC CGCTGCCTGT CGATGCTCGC GCAGTTCGGC GAGACGAGCG CGTCGAGCGA GCCCGCGCGC CGCGCGCTCG ACTACATGCT CAAGGAGCAG GAGCCGGACG GCAGCTGGTA CGGCCGCTGG GGGATGAACT ACATCTACGG CACGTGGACC GCGCTGTGCT CGCTGAACGC GGCGGGCCTC GGCCACGACG ATCCGCGCGT GAAGCGCGCC GCGCAATGGC TGCTGTCGAT CCAGAACGCC GACGGCGGCT GGGGCGAGGA CGGCGACAGC TACAAGCTCG ACTACCGCGG CTACGAGCGC GCGCCGAGCA CGTCGTCGCA GACCGCGTGG GCGCTGCTCG GCCTGATGGC GGCGGGCGAA GTCGACAATC CCGCCGTCGC GCGCGGCGTC GATTACCTGC TCGGCACGCA GCGCGAGCAC 
GGCCTGTGGG ACGAGACGCG CTTCACCGCG ACGGGCTTCC CGCGCGTGTT CTATCTGCGC TACCACGGCT ACCGCAAGTT CTTCCCGCTG TGGGCGCTCG CTCGCTATCG CAACCTGAAG CGCGCGAACG CGACGCGCGT GACGGTCGGG ATGTAA
|
Protein sequence | MHTLDATAAP AGLDAAVARA TDALLAAQQA DGHWVYELEA DSTIPAEYVL LVHYLGEAPN VELEQKIARY LRRIQQPDGG WPLFTDGAPN ISASVKAYFA LKVIGDDENA EHMQRARRAI HAMGGAEMSN VFTRIQLALY GVVPWYAVPM MPVEIMLLPQ WFPFHLSKVS YWARTVIVPL LVLNAKRPVA KNPRGVRIDE LFKGAPVSTG LLPKQPHQSA GWFAFFRAVD GVLRLVDGLF PRYTRERAIR QAVAFVDERL NGEDGLGAIY PAMANAVMMY AALGYPEDHP NRAIARRSIE KLLVVGEQEA YCQPCLSPVW DTSLAAHALL ETGDARAREA AVRGLDWLVP RQILDVRGDW ISRRPHVRPG GWAFQYANAH YPDVDDTAVV AMAMDRVAKL DRTDAYRESI ARAREWVVGM QSSDGGWGAF EPENTQYYLN NIPFSDHGAL LDPPTADVSG RCLSMLAQFG ETSASSEPAR RALDYMLKEQ EPDGSWYGRW GMNYIYGTWT ALCSLNAAGL GHDDPRVKRA AQWLLSIQNA DGGWGEDGDS YKLDYRGYER APSTSSQTAW ALLGLMAAGE VDNPAVARGV DYLLGTQREH GLWDETRFTA TGFPRVFYLR YHGYRKFFPL WALARYRNLK RANATRVTVG M
|
| |
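The coordinate and length fields in this record can be cross-checked against one another. The sketch below is illustrative and not part of the source database: it assumes Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon (consistent with "Is gene spliced | No" and the trailing TAA in the gene sequence). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1820163
END_BP = 1822118


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percent G+C in a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1956, matches "Gene Length"
    print(protein_length(length_bp))  # 651, matches "Protein Length"
```

Passing the "Gene sequence" field to `gc_percent` gives a value to compare with the reported "GC content"; the record rounds that figure to a whole percent.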