Gene BURPS1710b_A1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1490 
Symbolshc 
ID3693826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1820163 
End bp1822118 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content70% 
IMG OID637731744 
Productsqualene-hopene cyclase 
Protein accessionYP_336647 
Protein GI76818996 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.761952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATACGC TCGACGCAAC CGCCGCGCCG GCCGGCCTCG ACGCCGCCGT CGCGCGCGCG 
ACCGACGCGC TGCTCGCCGC GCAGCAAGCG GACGGCCACT GGGTCTACGA GCTCGAAGCC
GATTCGACGA TCCCGGCCGA ATACGTGCTG CTCGTCCACT ATCTCGGCGA GGCGCCGAAT
GTCGAGCTCG AGCAGAAGAT CGCGCGCTAT CTGCGCCGGA TTCAGCAGCC GGACGGCGGC
TGGCCGCTCT TCACCGACGG TGCGCCGAAC ATTAGCGCGA GCGTGAAGGC GTACTTCGCG
CTGAAGGTGA TCGGCGACGA CGAGAACGCC GAGCACATGC AGCGCGCGCG CCGCGCGATC
CACGCGATGG GCGGCGCGGA GATGTCGAAC GTGTTCACGC GGATTCAGCT CGCGCTGTAC
GGCGTCGTGC CGTGGTACGC GGTGCCGATG ATGCCGGTCG AGATCATGCT GCTGCCGCAG
TGGTTCCCGT TCCATCTGTC GAAGGTGTCG TACTGGGCGC GCACCGTGAT CGTGCCGCTG
CTCGTGCTGA ACGCGAAGCG CCCGGTCGCG AAGAATCCGC GCGGCGTGCG CATCGACGAG
CTGTTCAAGG GCGCACCCGT CAGCACCGGC CTGCTGCCGA AGCAGCCGCA CCAGAGCGCC
GGCTGGTTTG CGTTCTTCCG CGCGGTCGAC GGGGTGCTGC GTCTCGTCGA CGGCCTCTTC
CCGCGCTATA CGCGCGAGCG CGCGATCCGC CAGGCGGTCG CGTTCGTCGA CGAGCGCCTG
AACGGCGAGG ACGGGCTCGG CGCGATCTAT CCCGCGATGG CCAACGCGGT GATGATGTAC
GCGGCGCTCG GCTATCCCGA AGATCATCCG AACCGCGCGA TCGCGCGCCG CTCGATCGAG
AAGCTGCTCG TCGTCGGCGA GCAAGAGGCG TATTGCCAGC CGTGCCTGTC GCCGGTATGG
GACACGTCGC TTGCCGCGCA CGCGCTGCTC GAGACGGGCG ACGCGCGCGC GCGCGAAGCG
GCGGTGCGCG GCCTCGACTG GCTCGTGCCG CGGCAGATCC TCGACGTGCG CGGCGACTGG
ATCTCGCGCC GTCCGCACGT GCGCCCCGGC GGCTGGGCGT TCCAGTACGC GAATGCGCAC
TATCCGGACG TCGACGACAC GGCGGTCGTC GCGATGGCGA TGGACCGCGT CGCGAAGCTC
GACCGGACCG ACGCGTATCG CGAGTCGATC GCGCGCGCGC GCGAGTGGGT TGTCGGCATG
CAGAGCAGCG ACGGCGGCTG GGGCGCGTTC GAGCCGGAAA ACACGCAGTA CTACCTGAAC
AACATTCCGT TCTCCGATCA CGGCGCGCTG CTCGATCCGC CGACAGCCGA CGTGTCGGGC
CGCTGCCTGT CGATGCTCGC GCAGTTCGGC GAGACGAGCG CGTCGAGCGA GCCCGCGCGC
CGCGCGCTCG ACTACATGCT CAAGGAGCAG GAGCCGGACG GCAGCTGGTA CGGCCGCTGG
GGGATGAACT ACATCTACGG CACGTGGACC GCGCTGTGCT CGCTGAACGC GGCGGGCCTC
GGCCACGACG ATCCGCGCGT GAAGCGCGCC GCGCAATGGC TGCTGTCGAT CCAGAACGCC
GACGGCGGCT GGGGCGAGGA CGGCGACAGC TACAAGCTCG ACTACCGCGG CTACGAGCGC
GCGCCGAGCA CGTCGTCGCA GACCGCGTGG GCGCTGCTCG GCCTGATGGC GGCGGGCGAA
GTCGACAATC CCGCCGTCGC GCGCGGCGTC GATTACCTGC TCGGCACGCA GCGCGAGCAC
GGCCTGTGGG ACGAGACGCG CTTCACCGCG ACGGGCTTCC CGCGCGTGTT CTATCTGCGC
TACCACGGCT ACCGCAAGTT CTTCCCGCTG TGGGCGCTCG CTCGCTATCG CAACCTGAAG
CGCGCGAACG CGACGCGCGT GACGGTCGGG ATGTAA
 
Protein sequence
MHTLDATAAP AGLDAAVARA TDALLAAQQA DGHWVYELEA DSTIPAEYVL LVHYLGEAPN 
VELEQKIARY LRRIQQPDGG WPLFTDGAPN ISASVKAYFA LKVIGDDENA EHMQRARRAI
HAMGGAEMSN VFTRIQLALY GVVPWYAVPM MPVEIMLLPQ WFPFHLSKVS YWARTVIVPL
LVLNAKRPVA KNPRGVRIDE LFKGAPVSTG LLPKQPHQSA GWFAFFRAVD GVLRLVDGLF
PRYTRERAIR QAVAFVDERL NGEDGLGAIY PAMANAVMMY AALGYPEDHP NRAIARRSIE
KLLVVGEQEA YCQPCLSPVW DTSLAAHALL ETGDARAREA AVRGLDWLVP RQILDVRGDW
ISRRPHVRPG GWAFQYANAH YPDVDDTAVV AMAMDRVAKL DRTDAYRESI ARAREWVVGM
QSSDGGWGAF EPENTQYYLN NIPFSDHGAL LDPPTADVSG RCLSMLAQFG ETSASSEPAR
RALDYMLKEQ EPDGSWYGRW GMNYIYGTWT ALCSLNAAGL GHDDPRVKRA AQWLLSIQNA
DGGWGEDGDS YKLDYRGYER APSTSSQTAW ALLGLMAAGE VDNPAVARGV DYLLGTQREH
GLWDETRFTA TGFPRVFYLR YHGYRKFFPL WALARYRNLK RANATRVTVG M