Gene BMAA2100 details


Gene Information

Locus tag: BMAA2100
Symbol: shc
ID: 3087572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 2306239
End bp: 2308194
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 70%
IMG OID: 637565963
Product: squalene-hopene cyclase
Protein accession: YP_106607
Protein GI: 53715881
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
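
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon, which matches the TAA at the end of the gene sequence in this record. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(2306239, 2308194)
print(length_bp)                  # 1956
print(protein_length(length_bp))  # 651
```

Under these assumptions, both results agree with the Gene Length (1956 bp) and Protein Length (651 aa) fields.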


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATACGC TCGACGCAAC CGCCGCGCCG GCCGGCCTCG ACGCCGCCGT CGCGCGCGCG 
ACCGACGCGC TGCTCGCCGC GCAGCAAGCG GACGGCCACT GGGTCTACGA GCTCGAAGCC
GATTCGACGA TCCCGGCCGA ATACGTGCTG CTCGTCCACT ATCTCGGCGA GGCGCCGAAT
GTCGAGCTCG AGCAGAAGAT CGCGCGCTAT CTGCGCCGGA TTCAGCAGCC GGACGGCGGC
TGGCCGCTCT TCACCGACGG TGCGCCGAAC ATTAGCGCGA GCGTGAAGGC GTACTTCGCG
CTGAAGGTGA TCGGCGACGA CGAGAACGCC GAGCACATGC AGCGCGCGCG CCGCGCGATC
CACGCGATGG GCGGCGCGGA GATGTCGAAC GTGTTCACGC GGATTCAGCT CGCGCTGTAC
GGCGTCGTGC CGTGGTACGC GGTGCCGATG ATGCCGGTCG AGATCATGCT GCTGCCGCAG
TGGTTCCCGT TCCATCTATC GAAGGTGTCG TACTGGGCGC GCACCGTGAT CGTGCCGCTG
CTCGTGCTGA ACGCGAAGCG CCCGGTCGCG AAGAATCCGC GCGGCGTGCG CATCGACGAG
CTGTTCAAGG GCGCACCCGT CAGCACCGGC CTGCTGCCGA AGCAGCCGCA CCAGAGCGCC
GGCTGGTTTG CGTTCTTCCG CGCGGTCGAC GGGGTGCTGC GTCTCGTCGA CGGCCTCTTC
CCGCGCTATA CGCGCGAGCG CGCGATCCGC CAGGCGGTCG CGTTCGTCGA CGAGCGCCTG
AACGGCGAGG ACGGGCTCGG CGCGATCTAT CCCGCGATGG CCAACGCGGT GATGATGTAC
GCGGCGCTCG GCTATCCCGA AGATCATCCG AACCGCGCGA TCGCGCGCCG CTCGATCGAG
AAGCTGCTCG TCGTCGGCGA GCAAGAGGCG TATTGCCAGC CGTGCCTGTC GCCGGTATGG
GACACGTCGC TTGCCGCGCA TGCGCTGCTC GAGACGGGCG ACGCGCGCGC GCGCGAAGCG
GCGGTGCGCG GCCTCGACTG GCTCGTGCCG CGGCAGATCC TCGACGTGCG CGGCGACTGG
ATCTCGCGCC GTCCGCACGT GCGCCCCGGC GGCTGGGCGT TCCAGTACGC GAATGCGCAC
TATCCGGACG TCGACGACAC GGCGGTCGTC GCGATGGCGA TGGACCGCGT CGCGAAGCTC
GACCGGACCG ACGCGTATCG CGAGTCGATC GCGCGCGCGC GCGAGTGGGT TGTCGGCATG
CAGAGCAGCG ACGGCGGCTG GGGCGCGTTC GAGCCGGAAA ACACGCAGTA CTACCTGAAC
AACATTCCGT TCTCCGATCA CGGCGCGCTG CTCGATCCGC CGACGGCCGA CGTGTCGGGC
CGCTGCCTGT CGATGCTCGC GCAGTTCGGC GAGACGAGCG CGTCGAGCGA GCCCGCGCGC
CGCGCGCTCG ACTACATGCT CAAGGAGCAG GAGCCGGACG GCAGCTGGTA CGGCCGCTGG
GGGATGAACT ACATCTACGG CACGTGGACC GCGCTGTGCT CGCTGAACGC GGCGGGCCTC
GGCCACGACG ATCCGCGCGT GAAGCGCGCC GCGCAATGGC TGCTGTCGAT CCAGAACGCC
GACGGCGGCT GGGGCGAGGA CGGCGACAGC TACAAGCTCG ACTACCGCGG CTACGAGCGC
GCGCCGAGCA CGTCGTCGCA GACCGCGTGG GCGCTGCTCG GCCTGATGGC GGCGGGCGAA
GTCGACAATC CCGCCGTCGC GCGCGGCGTC GATTACCTGC TCGGCACGCA GCGCGAGCAC
GGCCTGTGGG ACGAGACGCG CTTCACCGCG ACGGGCTTCC CGCGCGTGTT CTATCTGCGC
TACCACGGCT ACCGCAAGTT CTTCCCGCTG TGGGCGCTCG CCCGCTATCG CAACCTGAAG
CGCGCGAACG CGATGCGCGT GACGGTCGGG ATGTAA
 
Protein sequence
MHTLDATAAP AGLDAAVARA TDALLAAQQA DGHWVYELEA DSTIPAEYVL LVHYLGEAPN 
VELEQKIARY LRRIQQPDGG WPLFTDGAPN ISASVKAYFA LKVIGDDENA EHMQRARRAI
HAMGGAEMSN VFTRIQLALY GVVPWYAVPM MPVEIMLLPQ WFPFHLSKVS YWARTVIVPL
LVLNAKRPVA KNPRGVRIDE LFKGAPVSTG LLPKQPHQSA GWFAFFRAVD GVLRLVDGLF
PRYTRERAIR QAVAFVDERL NGEDGLGAIY PAMANAVMMY AALGYPEDHP NRAIARRSIE
KLLVVGEQEA YCQPCLSPVW DTSLAAHALL ETGDARAREA AVRGLDWLVP RQILDVRGDW
ISRRPHVRPG GWAFQYANAH YPDVDDTAVV AMAMDRVAKL DRTDAYRESI ARAREWVVGM
QSSDGGWGAF EPENTQYYLN NIPFSDHGAL LDPPTADVSG RCLSMLAQFG ETSASSEPAR
RALDYMLKEQ EPDGSWYGRW GMNYIYGTWT ALCSLNAAGL GHDDPRVKRA AQWLLSIQNA
DGGWGEDGDS YKLDYRGYER APSTSSQTAW ALLGLMAAGE VDNPAVARGV DYLLGTQREH
GLWDETRFTA TGFPRVFYLR YHGYRKFFPL WALARYRNLK RANAMRVTVG M