Gene BMA10229_1409 details


Gene Information

Locus tag: BMA10229_1409
Symbol: shc
ID: 4790162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008835
Strand:
Start bp: 1476190
End bp: 1478163
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: squalene-hopene cyclase
Protein accession: YP_001025211
Protein GI: 124381762
COG category:
COG ID:
TIGRFAM ID:
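
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1476190
end_bp = 1478163
gene_length_bp = 1974
protein_length_aa = 657

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 1478163 - 1476190 + 1 = 1974, and 1974 / 3 = 658 codons, of which 657 encode amino acids.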


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.525884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACA TGACCGAAAT GCATACGCTC GACGCAACCG CCGCGCCGGC CGGCCTCGAC 
GCCGCCGTCG CGCGCGCGAC CGACGCGCTG CTCGCCGCGC AGCAAGCGGA CGGCCACTGG
GTCTACGAGC TCGAAGCCGA TTCGACGATC CCGGCCGAAT ACGTGCTGCT CGTCCACTAT
CTCGGCGAGG CGCCGAATGT CGAGCTCGAG CAGAAGATCG CGCGCTATCT GCGCCGGATT
CAGCAGCCGG ACGGCGGCTG GCCGCTCTTC ACCGACGGTG CGCCGAACAT TAGCGCGAGC
GTGAAGGCGT ACTTCGCGCT GAAGGTGATC GGCGACGACG AGAACGCCGA GCACATGCAG
CGCGCGCGCC GCGCGATCCA CGCGATGGGC GGCGCGGAGA TGTCGAACGT GTTCACGCGG
ATTCAGCTCG CGCTGTACGG CGTCGTGCCG TGGTACGCGG TGCCGATGAT GCCGGTCGAG
ATCATGCTGC TGCCGCAGTG GTTCCCGTTC CATCTATCGA AGGTGTCGTA CTGGGCGCGC
ACCGTGATCG TGCCGCTGCT CGTGCTGAAC GCGAAGCGCC CGGTCGCGAA GAATCCGCGC
GGCGTGCGCA TCGACGAGCT GTTCAAGGGC GCACCCGTCA GCACCGGCCT GCTGCCGAAG
CAGCCGCACC AGAGCGCCGG CTGGTTTGCG TTCTTCCGCG CGGTCGACGG GGTGCTGCGT
CTCGTCGACG GCCTCTTCCC GCGCTATACG CGCGAGCGCG CGATCCGCCA GGCGGTCGCG
TTCGTCGACG AGCGCCTGAA CGGCGAGGAC GGGCTCGGCG CGATCTATCC CGCGATGGCC
AACGCGGTGA TGATGTACGC GGCGCTCGGC TATCCCGAAG ATCATCCGAA CCGCGCGATC
GCGCGCCGCT CGATCGAGAA GCTGCTCGTC GTCGGCGAGC AAGAGGCGTA TTGCCAGCCG
TGCCTGTCGC CGGTATGGGA CACGTCGCTT GCCGCGCATG CGCTGCTCGA GACGGGCGAC
GCGCGCGCGC GCGAAGCGGC GGTGCGCGGC CTCGACTGGC TCGTGCCGCG GCAGATCCTC
GACGTGCGCG GCGACTGGAT CTCGCGCCGT CCGCACGTGC GCCCCGGCGG CTGGGCGTTC
CAGTACGCGA ATGCGCACTA TCCGGACGTC GACGACACGG CGGTCGTCGC GATGGCGATG
GACCGCGTCG CGAAGCTCGA CCGGACCGAC GCGTATCGCG AGTCGATCGC GCGCGCGCGC
GAGTGGGTTG TCGGCATGCA GAGCAGCGAC GGCGGCTGGG GCGCGTTCGA GCCGGAAAAC
ACGCAGTACT ACCTGAACAA CATTCCGTTC TCCGATCACG GCGCGCTGCT CGATCCGCCG
ACGGCCGACG TGTCGGGCCG CTGCCTGTCG ATGCTCGCGC AGTTCGGCGA GACGAGCGCG
TCGAGCGAGC CCGCGCGCCG CGCGCTCGAC TACATGCTCA AGGAGCAGGA GCCGGACGGC
AGCTGGTACG GCCGCTGGGG GATGAACTAC ATCTACGGCA CGTGGACCGC GCTGTGCTCG
CTGAACGCGG CGGGCCTCGG CCACGACGAT CCGCGCGTGA AGCGCGCCGC GCAATGGCTG
CTGTCGATCC AGAACGCCGA CGGCGGCTGG GGCGAGGACG GCGACAGCTA CAAGCTCGAC
TACCGCGGCT ACGAGCGCGC GCCGAGCACG TCGTCGCAGA CCGCGTGGGC GCTGCTCGGC
CTGATGGCGG CGGGCGAAGT CGACAATCCC GCCGTCGCGC GCGGCGTCGA TTACCTGCTC
GGCACGCAGC GCGAGCACGG CCTGTGGGAC GAGACGCGCT TCACCGCGAC GGGCTTCCCG
CGCGTGTTCT ATCTGCGCTA CCACGGCTAC CGCAAGTTCT TCCCGCTGTG GGCGCTCGCC
CGCTATCGCA ACCTGAAGCG CGCGAACGCG ATGCGCGTGA CGGTCGGGAT GTAA
 
Protein sequence
MNDMTEMHTL DATAAPAGLD AAVARATDAL LAAQQADGHW VYELEADSTI PAEYVLLVHY 
LGEAPNVELE QKIARYLRRI QQPDGGWPLF TDGAPNISAS VKAYFALKVI GDDENAEHMQ
RARRAIHAMG GAEMSNVFTR IQLALYGVVP WYAVPMMPVE IMLLPQWFPF HLSKVSYWAR
TVIVPLLVLN AKRPVAKNPR GVRIDELFKG APVSTGLLPK QPHQSAGWFA FFRAVDGVLR
LVDGLFPRYT RERAIRQAVA FVDERLNGED GLGAIYPAMA NAVMMYAALG YPEDHPNRAI
ARRSIEKLLV VGEQEAYCQP CLSPVWDTSL AAHALLETGD ARAREAAVRG LDWLVPRQIL
DVRGDWISRR PHVRPGGWAF QYANAHYPDV DDTAVVAMAM DRVAKLDRTD AYRESIARAR
EWVVGMQSSD GGWGAFEPEN TQYYLNNIPF SDHGALLDPP TADVSGRCLS MLAQFGETSA
SSEPARRALD YMLKEQEPDG SWYGRWGMNY IYGTWTALCS LNAAGLGHDD PRVKRAAQWL
LSIQNADGGW GEDGDSYKLD YRGYERAPST SSQTAWALLG LMAAGEVDNP AVARGVDYLL
GTQREHGLWD ETRFTATGFP RVFYLRYHGY RKFFPLWALA RYRNLKRANA MRVTVGM
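
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last rows of the gene sequence. This is a minimal sketch, not the database's own method: the `translate` helper and the variable names are hypothetical, and the codon table is built by hand in the usual TCAG ordering. Translation table 11 (bacterial) assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard assignments are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (shared by the standard code
# and translation table 11); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAACGACA TGACCGAAAT GCATACGCTC GACGCAACCG CCGCGCCGGC CGGCCTCGAC"
last_row = "CGCTATCGCA ACCTGAAGCG CGCGAACGCG ATGCGCGTGA CGGTCGGGAT GTAA"

# Start and end of the protein sequence, copied from the record.
protein_start = "MNDMTEMHTL DATAAPAGLD".replace(" ", "")
protein_end = "RYRNLKRANA MRVTVGM".replace(" ", "")

print(translate(first_row) == protein_start)
print(translate(last_row) == protein_end + "*")
```

The last row translates in frame because every preceding row holds 60 bases, a multiple of three. The trailing `*` corresponds to the TAA stop codon that ends the gene sequence and is not part of the 657-residue protein.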