Gene BMAA2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA2022 
Symbol 
ID3087561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp2209220 
End bp2210878 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content69% 
IMG OID637565887 
Productputative cholesterol oxidase 
Protein accessionYP_106538 
Protein GI53715937 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGC AGTCCTACGA TTACGACTAC GTCGTGGTCG GCTCCGGCTT CGGCGGCAGC 
GTCTCGGCGC TGCGTCTGTC CGAGAAAGGC TATCGTGTGC TCGTGATCGA GCAGGGCCGT
CGCTGGACGC CCGAGAACCT GCCGGAAAGC ACGTGGAACC TGTCGCGCTG GCAATGGCGC
CCCGCGCTCG GGCTGCACGG CTTCTTCAGC ATGCGCTTTT TCAGGCACGT CGTCGTGCTG
CACGGCAATG CGGTGGGCGG CGGCTCGATC ACGTACGCGA ACACGCTGCT CGTGCCGCCC
AACAAGGTCT GGCGCGAGGG CACATGGGCC GGCCTCGAGG ACTGGGAACG CGTGATGCCC
GCGCACTACG CCACCGCGAA GCGCATGCTC GGCGTCGTCA CGAACCGGCG AATGGATGCG
GCCGACTTCC GGCTGAAGGA CATGGCGAAG CTGATCGGCG TCGAGAAGAG CTTCTATCCG
ACCGAGGTCG GCGTGTTCTT CGGCGACGAC GCCGACGCGC CCGGCACGCG CTACGCCGAT
CCGTACTTCG GCGGCGCGGG CCCGGAGCGC ACGTCGTGCA TCGGCTGCGG CGGCTGCATG
GTCGGCTGCC GCCACGGCGC GAAGAACACG CTCGACCGCA ATTACCTGTA TCTCGCCGAG
CGCCTCGGCG CGCAGGTGCG CGAGCAGACG AAGGTCGTCG ACGTGCGCCC GCTCGACGCG
CGCGCCGACG GCGCGGCGGG CTACGCGGTC GAAGCGGTGT CGCTCGCGGC GGGCGCGCGC
GGCGCGAAAA GCCGCCTCAC GTGCCGCGGC GTCGTGTTCG CCGCATCCTC GCTCGGCACG
CAGGATCTGC TGATGCGCCT GAAGGAAAAG GGCTCGCTGC CCCGGCTATC GGACGCGCTC
GGCAAGCGCG TGCGCACGAA CGCCGAATCG CTGATCGGCG TGCGCTTTCC GAAATCGCGC
GTCGATCTGT CGAAGGGCGT GGCGATCGGC TCGGGCATCT ACATCGACGA GCACACGCAC
ATCGAGGCCA CCCGCTATCC TTCGGGCTCC GACACGATGG GGCTGCTCAC GACCGTGCTC
ACGCGCGGCG CGCCGGGCGG TTTGCGTGTG CTCGTGTGGC TCGGCGCGCT CGCGAAGCTC
GTTCTCACGC GACCGCTGAG CGCGTGGCGG ATGATCGACC CGCGCGGCTT CGCGCGCGAG
ACGATGATCT TCCTCTGCAT GCAGACGCTC GAAGGACACC TGACGATGCG CCTGAAGCGC
CGCTGGTTCT GGCCGTTCTC GAAGCAGCTC GCGACCTCCG GCGCGAAGAT CCCCGCCTAC
ATTCCGGCCG CGAACGACTT CGCGCAGAAG GCCGCGCGCG CGCTCGGCGG CGTGCCGATG
ACCTCGCTCA CCGAGATCCT GCTGAACGTG CCGATGACCG CGCATTGCAT GGGCGGCGCG
GCGATGGCGC GCGACGCGCG CGACGGCGTG TGCGACGGCC GCAGCCGCGT GTTCGGCTAT
CGGAACATGT ACGTCTGCGA CGGCTCGGTG CTCGGCGCGA ACCTCGGCGT CAACCCGAGC
CTCACGATCA CGGCGCTCGC CGAGCATGCG ATGAGCCACG TGCCCGCCGC GCGCGAGCAG
CGGTGGGACA GTACCGCGGA GACGCCTGTC GCGGCATGA
 
Protein sequence
MKQQSYDYDY VVVGSGFGGS VSALRLSEKG YRVLVIEQGR RWTPENLPES TWNLSRWQWR 
PALGLHGFFS MRFFRHVVVL HGNAVGGGSI TYANTLLVPP NKVWREGTWA GLEDWERVMP
AHYATAKRML GVVTNRRMDA ADFRLKDMAK LIGVEKSFYP TEVGVFFGDD ADAPGTRYAD
PYFGGAGPER TSCIGCGGCM VGCRHGAKNT LDRNYLYLAE RLGAQVREQT KVVDVRPLDA
RADGAAGYAV EAVSLAAGAR GAKSRLTCRG VVFAASSLGT QDLLMRLKEK GSLPRLSDAL
GKRVRTNAES LIGVRFPKSR VDLSKGVAIG SGIYIDEHTH IEATRYPSGS DTMGLLTTVL
TRGAPGGLRV LVWLGALAKL VLTRPLSAWR MIDPRGFARE TMIFLCMQTL EGHLTMRLKR
RWFWPFSKQL ATSGAKIPAY IPAANDFAQK AARALGGVPM TSLTEILLNV PMTAHCMGGA
AMARDARDGV CDGRSRVFGY RNMYVCDGSV LGANLGVNPS LTITALAEHA MSHVPAAREQ
RWDSTAETPV AA