Gene BMAA0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA0749 
Symbol 
ID3085921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp768235 
End bp769356 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content64% 
IMG OID637564659 
Producthemagglutinin domain-containing protein 
Protein accessionYP_105472 
Protein GI53717377 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTTCTAT ACATCCGTAT GAAATATCAC CGTTTTCCCC GCTCTCATGC TCAACAAGAC 
ACCGGGCGAG CCGCATCGAC CGTTCCATTT CAGCGCTTCG CGCATCTACT ATGTTCGTCC
ATCGCTCCGC TGGCCCTCGG CTTTTCCACG GATGCGCTCG CTATCGGACA GGCTGAAAGT
ACGGCGTTTA ACGCGGTGAT CGATCAGATA AAAAAAGGTG ACTTTAAGTT GAAACCAGTT
GGGGACCGCA CGCTACCAAA CAAAGTCCCG CCACCGCCAC CGCCGCCACC GCCACCGCCA
CCGCCACCGC CACCGCCGCC GTCGCCACCG CCGCCGTCGC CACCGCCGCC GTCGCCACCG
CCGCCGTCGC CACCGCCGCC GTCGCCACCG CCGCCGACGA CGACGCCACC GACGACGACG
ACGCCGACAC CATCGATGCA CCCGATACAG CCGACACAAC TGCCGTCGAT TCCTAACGCG
ACACCAACCT CAGGATCCGC GACAAACGTC ACCATCAACT TCAATTCGAC CGGTGCCTCA
GCAATGGGCA CGAACTCTAT CGCCCTTGAC TTCCATGCAC GCGCTAAGGA CAGCGATTCG
CTCGCGAGCG GACGGCTCGC TCATGCGAGC GGCCCCCGGT CAACCGCGAT CGGTGCCGAA
GCAAATGCGT CCGGTCAAAA CACTGTCGCG CTCGGCGCTG GCTCCATAGC GGATCGTAAC
AACACGGTAT CCGTCGGTCG TCACGGTGAC GAACGACAAA TAGTGCACGT CGCAGCCGGC
ACGCAAGCCA CCGATGCCGT GAATGTCGGT CAGTTGAACC TCGCAATGTC GAACGCCAAC
GCGTACACGA ACCAGCGCAT CGGCGATCTT CAGCAGAGCA TCACCGACAC CGCGCGCGAC
GCGTATTCCG GCGTCGCCGC CGCGACCGCG CTGACGATGA TTCCCGATGT CGACCGCGAC
AAGAGGGTGT CGATCGGCGT CGGCGGCGCG GTCTACAAGG GCCATCGCGC CGTCGCGCTC
GGCGGCACCG CGCGCATCAA CGAAAACCTC AAGGTGCGGG CGGGCGTCGC GATGAGCGCG
GGCGGCAATG CCGTGGGCAT CGGCATGAGC TGGCAATGGT AA
 
Protein sequence
MVLYIRMKYH RFPRSHAQQD TGRAASTVPF QRFAHLLCSS IAPLALGFST DALAIGQAES 
TAFNAVIDQI KKGDFKLKPV GDRTLPNKVP PPPPPPPPPP PPPPPPPSPP PPSPPPPSPP
PPSPPPPSPP PPTTTPPTTT TPTPSMHPIQ PTQLPSIPNA TPTSGSATNV TINFNSTGAS
AMGTNSIALD FHARAKDSDS LASGRLAHAS GPRSTAIGAE ANASGQNTVA LGAGSIADRN
NTVSVGRHGD ERQIVHVAAG TQATDAVNVG QLNLAMSNAN AYTNQRIGDL QQSITDTARD
AYSGVAAATA LTMIPDVDRD KRVSIGVGGA VYKGHRAVAL GGTARINENL KVRAGVAMSA
GGNAVGIGMS WQW