Gene BMA10247_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_0643 
Symbol 
ID4892434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009080 
Strand
Start bp622497 
End bp625865 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content69% 
IMG OID640149308 
Producthaemagluttinin family protein 
Protein accessionYP_001080210 
Protein GI126449882 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.696074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTTA GGAATGTTCA GATTCGGATT GTGCCTCGGC AAATAAATTT CAATTGCAGC 
ACAAAACTAT TCGAAGCTTA TTTGCGTGAG GATGAAATGA ACAAGACTTA TCGGGTTAGC
TGGAGCGCGT CGCGGGGTGC GTGGATGGTG GCGCCGGAGA CGGCGCGTCG CAAAGGGAAA
GGACATTCGC TGACGATCGT GTGCGCGATC GCCTCAGGCC TGCTGCTTGC GGCGCCTGCG
TGGGCGGACA CGGTGTCGCC GTCGGGCACG GATAACGTCT ACGGCGTCGA CGCGACCGAT
CCCGGCGTGT CGACGAACCA GGGCAATACG GCCTACGGGG CGCAGGCGGG CGCGAAGGTC
ACGGGTTCGT ACAACACCGC GATCGGGTAT CAAGCAGGGC AGAACGTGAA CGTCATCGAT
ACCGTATCGA TCGGCAAGCA GGCCACCGCG AGCGCGAATG ACGCGATCGC GATCGGCACG
AACACGAAGG CGAGCGGGCC GGCCGACATC TACATGGGGC TGAACGCAGG CGCCGGCGCC
GGCTCGACGA CGAGCCCGGA CGGCACCGTC ACGCTCGGCA TTCGCAACAT GGGCCTCGGG
GAATCCGCGG GCTCGTACGT GACGGGCCAG AACAACACGG GGATCGGCTA TCAGTCGGGC
ATGAACGTGA CGGGCGACCA GAACGTCGGC CTCGGGCAGC AGGCGGGACA ATTCGTGACC
GGGACCGGCA ACTCGGCGAT GGGGCATCTG GCGGGGTCGA CGGTGTCGGG CAGCTACAAC
GCCGCGTTCG GCGAGTATGC GGGGACCAAC ACGAGCGGCG GCGCCAATGC CGCGTTCGGC
TTCTATGCGG GGCGCTACAT CAACGGCACG AACAACACGG CGCTCGGCGC GTACGATCTG
CCGGTCGTCA ATGGCACCTG GTACGGTTCG TACGTGACGG GCAGCAACAA CCTCGGCGCC
GGCCATAATT CGGGCGCCTA CGTGAGCGGC GCGAGCAACG TCGGGCTCGG CGACGGCGCG
GGCACGTTCG TGACCGGCAG CAACAACGTC GCCATCGGCA CGGCAGCGGG CTCGGGCGCG
TATACCAGCG GTCCGAGCGG CGCGACGCTC AACGCGGCGC TCGTCGCGAG CAACACCGTG
AGCATCGGTA CCCGCGCCAC GGCGAGCCAG AGCGACGCGA TCGCGATCGG CAAGGGCGCG
ACCGCGAGCG GCGCGCAATC GATCAGCATC GGCACCGGCA ACGTCGTGAG CGGCAAGGGA
AGCGGCGCGA TCGGCGATCC GAGCACCGTC AGCGGCGCGG GGTCCTATTC GATCGGCAAC
AACAATACCG TCGCGAACAG CAACACGTTC GTGCTCGGCA ACGGCGTGAC GACGACGCAG
GACAACAGCG TCGTGCTCGG CAATCAGAGC ACCGACCGCG CGGCCGTCGC GGTTTCGAGC
GAAACCATCA ATGGCACGAC GTACAACTAC GCGGGCGTCG CGAGCCCGGC CAACGGCGTC
GTCAGCATCG GCGGCGTGGG CACGGAACGC CAGCTCATCA ACGTGGCGGC GGGCCAGGTG
AGCGCGACCA GCACGGACGC GATCAACGGC AGCCAGCTGT ACGCGACGAA CCAGGCGGTG
ATCGCCGAGG ACGCGAAAGT GAATTCGCTC GGCGGCGGCG TGGCGAGCGC GCTCGGCGGC
AACGCGGCGT ACAACGCGAC GACCGGCGCG ATCACCGCGC CGAGCTACGC GGTCTACGGG
ACCACGCAAA ACTCCGTGGG CGGCGCGATC GATGCGCTGC AGGCCCTCGC GCCGCTGCAG
TACACGTCCG GCCCGGGCGT GACCACGCCG AACGCGCCGG GATCGGCGCC GACGAACACG
GTGACGCTCG TCGGCGCCGG CGGGCCGGGA GCCAACACCA CGCCGGTGAC GCTCACGAAC
GTCGCGCCGG GCAAACTCTC CGCGACCAGC ACGGACGCGG TCAACGGCTC GCAGCTCTAC
GCGACCAACC AGCAGGTCGC GAACCTCGTG AGCTCGGTGA ACAACGGCGG CGTCGGCCCG
GTGCAGTACA GCGATCCTAG CGCGCCGACG ACGCCCAACG GCGGCAAGCC CTCGCAGGAC
CTGACGCTCG TCGGCGCGGC AAGCGGCCCT GTCGCGCTGC ATAACGTCGC GCCGGGCACG
GCGTCCACCG ATGCGGTCAA CGTCGGGCAG CTCGGCGCGG TGACGACCGG CCTGGGCGGC
GGCGCGGCGA TCGATCCGAA GACGGGCGCC GTGACCGCGC CGTCGTACAC GGTCTACAAC
GCCGACGGCA CGACGTCGAA CGTCAGCAAC GTCGGCGCGG CGATCGATGC GATCAACTCG
ACCGGCATCA AGTATTTCCA CGCGAACAGC ACGAAGCCGG ACAGCCAGGC GCTCGGCGCG
GACAGCGTCG CGATCGGCCC GAACGCCGTC GCGAACAACG CGGGCGACGT CGCGCTCGGT
TCGGGAGCGG TCACGTCGCA AGCGGGCGGC ACGCTGAGCG AAACGATCAA CGGCGTGACC
TACTCGTTCG CCGGCACGAC GCCGATCGGC ACGGTGAGCG TCGGCGCGCC GGGCGTCGAG
CGCACGATCA CCAACGTTGC CGCGGGGCGC ATCGGGCAGT CGAGCACGGA CGCGATCAAC
GGCTCGCAAC TGTACGGCAC CAACCAGTCG ATCGAGGCGT TGACGGACAA GATGAACAGC
CTCGGCAACA CCGTGGCGAA CACGCTCGGC AGCGGCGCGT CGTACAACCC GCAAACAGGC
GCGGTGAACG GCCCGGCCAA CTCGGGCGGC GTGGTCACGC CCACGGTGAT CCAGGAGGCG
GCGAACAAAT GGGTGAGCGC CAATCCGTCG ACCTACGTGG CGCCCGTCGC GACGGGCACG
AACGGCATGG CGGTCGGCAG CGGCGCGGTT TCGACGGGCC AGAACTCGGT CGCGCTCGGC
ACGAACGCGT CGGACGGCGG CCGCTCGAAC GTCGTGAGCG TCGGGGCGCC GGGCGCGGAG
CGCCAGGTGA CGAACGTGGC GGCCGGCACG CAGGCGACCG ATGCGGTCAA CCTCGGGCAG
ATGAACGGCG CGCTCGCGCA GCAAACCGAC AGCTTCAATC AGCGGCTGGG CGCGGTTCAG
CAGGACGTCG ACAACGTCGC GCGCGCCGCC TACGGCGGCA TCGCGGCCGC GACCGCGCTC
ACGATGATCC CCGAGGTCGA CAAGGACAAG ACGATCGCGG TGGGCATCGG CGGCGGCACG
TATCGCGGCT ACCAGGCGGT GGCGCTCGGC GCGACGGCGC GCATCACCGA GAACATCAAG
GTTCGTGCGG GCGTCGGCAT GAGCTCGGGC GGGACGACGG CCGGCATCGG CGCATCGATG
CAGTGGTAA
 
Protein sequence
MRFRNVQIRI VPRQINFNCS TKLFEAYLRE DEMNKTYRVS WSASRGAWMV APETARRKGK 
GHSLTIVCAI ASGLLLAAPA WADTVSPSGT DNVYGVDATD PGVSTNQGNT AYGAQAGAKV
TGSYNTAIGY QAGQNVNVID TVSIGKQATA SANDAIAIGT NTKASGPADI YMGLNAGAGA
GSTTSPDGTV TLGIRNMGLG ESAGSYVTGQ NNTGIGYQSG MNVTGDQNVG LGQQAGQFVT
GTGNSAMGHL AGSTVSGSYN AAFGEYAGTN TSGGANAAFG FYAGRYINGT NNTALGAYDL
PVVNGTWYGS YVTGSNNLGA GHNSGAYVSG ASNVGLGDGA GTFVTGSNNV AIGTAAGSGA
YTSGPSGATL NAALVASNTV SIGTRATASQ SDAIAIGKGA TASGAQSISI GTGNVVSGKG
SGAIGDPSTV SGAGSYSIGN NNTVANSNTF VLGNGVTTTQ DNSVVLGNQS TDRAAVAVSS
ETINGTTYNY AGVASPANGV VSIGGVGTER QLINVAAGQV SATSTDAING SQLYATNQAV
IAEDAKVNSL GGGVASALGG NAAYNATTGA ITAPSYAVYG TTQNSVGGAI DALQALAPLQ
YTSGPGVTTP NAPGSAPTNT VTLVGAGGPG ANTTPVTLTN VAPGKLSATS TDAVNGSQLY
ATNQQVANLV SSVNNGGVGP VQYSDPSAPT TPNGGKPSQD LTLVGAASGP VALHNVAPGT
ASTDAVNVGQ LGAVTTGLGG GAAIDPKTGA VTAPSYTVYN ADGTTSNVSN VGAAIDAINS
TGIKYFHANS TKPDSQALGA DSVAIGPNAV ANNAGDVALG SGAVTSQAGG TLSETINGVT
YSFAGTTPIG TVSVGAPGVE RTITNVAAGR IGQSSTDAIN GSQLYGTNQS IEALTDKMNS
LGNTVANTLG SGASYNPQTG AVNGPANSGG VVTPTVIQEA ANKWVSANPS TYVAPVATGT
NGMAVGSGAV STGQNSVALG TNASDGGRSN VVSVGAPGAE RQVTNVAAGT QATDAVNLGQ
MNGALAQQTD SFNQRLGAVQ QDVDNVARAA YGGIAAATAL TMIPEVDKDK TIAVGIGGGT
YRGYQAVALG ATARITENIK VRAGVGMSSG GTTAGIGASM QW