Gene BMA10229_0582 details


Gene Information

Locus tag: BMA10229_0582
Symbol:
ID: 4788573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008835
Strand:
Start bp: 620338
End bp: 622800
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: haemagluttinin family protein
Protein accession: YP_001024398
Protein GI: 124382777
COG category:
COG ID:
TIGRFAM ID:
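The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 620338
end_bp = 622800

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2463 0 820
```

Under these assumptions the result agrees with the listed Gene Length (2463 bp) and Protein Length (820 aa).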


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGAGGCC AATTGATTGC GGTTTCTGAA TTTTCCCGGT CGAATGGCAA GTGTTCGACG 
ACGCAGGTCG TCACGGCGGC GCCGGGCGTT GCCGGTCGTA CCGCGGCTTC TGGCCGATCG
CGCCCGTCGT GGACGAAGCT CGGGCTGATG TCGCTGGCGG TGAGCGCGGC GATGGGCTGC
ATGGCGACCG ACGCCGCGGC GCAGGTCAGC TATGCGGCGG GCGAGAACGC CTATGCCGGC
CCCGGCGGCA ATACCGGCCC GTGGGCGTTC TACAACCCGG CCTTCAGCGC GGGCACGCTG
CTGTACGGCA CCGCGGTCGG CAACTACGCC TATGCGAACG GCGAGGGCAG CTCGGCCTAC
GGCGATCACG CGACGGTGAA GGGGCGCATC GGCTCCGCGT TCGGCGCGTA TTCGGAAGCG
GCGGGCGACG GCAGCACTGC AATCGGCGCC AGCGCGCGGG CGCTGCCGGA CTTCAGCATC
GCGATCGGCA CGAACGCGCA GGCGCTGAAG GACACGGGCC AATCGATTCC CGGCCGCGAA
GACATCGGCA CGATCGCGAT CGGCGCGGGC GCGCTCGCGC AGGGCGACAA CAGCGATCCG
CTGCACGTGT CCGCGCCGAA CGCGTTCGGC GGCTATTCGA GCGCGACGGC AAGCGGCGCG
GTGGCGCTGG GCGAAGGCGC CGCATCGTCC GGCTATTACG CGAACGCGCT CGGCTCGTAT
TCGAAGGCGT CGGGTGCGGG TGCGGTCGCG GTGGGCGGCG GCGCGCAAGC GAGCGCGCAA
GGCGCGGTGG CGATCGGCGG CGCGACGAGC GTCGACAACG CAACCGCGCT GTCCGGCTAC
GCGAGCGCAA GCGGCGTCAA CGCGATCGCG ATCGGTTCCG GCGCGCAGGC GACGGGCGCC
CGGTCGATCA GCATCGGCAC GGGCAACGTC GTGTCGGGGG CGAGCTCGGG CGCCTTCGGC
GATCCGTCGA CGGTCACGGG CACGGGCTCG TATTCGTTCG GCAACAACAA CACGATCAAT
TCGAACAACG CGTTCGTGCT CGGCAACAAC GTGACGATCG GCCCAGGGTT CGACGGCTCG
GTCGCGCTCG GTAGCGGCAC GACGCTCGCC GCGGCGAACC CCACCGGCAG CGCGACGATC
ACGACGAGCT CGGGCGGCCA GTTGACGCTG TCCGGCTTCG CCGGCGCGAA TCCGACGAGC
GTCGTCAGCG TCGGCGCGCC CGGCGCCGAG CGCCAGATCA CGAACGTCGC GGCGGGGCGC
ATCACGCCGA CGTCGACGGA TGCCGTCAAC GGCAGCCAGC TGTATGCGGT CGCGAGCACG
ATCGACAATG CGGTGAACGG CGGCGGGATC AAGTACTTCC ACGCGAATTC GACCCTGGCC
GATTCGACGG CGGCGGGCAC GGACAGCGTC GCGGTCGGGC CGGCCGCGCT CGCCTACGGC
AACGATTCGA TCGCCGAAGG CACGAACGCG ACGGCGGGCG TGAGCGGCAA TCCGGCGGTG
GCGGGCGATG TCGCGCTCGG CAGCGGCGCG CAGGCGACGG GAGGCCGCTC GCTCGCGCTC
GGCGCGAACG CGTCGGTCAA CACGGCGGGC GGCGTGGCGC TCGGCGCCGG CTCGGTCGCG
AACCGCGCGG CCGGCACGTA CACCGATCCG ATCACGGGCA GCAGCTTCAC GACCGCATTC
GGCGCGGTGT CGGTCGGCCT CGAGGGTTCG CTGCGCCAGA TCACCAACGT CGCGGCGGGC
ACGCAGGCAA CGGATGCGGT AAACGTCGGT CAGTTGCAAG GCGCGATTGC GCAGTTGAAT
CAGACGATCC AGAACATCAC GAACGGCTCC AACTCGGGCA ACACCGGCAA TAACGGCAAC
AACACCGGGC AGACCGTGTC GGGCCAGTGG ATCACGGGCA ACCCGTCGAC CTATACGCCG
CCCGTGGCGA GCGGCATCGG CTCGACCGCC GCGGGCAGCG GCAGCGTGGC GTCCGGCGCG
AACAGCGTCG CGATCGGCGA CGGCGCGTCG GCCTCCGGCA ACAACTCGGT GGCGCTCGGC
GCCCATTCGG TCGCGAGCGC GCCGAACACG GTGTCGGTCG GCTCGGTCGG CAACGAGCGG
ACGATCTCGA ACGTCGCGCC GGGCGTGAAC GGCACCGATG CGGTGAACGT GAACCAGTTG
AACAGCGGTA TCGGCAATGC GGTCGGCCAG GCGAATCAGT ACACGGATCA GAAGGTCGAC
CATCTGCGGC GCGAGATGAA CGGCGGCGTG GCTGCGGCGA TGGCCGTGGC GGGCTTGCCG
CAGCCGACCG CGCCCGGCAA GAGCATGGTC GCGATCGCCG GCTCGACGTG GCAGGGGCAG
CAGGGCTTCG CGCTTGGCGT ATCGACGATT TCCGAGAACG GCAAGTGGCT GTACAAGGGC
TCGCTCACGA CCAGCACGCG CGGCGGCACG GGCGCGGTGC TCGGGGCCGG TTATCAGTGG
TGA
 
Protein sequence
MRGQLIAVSE FSRSNGKCST TQVVTAAPGV AGRTAASGRS RPSWTKLGLM SLAVSAAMGC 
MATDAAAQVS YAAGENAYAG PGGNTGPWAF YNPAFSAGTL LYGTAVGNYA YANGEGSSAY
GDHATVKGRI GSAFGAYSEA AGDGSTAIGA SARALPDFSI AIGTNAQALK DTGQSIPGRE
DIGTIAIGAG ALAQGDNSDP LHVSAPNAFG GYSSATASGA VALGEGAASS GYYANALGSY
SKASGAGAVA VGGGAQASAQ GAVAIGGATS VDNATALSGY ASASGVNAIA IGSGAQATGA
RSISIGTGNV VSGASSGAFG DPSTVTGTGS YSFGNNNTIN SNNAFVLGNN VTIGPGFDGS
VALGSGTTLA AANPTGSATI TTSSGGQLTL SGFAGANPTS VVSVGAPGAE RQITNVAAGR
ITPTSTDAVN GSQLYAVAST IDNAVNGGGI KYFHANSTLA DSTAAGTDSV AVGPAALAYG
NDSIAEGTNA TAGVSGNPAV AGDVALGSGA QATGGRSLAL GANASVNTAG GVALGAGSVA
NRAAGTYTDP ITGSSFTTAF GAVSVGLEGS LRQITNVAAG TQATDAVNVG QLQGAIAQLN
QTIQNITNGS NSGNTGNNGN NTGQTVSGQW ITGNPSTYTP PVASGIGSTA AGSGSVASGA
NSVAIGDGAS ASGNNSVALG AHSVASAPNT VSVGSVGNER TISNVAPGVN GTDAVNVNQL
NSGIGNAVGQ ANQYTDQKVD HLRREMNGGV AAAMAVAGLP QPTAPGKSMV AIAGSTWQGQ
QGFALGVSTI SENGKWLYKG SLTTSTRGGT GAVLGAGYQW
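
The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation step, applied only to the first 60 bases of the gene sequence. The `translate` function and the constant names are hypothetical helpers written for this illustration. The codon table is built from the standard genetic-code ordering, which table 11 shares for amino-acid assignments.

```python
from itertools import product

# Codons in TCAG order, paired with the amino-acid string of the genetic code.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds):
    """Translate a coding sequence; a valid start codon is read as M."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First line of the gene sequence in this record (60 bases, 20 codons).
first_line = "GTGCGAGGCC AATTGATTGC GGTTTCTGAA TTTTCCCGGT CGAATGGCAA GTGTTCGACG"
first_residues = translate(first_line)
print(first_residues)  # MRGQLIAVSEFSRSNGKCST
```

The output matches the first 20 residues of the protein sequence above (MRGQLIAVSE FSRSNGKCST).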