Gene Information |
Locus tag | BMA10229_0582 |
Symbol | |
ID | 4788573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | - |
Start bp | 620338 |
End bp | 622800 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | haemagglutinin family protein |
Protein accession | YP_001024398 |
Protein GI | 124382777 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAGGCC AATTGATTGC GGTTTCTGAA TTTTCCCGGT CGAATGGCAA GTGTTCGACG ACGCAGGTCG TCACGGCGGC GCCGGGCGTT GCCGGTCGTA CCGCGGCTTC TGGCCGATCG CGCCCGTCGT GGACGAAGCT CGGGCTGATG TCGCTGGCGG TGAGCGCGGC GATGGGCTGC ATGGCGACCG ACGCCGCGGC GCAGGTCAGC TATGCGGCGG GCGAGAACGC CTATGCCGGC CCCGGCGGCA ATACCGGCCC GTGGGCGTTC TACAACCCGG CCTTCAGCGC GGGCACGCTG CTGTACGGCA CCGCGGTCGG CAACTACGCC TATGCGAACG GCGAGGGCAG CTCGGCCTAC GGCGATCACG CGACGGTGAA GGGGCGCATC GGCTCCGCGT TCGGCGCGTA TTCGGAAGCG GCGGGCGACG GCAGCACTGC AATCGGCGCC AGCGCGCGGG CGCTGCCGGA CTTCAGCATC GCGATCGGCA CGAACGCGCA GGCGCTGAAG GACACGGGCC AATCGATTCC CGGCCGCGAA GACATCGGCA CGATCGCGAT CGGCGCGGGC GCGCTCGCGC AGGGCGACAA CAGCGATCCG CTGCACGTGT CCGCGCCGAA CGCGTTCGGC GGCTATTCGA GCGCGACGGC AAGCGGCGCG GTGGCGCTGG GCGAAGGCGC CGCATCGTCC GGCTATTACG CGAACGCGCT CGGCTCGTAT TCGAAGGCGT CGGGTGCGGG TGCGGTCGCG GTGGGCGGCG GCGCGCAAGC GAGCGCGCAA GGCGCGGTGG CGATCGGCGG CGCGACGAGC GTCGACAACG CAACCGCGCT GTCCGGCTAC GCGAGCGCAA GCGGCGTCAA CGCGATCGCG ATCGGTTCCG GCGCGCAGGC GACGGGCGCC CGGTCGATCA GCATCGGCAC GGGCAACGTC GTGTCGGGGG CGAGCTCGGG CGCCTTCGGC GATCCGTCGA CGGTCACGGG CACGGGCTCG TATTCGTTCG GCAACAACAA CACGATCAAT TCGAACAACG CGTTCGTGCT CGGCAACAAC GTGACGATCG GCCCAGGGTT CGACGGCTCG GTCGCGCTCG GTAGCGGCAC GACGCTCGCC GCGGCGAACC CCACCGGCAG CGCGACGATC ACGACGAGCT CGGGCGGCCA GTTGACGCTG TCCGGCTTCG CCGGCGCGAA TCCGACGAGC GTCGTCAGCG TCGGCGCGCC CGGCGCCGAG CGCCAGATCA CGAACGTCGC GGCGGGGCGC ATCACGCCGA CGTCGACGGA TGCCGTCAAC GGCAGCCAGC TGTATGCGGT CGCGAGCACG ATCGACAATG CGGTGAACGG CGGCGGGATC AAGTACTTCC ACGCGAATTC GACCCTGGCC GATTCGACGG CGGCGGGCAC GGACAGCGTC GCGGTCGGGC CGGCCGCGCT CGCCTACGGC AACGATTCGA TCGCCGAAGG CACGAACGCG ACGGCGGGCG TGAGCGGCAA TCCGGCGGTG GCGGGCGATG TCGCGCTCGG CAGCGGCGCG CAGGCGACGG GAGGCCGCTC GCTCGCGCTC GGCGCGAACG CGTCGGTCAA CACGGCGGGC GGCGTGGCGC TCGGCGCCGG CTCGGTCGCG AACCGCGCGG CCGGCACGTA CACCGATCCG ATCACGGGCA GCAGCTTCAC GACCGCATTC GGCGCGGTGT CGGTCGGCCT CGAGGGTTCG CTGCGCCAGA TCACCAACGT CGCGGCGGGC ACGCAGGCAA CGGATGCGGT AAACGTCGGT CAGTTGCAAG GCGCGATTGC GCAGTTGAAT 
CAGACGATCC AGAACATCAC GAACGGCTCC AACTCGGGCA ACACCGGCAA TAACGGCAAC AACACCGGGC AGACCGTGTC GGGCCAGTGG ATCACGGGCA ACCCGTCGAC CTATACGCCG CCCGTGGCGA GCGGCATCGG CTCGACCGCC GCGGGCAGCG GCAGCGTGGC GTCCGGCGCG AACAGCGTCG CGATCGGCGA CGGCGCGTCG GCCTCCGGCA ACAACTCGGT GGCGCTCGGC GCCCATTCGG TCGCGAGCGC GCCGAACACG GTGTCGGTCG GCTCGGTCGG CAACGAGCGG ACGATCTCGA ACGTCGCGCC GGGCGTGAAC GGCACCGATG CGGTGAACGT GAACCAGTTG AACAGCGGTA TCGGCAATGC GGTCGGCCAG GCGAATCAGT ACACGGATCA GAAGGTCGAC CATCTGCGGC GCGAGATGAA CGGCGGCGTG GCTGCGGCGA TGGCCGTGGC GGGCTTGCCG CAGCCGACCG CGCCCGGCAA GAGCATGGTC GCGATCGCCG GCTCGACGTG GCAGGGGCAG CAGGGCTTCG CGCTTGGCGT ATCGACGATT TCCGAGAACG GCAAGTGGCT GTACAAGGGC TCGCTCACGA CCAGCACGCG CGGCGGCACG GGCGCGGTGC TCGGGGCCGG TTATCAGTGG TGA
|
Protein sequence | MRGQLIAVSE FSRSNGKCST TQVVTAAPGV AGRTAASGRS RPSWTKLGLM SLAVSAAMGC MATDAAAQVS YAAGENAYAG PGGNTGPWAF YNPAFSAGTL LYGTAVGNYA YANGEGSSAY GDHATVKGRI GSAFGAYSEA AGDGSTAIGA SARALPDFSI AIGTNAQALK DTGQSIPGRE DIGTIAIGAG ALAQGDNSDP LHVSAPNAFG GYSSATASGA VALGEGAASS GYYANALGSY SKASGAGAVA VGGGAQASAQ GAVAIGGATS VDNATALSGY ASASGVNAIA IGSGAQATGA RSISIGTGNV VSGASSGAFG DPSTVTGTGS YSFGNNNTIN SNNAFVLGNN VTIGPGFDGS VALGSGTTLA AANPTGSATI TTSSGGQLTL SGFAGANPTS VVSVGAPGAE RQITNVAAGR ITPTSTDAVN GSQLYAVAST IDNAVNGGGI KYFHANSTLA DSTAAGTDSV AVGPAALAYG NDSIAEGTNA TAGVSGNPAV AGDVALGSGA QATGGRSLAL GANASVNTAG GVALGAGSVA NRAAGTYTDP ITGSSFTTAF GAVSVGLEGS LRQITNVAAG TQATDAVNVG QLQGAIAQLN QTIQNITNGS NSGNTGNNGN NTGQTVSGQW ITGNPSTYTP PVASGIGSTA AGSGSVASGA NSVAIGDGAS ASGNNSVALG AHSVASAPNT VSVGSVGNER TISNVAPGVN GTDAVNVNQL NSGIGNAVGQ ANQYTDQKVD HLRREMNGGV AAAMAVAGLP QPTAPGKSMV AIAGSTWQGQ QGFALGVSTI SENGKWLYKG SLTTSTRGGT GAVLGAGYQW
|
| |
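The coordinate and length fields in the Gene Information block can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the Gene Length includes the terminal stop codon (the gene sequence above ends in TGA); neither convention is stated in the record itself, but both are consistent with its numbers. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information block of this record.
START_BP = 620338
END_BP = 622800
GENE_LENGTH_BP = 2463
PROTEIN_LENGTH_AA = 820


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 622800 - 620338 + 1 gives the 2463 bp Gene Length, and 2463 / 3 - 1 gives the 820 aa Protein Length. The strand ("-") does not affect either calculation, since both depend only on the span of the feature.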