Gene Information |
Locus tag | BURPS1710b_1750 |
Symbol | |
ID | 3689716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1893703 |
End bp | 1896927 |
Gene Length | 3225 bp |
Protein Length | 1074 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637728206 |
Product | haemagglutinin family protein |
Protein accession | YP_333151 |
Protein GI | 76809734 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGCGC CGGAGACGGC GCGTCGCAAA GGGAAAGGAC ATTCGCTGAC GATCGTGTGC GCGATCGCCT CAGGCCTGCT GCTTGCGGCG CCTGCGTGGG CGGACACGGT GTCGCCCTCG GGCACGGATA ACGTCTACGG CGTCGACGCG ACCGATCCCG GCGTGTCGAC GAACCAGGGC AATACGGCCT ACGGGGCGCA GGCGGGCGCG AAGGTCACGG GTTCGTACAA CACCGCGATC GGGTATCAAG CAGGGCAGAA CGTGAACGCC ATCGATACCG TATCGATCGG CAAGCAGGCC ACCGCGAGCG CGAATGACGC GATCGCGATC GGCACGAACA CGAAGGCGAG CGGGCCGGCC GACATCTACA TGGGGCTGAA CGCAGGCGCC GGCGCCGGCT CGACGACGAG CCCGGACGGC ACCGTCACGC TCGGCATTCG CAACATGGGC CTCGGGGAAT CCGCGGGCTC GTACGTGACG GGCCAGAACA ACACGGGGAT CGGCTATCAG TCGGGCATGA ACGTGACGGG CGACCAGAAC GTCGGCCTCG GGCAGCAGGC GGGACAATTC GTGACCGGGA CCGGCAACTC GGCGATGGGG CATCTGGCGG GGTCGACGGT GTCGGGCAGC TACAACGCCG CGTTCGGCGA GTATGCGGGG ACCAACACGA GCGGCGGCGC CAATGCCGCG TTCGGCTTCT ATGCGGGGCG CTACATCAAC GGCACGAACA ACACGGCGCT CGGTGCGTAC GATCTGCCGG TCGTCAATGG CACCTGGTAC GGTTCGTACG TGACGGGCAG CAACAACCTC GGCGCCGGCC ATAATTCGGG CGCCTACGTG AGCGGCGCGA GCAACGTCGG GCTCGGCGAC GGCGCGGGCA CGTTCGTGAC CGGCAGCAAC AACGTCGCCA TCGGCACGGC AGCGGGCTCG GGCGCGTATA CCAGCGGTCC GAGCGGCGCG ACGCTCAATG CGGCGCTCGT CGCGAGCAAC ACCGTGAGCA TCGGTACCCG CGCCACGGCG AGCCAGAGCG ACGCGATCGC GATCGGCAAG GGCGCGACCG CGAGCGGCGC GCAATCGATC AGCATCGGCA CCGGCAACGT CGTGAGCGGC AAGGGAAGCG GTGCGATCGG CGATCCGAGC ACCGTCAGCG GCGCGGGGTC CTATTCGATC GGCAACAACA ATACCGTCGC GAACAGCAAC ACGTTCGTGC TCGGCAACGG CGTGACGACG ACGCAGGACA ACAGCGTCGT GCTCGGCAAT CAGAGCACCG ACCGCGCGGC CGTCGCGGTT TCGAGCGAAA CCATCAATGG CACGACGTAC AACTACGCGG GCGTCGCGAG CCCGGCCAAC GGCGTCGTCA GCATCGGCGG CGTGGGCACG GAGCGCCAGC TCATCAACGT GGCGGCGGGC CAGGTGAGCG CGACCAGCAC GGACGCGATC AACGGCAGCC AGCTGTACGC GACGAACCAG GCGGTGATCG CCGAGGACGC GAAAGTGAAT TCGCTCGGCG GCGGCGTGGC GAGCGCGCTC GGCGGCAACG CGGCGTACAA CGCGACGACC GGCGCGATCA CCGCGCCGAG CTACGCGGTC TACGGGACCA CGCAAAACTC CGTGGGCGGC GCGATCGATG CGCTGCAGGC CCTCGCGCCG CTGCAGTACA CGTCCGGCCC GGGCGTGACC ACGCCGAACG CGCCGGGATC GGCGCCGACG AACACGGTGA CGCTCGTCGG CGCCGGCGGG CCGGGAGCCA ATACCACGCC GGTGACGCTC ACGAACGTCG CGCCGGGCAA ACTCTCCGCG 
ACCAGCACGG ACGCGGTCAA CGGCTCGCAG CTCTACGCGA CCAACCAGCA GGTCGCGAAC CTCGTGAGCT CGGTGAACAA CGGCGGCGTC GGCCCGGTGC AGTACAGCGA TCCTAGCGCG CCGACGACGC CCAACGGCGG CAAGCCCTCG CAGGACCTGA CGCTCGTCGG CGCGGCAAGC GGCCCTGTCG CGCTGCATAA CGTCGCGCCG GGCACGGCGT CCACCGATGC GGTCAACGTC GGGCAGCTCG GCGCGGTGAC GACCGGCCTG GGCGGCGGCG CGGCGATCGA TCCGAAGACG GGCGCCGTGA CCGCGCCGTC GTACACGGTC TACAACGCCG ACGGCACGAC GTCGAACGTC GGCAACGTCG GCGCGGCGAT CGATGCGATC AACTCGACCG GCATCAAGTA TTTCCACGCG AACAGCACGA AGCCGGACAG CCAGGCGCTC GGCGCGGACA GCGTCGCGAT CGGCCCGAAC GCCGTCGCGA ACAACGCGGG CGACGTCGCG CTCGGTTCGG GAGCGGTCAC GTCGCAAGCG GGCGGCACGC TGAGCGAAAC GATCAACGGC GTGACCTACT CGTTCGCCGG CACGACGCCG ATCGGCACGG TGAGCGTCGG CGCGCCGGGC GTCGAGCGCA CGATCACCAA CGTTGCCGCG GGGCGCATCG GGCAGTCGAG CACGGACGCG ATCAACGGCT CGCAACTGTA CGGCACCAAC CAGTCGATCG AGGCGTTGAC GGACAAGATG AACAGCCTCG GCAACACCGT GGCGAACACG CTCGGCAGCG GCGCGTCGTA CAACCCGCAA ACAGGCGCGG TGAACGGCCC GGCCAACTCG GGCGGCGTGG TCACGCCCAC GGTGATCCAG GAGGCGGCGA ACAAATGGGT GAGCGCCAAT CCGTCGACCT ACGTGGCGCC CGTCGCGACG GGCACGAACG GCATGGCGGT CGGCAGCGGC GCGGTTTCGA CGGGCCAGAA CTCGGTCGCG CTCGGCACGA ACGCGTCGGA CGGCGGCCGC TCGAACGTCG TGAGCGTCGG GGCGCCGGGC GCGGAGCGCC AGGTGACGAA CGTGGCGGCC GGCACGCAGG CGACCGATGC GGTCAACCTC GGGCAGATGA ACGGCGCGCT CGCGCAGCAA ACCGACAGCT TCAACCAGCG GCTGGGCGCG GTTCAGCAGG ACGTCGACAA CGTCGCGCGC GCCGCCTACG GCGGCATCGC GGCCGCGACC GCGCTCACGA TGATCCCCGA GGTCGACAAG GACAAGACGA TCGCGGTGGG CATCGGCGGC GGCACGTATC GCGGCTACCA GGCGGTGGCG CTCGGCGCGA CGGCGCGCAT CACCGAGAAC ATCAAGGTTC GTGCGGGCGT CGGCATGAGC TCGGGCGGGA CGACGGCCGG CATCGGCGCA TCGATGCAGT GGTAA
|
Protein sequence | MVAPETARRK GKGHSLTIVC AIASGLLLAA PAWADTVSPS GTDNVYGVDA TDPGVSTNQG NTAYGAQAGA KVTGSYNTAI GYQAGQNVNA IDTVSIGKQA TASANDAIAI GTNTKASGPA DIYMGLNAGA GAGSTTSPDG TVTLGIRNMG LGESAGSYVT GQNNTGIGYQ SGMNVTGDQN VGLGQQAGQF VTGTGNSAMG HLAGSTVSGS YNAAFGEYAG TNTSGGANAA FGFYAGRYIN GTNNTALGAY DLPVVNGTWY GSYVTGSNNL GAGHNSGAYV SGASNVGLGD GAGTFVTGSN NVAIGTAAGS GAYTSGPSGA TLNAALVASN TVSIGTRATA SQSDAIAIGK GATASGAQSI SIGTGNVVSG KGSGAIGDPS TVSGAGSYSI GNNNTVANSN TFVLGNGVTT TQDNSVVLGN QSTDRAAVAV SSETINGTTY NYAGVASPAN GVVSIGGVGT ERQLINVAAG QVSATSTDAI NGSQLYATNQ AVIAEDAKVN SLGGGVASAL GGNAAYNATT GAITAPSYAV YGTTQNSVGG AIDALQALAP LQYTSGPGVT TPNAPGSAPT NTVTLVGAGG PGANTTPVTL TNVAPGKLSA TSTDAVNGSQ LYATNQQVAN LVSSVNNGGV GPVQYSDPSA PTTPNGGKPS QDLTLVGAAS GPVALHNVAP GTASTDAVNV GQLGAVTTGL GGGAAIDPKT GAVTAPSYTV YNADGTTSNV GNVGAAIDAI NSTGIKYFHA NSTKPDSQAL GADSVAIGPN AVANNAGDVA LGSGAVTSQA GGTLSETING VTYSFAGTTP IGTVSVGAPG VERTITNVAA GRIGQSSTDA INGSQLYGTN QSIEALTDKM NSLGNTVANT LGSGASYNPQ TGAVNGPANS GGVVTPTVIQ EAANKWVSAN PSTYVAPVAT GTNGMAVGSG AVSTGQNSVA LGTNASDGGR SNVVSVGAPG AERQVTNVAA GTQATDAVNL GQMNGALAQQ TDSFNQRLGA VQQDVDNVAR AAYGGIAAAT ALTMIPEVDK DKTIAVGIGG GTYRGYQAVA LGATARITEN IKVRAGVGMS SGGTTAGIGA SMQW
|
| |
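The length fields in this record follow from its coordinates: with 1-based inclusive positions, End bp - Start bp + 1 gives the gene length, and for an unspliced CDS ending in a stop codon (here TAA) the protein length is one less than the codon count. A minimal sketch of those checks, assuming inclusive coordinates; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(1893703, 1896927)
    print(length)                  # 3225, matches "Gene Length"
    print(protein_length(length))  # 1074, matches "Protein Length"
    # Only the first two 10-base blocks of the gene sequence, for illustration.
    print(gc_fraction("ATGGTGGCGC CGGAGACGGC"))  # 0.75
```

Passing the full "Gene sequence" field to `gc_fraction` would allow a comparison with the reported GC content of 70%; the sketch itself only checks the two length fields.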