Gene Information |
Locus tag | BURPS1106A_A1090 |
Symbol | |
ID | 4905708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1045325 |
End bp | 1050334 |
Gene Length | 5010 bp |
Protein Length | 1669 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640144196 |
Product | haemagglutinin motif-containing protein |
Protein accession | YP_001075125 |
Protein GI | 126456236 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5099] RNA-binding protein of the Puf family, translational repressor |
TIGRFAM ID | |
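The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon; both assumptions agree with the values in this record, but neither is stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1045325, 1050334)
print(length_bp)                           # 5010
print(expected_protein_length(length_bp))  # 1669
```

Both results match the Gene Length and Protein Length rows, which suggests the record is internally consistent under these assumptions.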
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAATAAGA TTTATCGCAA GGTTTGGAAC AAGGCGCGCG GCCAACTGGT CGTCGCGTCG GAACTGGCAT CCAGCCGTTC CAGTGTGGGA GAAGCTTCGG TCGACGCGGG GCGGTCTGGA GACCAGACAG GCTCGGCGGC ATTCACCAGC GAGGAGCGCA AACCTGGCTC AGGCCGGATG ATTCCGCTTG CAATAGCCGT GGCTCTGATG TTTTCACCCT ACGCATGGGC GGGTGTCGGC GGAGCCGACA ACGGCGTGAC GGGGACGAGC AACAACGGCG GGGTAGGCGG CTCGTCCGGC GGCGGCGGTG TCCAGTTCAG CGACATGGGC GTGGCCTTTG TCGGCGATGG CGACTGCTCG ATGCTCACGT CCGGGCCGGG ATCGTATGCC GGCGTATACG GTTCGGGGAG CAATTATCTG GGCGGCCTGT TCGGCTTCGG CGCACAGACG TCGGCCGTCG GCTGGGGGAC GCCCAGCAAC GCCGGCGCCA ACAGCGGTAT CGTCCCATAC CAGGGCGCTG CTCAAACCTT CGGCAACGTC ACCTATGCCG GCAACGGCAC GCAGAGCGGC AACTTCACGC AGGCGTTCGG CCTGAATTCC TTTGCGGTCG GCTGCGGCGC TCACGCGACC GGCCTGAGCG CGACGGCGAT CGGCTGGGGA ACCACCGCGA GCGGCGCCGG AAGCGTCGCG CTCGGGCTGT ACAGCACCGC GAGCGGCCAG GGATCGTTGG CGTTCGGCAC CAGCGCGACG GCGACGGCCA CCGACACCAT CGCGCTCGGC ACGCTGGCCA CGGCCAACGC GGTCAGCGGC GTGGCGATCG GCGCGAACAC GCAGGCTTCG GCCGCCAATG CAACCGCGAT CGGCGGCAAT TCATCCGGCG CCAACCTCGG CGCGCAGGCG ACGGCGGCCG GCGCCACTGC GATCGGCGGC AACGCGACGG CCGGGGCCGC CGCGACGGCG ACGAACGCCA TCGCGATCGG CGGGCAGTCG TCGGCGAAGG ATGCGAACGA TGTGGCCGTG GGCCTGGGCG CGAGGGCCGG CACGGGAAGC GGCGCGGGCA ACGATCTCGC GATCGGCAAT GGCGCGACGG CCACGGGCGG CAATTCGATC GCCCAGGGCG CGGGCGCGAG CGCCAATGCG GCCGGCGCGG TGGCCATCGG CAAATCGGCG TCCGCCGCCG GCGGGCAAGC CGTTTCGATC GGCGTGGCCA ATACCGCGTC GGGCAACGGC GCGGTGGCGA TCGGCGATCC GAACGTCGCG ACCGGAACCG GCGCTGTCGC GCTGGGCAAC AACAATACGG CCAATGGTCA AGGCGCGGTG GCGCTGGGCA ACGTCAGCAC GGCGGTCGGC CAAGGCAGCG TGGCGCTCGG CAACAGCAGC AATGCGGCCG CGGCGGGCGG AGTGGCCTTG GGCGATACCG CGAGCGCGGT GATGGCGGGC GGCTTGGCAC TCGGCTCGCT CGCGACGGCG AGCAATGCGA ACGACGTGGC GCTCGGCGCG GGTTCGAAAA CCGCCGCGGC AGTCGCGACG TCCACGGTTT CGGTGAACGG CGCCAACTAC GCGGTGGCGG GAAGCGGCCC GGCCAGCACG GTCAGCGTGG GTGCGCCGGG CAGCGAGCGC ACGATCACCA ATGTGGCCGC GGGTCAAGTA AGCGCCGGTT CCACCGATGC GGTGAATGGT TCGGAACTGT ACGCGACGAA CCAGGCAATC ACGACCGGAT TGTCGACAGC GAACAGCAGC ATCGCGTCGC TGTCCACGTC GACGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC 
TCGTTGTCGA CGGGTTTGTC GACCGCCAAC AGCACGGTTG CGTCGTTGTC GAGTTCCACG TCGACGGGAT TGTCCTCAGC CAATAGCGCG GTGGCGTCGC TGTCCACGTC GGCGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC TCGTTGTCGA CGGGCTTGTC GACCACGAAC AGCACGGTTG CGTCGTTGTC GACGTCCACG GTTGCCGGCC TGAATTCGCT GTCCACCGGA TTGAGCACGA CCAATAGCAA TGTCGCGTCG TTGTCGAGTT CCACGTCGAC GGGGCTGTCC TCGGCCAATA GCGCGGTGGC GTCACTGTCC ACGTCGACGT CGACGGGTCT TTCGAGCGCC AACAGCAACA TCGGTTCGTT GTCGACGGGC TTGTCGACCG CCAACAGCAC GGTTGCGTCG TTGTCGACGT CCACGGTTGC CGGCCTGAAT TCGCTGTCCA CCGGATTGAG CACGACCAAT AGCAATGTCG CGTCGTTGTC GAGTTCCACG TCGACGGGGC TGTCCTCGGC CAATAGCGCG GTGGTGTCGC TGTCCACGTC GGCGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC TCGTTGTCGA CGGGTTTGTC GACCACGAAC AGCACGGTTG CGTCGTTGTC GAGCTCGACG TCGACCGGTA TCGGTTCGTT GTCGACGGGG GTGGCCAATT CTGTCCAGTA TGACAGTCCT GCTCATACGT CCATTACCCT GGGTGGCGCC AGTGCAACGT CACCCGTGAA GATCACTAAT TTGGCGGCGG GCGCGAACCC GAGCGATGCC GTCAACTATG AGCAACTGAC ATCGCTGTCG ACGTCGGCGT CGACGGGACT GTCGTCGGCC AACAGCGCGA TCACGTCGCT ATCCACCTCG ACGTCGACCG GCATCGGCTC GCTGTCCACC GGACTGAGCA CGACCAACAG CAACGTCGCG TCGTTGTCGA CGTCGGCGTC GACGGGACTG TCGTCGGCCA ACAGCGCGAT CACGTCGCTA TCCACCTCGA CGTCGACCGG CATCGGCTCG CTGTCCACCG GGTTGAGCAC GACCAACAGC AACGTCGCGT CGTTGTCGAC GTCGGCGTCG ACGGGACTGT CGTCGGCCAA CAGCGCGATC ACGTCGCTGT CCACCTCGAC GTCGACCGGC ATCGGTTCGC TGTCCACCGG GTTGAGCACG ACCAACAGCA ACGTCGCATC GTTGTCGACG TCGGCGTCGA CGGGACTGTC GTCGGCCAAC AGCGCGATCA CGTCGCTGTC CACCTCGACG TCGACCGGCA TCGGTTCGCT GTCCACCGGA CTGAGCACGA CCAACAGTAA CGTCGCATCG TTGTCGACGT CGGCGTCGAC GGGATTATCC TCGGCCAACA GCGCGATCAC GTCGCTGTCC ACCTCGACGT CGACCGGCAT CGGCTCGTTG TCCACCGGGC TGAGCACGAC CAACAGCAAC CTGAGCTCCC TGTCCACGTC GAGCTCGACC GGCCTGAGTA CGGCCAACAG CAACATCTCG TCGCTGTCCA CCGGGCTGAA TTCGTTGTCG ACCGCGGTCA ACGGCGGCGG GACGAAGTAC TTCCACGCCA ACTCGACGCA GCCGGACAGT CAGGCGCTGG GGGCGGATTC CGTCGCGGTC GGGCCCGCGG CCATCGCGGC GGGCGCAAGC GGCATTGCGA TCGGCAATGC GGCGAACGCG GCCGCCAACG GCGCCGTCGC GATCGGCCAG GCCGCCGTCG CGAAGGGCGG GCTGGCTGTC TCGATCGGGG TGTCGAACAC GGCGAGCGGA GACGGCGCGG 
TGGCGATCGG CGATCCGAAC GTCGCGACCG GCACCGGCGC GGTCGCGCTT GGCGCGGACA ATTCGGCAAA CGGCCAGGGC GCCGTCGCGC TCGGCAACGC GAACATCGCA ACCGGAACGG GCTCGCTTGC GTTCGGCAAC ACGTCGACGG CGGCAGCGGC GGGCGCGGTC GCGTTGGGCG CCGGCGCAAT CGCGAACAAT GCGAACGATG TCGCGCTGGG TTCCGCTAGC GTGACCGCGG CTGCGAATCC GGTGGCCAGC GCGTTGATCG CAGGTCAGGC TTATTCGCTT GCCGGCGGCG CGCCGGCGAG CGTGGTGAGC GTCGGCGCGC CCGGCGCCGA ACGGCAAATC ATCAACGTCG CGGCCGGGCG GATTTCCGCC ACGTCGACCG ATGCGGTGAA CGGCTCGCAG ATGAATGCGA TGACTCAGGC GCTGGAATCG CTGTCGACTT CGACGGCCAG CGCGCTGTCC ACGGCGCAAA GCGGTCTGGG TTCGTTGTCG ACGGGGCTCA GCTCGACGCA GAGCAGCGTG AGTTCGCTGT CGACGGGGCT CAGCACGACG AGCGGCAATG TGGCGTCGCT GTCGAGCGGT CTGGGCACGA TGCAAAGCGG TATCGCGTCG CTGTCCACGG GGCTGAGCAC GACGAACAGC AGCCTCGCGT CGCTGTCGAC CGCCGTGTCC GGCGGCGGTG TTCGCACCAG CAGCTTGGGC GACACGTCGG CGGGCAATGG CGCGAACGCG TCCGGCGGCA ACGGCACGGC GGTCGGCGGC GCCGCGTCCG CTTCGGGAAC CGATGCGACC GCGCTGGGCC AGGCGTCGAA CGCGTCGGGC AATCATTCGA CCGCATTGGG GCAAGCATCG AGCGCGTCCG GAAGCGGCTC CACCGCGGTG GGACAAGGCG CCGGCGCGCC CGGCGACGGC GCTTCGGCAT TCGGCCAAGG GGCACTTGCC TCCGGTACGG ACTCGACGGC GCTCGGCGCT CATTCGACGG CTGCGGCGCC GAACTCGGCG GCGATCGGCG CGAATTCGGT GGCGTCCGCG CCGAATTCGG TGTCGTTCGG TTCGCGGGGC CATGAGCGCA GGCTGACGAA TGTCGCGCCG GGGATCGACG GCACCGACGC GGCGAACATG AACCAGCTCT GGGGCGTGCA ATCGAGCGTC GATCAGGCGG CGCGCCGCGC CTATTCCGGG GTGGCGGCCG CGACCGCGCT GACGATGATT CCGGAGGTCG ACCCCGGCAA GACGATCGCG GTCGGGATCG GCGCGGGCAG CTATCAAGGG TATTCGGCGT CCGCGATCGG CGTGTCCGTG CGGTTCTCCG ACAACCTGAA GGCGAAGCTC GGCGTGGGGA TCAGCGCTCA GGGCAGCACA TATGGCGCAG GCGTCTCGTA CCAGTGGTAG
Protein sequence | MNKIYRKVWN KARGQLVVAS ELASSRSSVG EASVDAGRSG DQTGSAAFTS EERKPGSGRM IPLAIAVALM FSPYAWAGVG GADNGVTGTS NNGGVGGSSG GGGVQFSDMG VAFVGDGDCS MLTSGPGSYA GVYGSGSNYL GGLFGFGAQT SAVGWGTPSN AGANSGIVPY QGAAQTFGNV TYAGNGTQSG NFTQAFGLNS FAVGCGAHAT GLSATAIGWG TTASGAGSVA LGLYSTASGQ GSLAFGTSAT ATATDTIALG TLATANAVSG VAIGANTQAS AANATAIGGN SSGANLGAQA TAAGATAIGG NATAGAAATA TNAIAIGGQS SAKDANDVAV GLGARAGTGS GAGNDLAIGN GATATGGNSI AQGAGASANA AGAVAIGKSA SAAGGQAVSI GVANTASGNG AVAIGDPNVA TGTGAVALGN NNTANGQGAV ALGNVSTAVG QGSVALGNSS NAAAAGGVAL GDTASAVMAG GLALGSLATA SNANDVALGA GSKTAAAVAT STVSVNGANY AVAGSGPAST VSVGAPGSER TITNVAAGQV SAGSTDAVNG SELYATNQAI TTGLSTANSS IASLSTSTST GLSSANSNIG SLSTGLSTAN STVASLSSST STGLSSANSA VASLSTSAST GLSSANSNIG SLSTGLSTTN STVASLSTST VAGLNSLSTG LSTTNSNVAS LSSSTSTGLS SANSAVASLS TSTSTGLSSA NSNIGSLSTG LSTANSTVAS LSTSTVAGLN SLSTGLSTTN SNVASLSSST STGLSSANSA VVSLSTSAST GLSSANSNIG SLSTGLSTTN STVASLSSST STGIGSLSTG VANSVQYDSP AHTSITLGGA SATSPVKITN LAAGANPSDA VNYEQLTSLS TSASTGLSSA NSAITSLSTS TSTGIGSLST GLSTTNSNVA SLSTSASTGL SSANSAITSL STSTSTGIGS LSTGLSTTNS NVASLSTSAS TGLSSANSAI TSLSTSTSTG IGSLSTGLST TNSNVASLST SASTGLSSAN SAITSLSTST STGIGSLSTG LSTTNSNVAS LSTSASTGLS SANSAITSLS TSTSTGIGSL STGLSTTNSN LSSLSTSSST GLSTANSNIS SLSTGLNSLS TAVNGGGTKY FHANSTQPDS QALGADSVAV GPAAIAAGAS GIAIGNAANA AANGAVAIGQ AAVAKGGLAV SIGVSNTASG DGAVAIGDPN VATGTGAVAL GADNSANGQG AVALGNANIA TGTGSLAFGN TSTAAAAGAV ALGAGAIANN ANDVALGSAS VTAAANPVAS ALIAGQAYSL AGGAPASVVS VGAPGAERQI INVAAGRISA TSTDAVNGSQ MNAMTQALES LSTSTASALS TAQSGLGSLS TGLSSTQSSV SSLSTGLSTT SGNVASLSSG LGTMQSGIAS LSTGLSTTNS SLASLSTAVS GGGVRTSSLG DTSAGNGANA SGGNGTAVGG AASASGTDAT ALGQASNASG NHSTALGQAS SASGSGSTAV GQGAGAPGDG ASAFGQGALA SGTDSTALGA HSTAAAPNSA AIGANSVASA PNSVSFGSRG HERRLTNVAP GIDGTDAANM NQLWGVQSSV DQAARRAYSG VAAATALTMI PEVDPGKTIA VGIGAGSYQG YSASAIGVSV RFSDNLKAKL GVGISAQGST YGAGVSYQW
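The sequences above are printed in space-separated blocks of ten, so they need to be joined before any downstream use. The sketch below shows one way to do that, plus a GC-content calculation and a rough completeness check. The helper names are hypothetical, and the start-codon set is a deliberately small subset of the initiation codons allowed by translation table 11, chosen here only for illustration.

```python
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most frequently used start codons are checked here;
# table 11 permits additional, rarer initiation codons.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def normalize(seq_field: str) -> str:
    """Join a block-formatted sequence field into one uppercase string."""
    return "".join(seq_field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a common start codon, a terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence above, used as a short excerpt.
excerpt = normalize("GTGAATAAGA TTTATCGCAA")
print(excerpt)       # GTGAATAAGATTTATCGCAA
print(excerpt[:3])   # GTG
```

In this record the gene sequence begins with GTG and ends with TAG, while the protein begins with M; this is consistent with GTG acting as an initiation codon under translation table 11, where the initiator is translated as methionine.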