Gene BURPS1106A_A1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1090 
Symbol 
ID4905708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1045325 
End bp1050334 
Gene Length5010 bp 
Protein Length1669 aa 
Translation table11 
GC content68% 
IMG OID640144196 
Producthaemagluttinin motif-containing protein 
Protein accessionYP_001075125 
Protein GI126456236 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG5099] RNA-binding protein of the Puf family, translational repressor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAAGA TTTATCGCAA GGTTTGGAAC AAGGCGCGCG GCCAACTGGT CGTCGCGTCG 
GAACTGGCAT CCAGCCGTTC CAGTGTGGGA GAAGCTTCGG TCGACGCGGG GCGGTCTGGA
GACCAGACAG GCTCGGCGGC ATTCACCAGC GAGGAGCGCA AACCTGGCTC AGGCCGGATG
ATTCCGCTTG CAATAGCCGT GGCTCTGATG TTTTCACCCT ACGCATGGGC GGGTGTCGGC
GGAGCCGACA ACGGCGTGAC GGGGACGAGC AACAACGGCG GGGTAGGCGG CTCGTCCGGC
GGCGGCGGTG TCCAGTTCAG CGACATGGGC GTGGCCTTTG TCGGCGATGG CGACTGCTCG
ATGCTCACGT CCGGGCCGGG ATCGTATGCC GGCGTATACG GTTCGGGGAG CAATTATCTG
GGCGGCCTGT TCGGCTTCGG CGCACAGACG TCGGCCGTCG GCTGGGGGAC GCCCAGCAAC
GCCGGCGCCA ACAGCGGTAT CGTCCCATAC CAGGGCGCTG CTCAAACCTT CGGCAACGTC
ACCTATGCCG GCAACGGCAC GCAGAGCGGC AACTTCACGC AGGCGTTCGG CCTGAATTCC
TTTGCGGTCG GCTGCGGCGC TCACGCGACC GGCCTGAGCG CGACGGCGAT CGGCTGGGGA
ACCACCGCGA GCGGCGCCGG AAGCGTCGCG CTCGGGCTGT ACAGCACCGC GAGCGGCCAG
GGATCGTTGG CGTTCGGCAC CAGCGCGACG GCGACGGCCA CCGACACCAT CGCGCTCGGC
ACGCTGGCCA CGGCCAACGC GGTCAGCGGC GTGGCGATCG GCGCGAACAC GCAGGCTTCG
GCCGCCAATG CAACCGCGAT CGGCGGCAAT TCATCCGGCG CCAACCTCGG CGCGCAGGCG
ACGGCGGCCG GCGCCACTGC GATCGGCGGC AACGCGACGG CCGGGGCCGC CGCGACGGCG
ACGAACGCCA TCGCGATCGG CGGGCAGTCG TCGGCGAAGG ATGCGAACGA TGTGGCCGTG
GGCCTGGGCG CGAGGGCCGG CACGGGAAGC GGCGCGGGCA ACGATCTCGC GATCGGCAAT
GGCGCGACGG CCACGGGCGG CAATTCGATC GCCCAGGGCG CGGGCGCGAG CGCCAATGCG
GCCGGCGCGG TGGCCATCGG CAAATCGGCG TCCGCCGCCG GCGGGCAAGC CGTTTCGATC
GGCGTGGCCA ATACCGCGTC GGGCAACGGC GCGGTGGCGA TCGGCGATCC GAACGTCGCG
ACCGGAACCG GCGCTGTCGC GCTGGGCAAC AACAATACGG CCAATGGTCA AGGCGCGGTG
GCGCTGGGCA ACGTCAGCAC GGCGGTCGGC CAAGGCAGCG TGGCGCTCGG CAACAGCAGC
AATGCGGCCG CGGCGGGCGG AGTGGCCTTG GGCGATACCG CGAGCGCGGT GATGGCGGGC
GGCTTGGCAC TCGGCTCGCT CGCGACGGCG AGCAATGCGA ACGACGTGGC GCTCGGCGCG
GGTTCGAAAA CCGCCGCGGC AGTCGCGACG TCCACGGTTT CGGTGAACGG CGCCAACTAC
GCGGTGGCGG GAAGCGGCCC GGCCAGCACG GTCAGCGTGG GTGCGCCGGG CAGCGAGCGC
ACGATCACCA ATGTGGCCGC GGGTCAAGTA AGCGCCGGTT CCACCGATGC GGTGAATGGT
TCGGAACTGT ACGCGACGAA CCAGGCAATC ACGACCGGAT TGTCGACAGC GAACAGCAGC
ATCGCGTCGC TGTCCACGTC GACGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC
TCGTTGTCGA CGGGTTTGTC GACCGCCAAC AGCACGGTTG CGTCGTTGTC GAGTTCCACG
TCGACGGGAT TGTCCTCAGC CAATAGCGCG GTGGCGTCGC TGTCCACGTC GGCGTCGACG
GGTCTTTCGA GCGCCAACAG CAACATCGGC TCGTTGTCGA CGGGCTTGTC GACCACGAAC
AGCACGGTTG CGTCGTTGTC GACGTCCACG GTTGCCGGCC TGAATTCGCT GTCCACCGGA
TTGAGCACGA CCAATAGCAA TGTCGCGTCG TTGTCGAGTT CCACGTCGAC GGGGCTGTCC
TCGGCCAATA GCGCGGTGGC GTCACTGTCC ACGTCGACGT CGACGGGTCT TTCGAGCGCC
AACAGCAACA TCGGTTCGTT GTCGACGGGC TTGTCGACCG CCAACAGCAC GGTTGCGTCG
TTGTCGACGT CCACGGTTGC CGGCCTGAAT TCGCTGTCCA CCGGATTGAG CACGACCAAT
AGCAATGTCG CGTCGTTGTC GAGTTCCACG TCGACGGGGC TGTCCTCGGC CAATAGCGCG
GTGGTGTCGC TGTCCACGTC GGCGTCGACG GGTCTTTCGA GCGCCAACAG CAACATCGGC
TCGTTGTCGA CGGGTTTGTC GACCACGAAC AGCACGGTTG CGTCGTTGTC GAGCTCGACG
TCGACCGGTA TCGGTTCGTT GTCGACGGGG GTGGCCAATT CTGTCCAGTA TGACAGTCCT
GCTCATACGT CCATTACCCT GGGTGGCGCC AGTGCAACGT CACCCGTGAA GATCACTAAT
TTGGCGGCGG GCGCGAACCC GAGCGATGCC GTCAACTATG AGCAACTGAC ATCGCTGTCG
ACGTCGGCGT CGACGGGACT GTCGTCGGCC AACAGCGCGA TCACGTCGCT ATCCACCTCG
ACGTCGACCG GCATCGGCTC GCTGTCCACC GGACTGAGCA CGACCAACAG CAACGTCGCG
TCGTTGTCGA CGTCGGCGTC GACGGGACTG TCGTCGGCCA ACAGCGCGAT CACGTCGCTA
TCCACCTCGA CGTCGACCGG CATCGGCTCG CTGTCCACCG GGTTGAGCAC GACCAACAGC
AACGTCGCGT CGTTGTCGAC GTCGGCGTCG ACGGGACTGT CGTCGGCCAA CAGCGCGATC
ACGTCGCTGT CCACCTCGAC GTCGACCGGC ATCGGTTCGC TGTCCACCGG GTTGAGCACG
ACCAACAGCA ACGTCGCATC GTTGTCGACG TCGGCGTCGA CGGGACTGTC GTCGGCCAAC
AGCGCGATCA CGTCGCTGTC CACCTCGACG TCGACCGGCA TCGGTTCGCT GTCCACCGGA
CTGAGCACGA CCAACAGTAA CGTCGCATCG TTGTCGACGT CGGCGTCGAC GGGATTATCC
TCGGCCAACA GCGCGATCAC GTCGCTGTCC ACCTCGACGT CGACCGGCAT CGGCTCGTTG
TCCACCGGGC TGAGCACGAC CAACAGCAAC CTGAGCTCCC TGTCCACGTC GAGCTCGACC
GGCCTGAGTA CGGCCAACAG CAACATCTCG TCGCTGTCCA CCGGGCTGAA TTCGTTGTCG
ACCGCGGTCA ACGGCGGCGG GACGAAGTAC TTCCACGCCA ACTCGACGCA GCCGGACAGT
CAGGCGCTGG GGGCGGATTC CGTCGCGGTC GGGCCCGCGG CCATCGCGGC GGGCGCAAGC
GGCATTGCGA TCGGCAATGC GGCGAACGCG GCCGCCAACG GCGCCGTCGC GATCGGCCAG
GCCGCCGTCG CGAAGGGCGG GCTGGCTGTC TCGATCGGGG TGTCGAACAC GGCGAGCGGA
GACGGCGCGG TGGCGATCGG CGATCCGAAC GTCGCGACCG GCACCGGCGC GGTCGCGCTT
GGCGCGGACA ATTCGGCAAA CGGCCAGGGC GCCGTCGCGC TCGGCAACGC GAACATCGCA
ACCGGAACGG GCTCGCTTGC GTTCGGCAAC ACGTCGACGG CGGCAGCGGC GGGCGCGGTC
GCGTTGGGCG CCGGCGCAAT CGCGAACAAT GCGAACGATG TCGCGCTGGG TTCCGCTAGC
GTGACCGCGG CTGCGAATCC GGTGGCCAGC GCGTTGATCG CAGGTCAGGC TTATTCGCTT
GCCGGCGGCG CGCCGGCGAG CGTGGTGAGC GTCGGCGCGC CCGGCGCCGA ACGGCAAATC
ATCAACGTCG CGGCCGGGCG GATTTCCGCC ACGTCGACCG ATGCGGTGAA CGGCTCGCAG
ATGAATGCGA TGACTCAGGC GCTGGAATCG CTGTCGACTT CGACGGCCAG CGCGCTGTCC
ACGGCGCAAA GCGGTCTGGG TTCGTTGTCG ACGGGGCTCA GCTCGACGCA GAGCAGCGTG
AGTTCGCTGT CGACGGGGCT CAGCACGACG AGCGGCAATG TGGCGTCGCT GTCGAGCGGT
CTGGGCACGA TGCAAAGCGG TATCGCGTCG CTGTCCACGG GGCTGAGCAC GACGAACAGC
AGCCTCGCGT CGCTGTCGAC CGCCGTGTCC GGCGGCGGTG TTCGCACCAG CAGCTTGGGC
GACACGTCGG CGGGCAATGG CGCGAACGCG TCCGGCGGCA ACGGCACGGC GGTCGGCGGC
GCCGCGTCCG CTTCGGGAAC CGATGCGACC GCGCTGGGCC AGGCGTCGAA CGCGTCGGGC
AATCATTCGA CCGCATTGGG GCAAGCATCG AGCGCGTCCG GAAGCGGCTC CACCGCGGTG
GGACAAGGCG CCGGCGCGCC CGGCGACGGC GCTTCGGCAT TCGGCCAAGG GGCACTTGCC
TCCGGTACGG ACTCGACGGC GCTCGGCGCT CATTCGACGG CTGCGGCGCC GAACTCGGCG
GCGATCGGCG CGAATTCGGT GGCGTCCGCG CCGAATTCGG TGTCGTTCGG TTCGCGGGGC
CATGAGCGCA GGCTGACGAA TGTCGCGCCG GGGATCGACG GCACCGACGC GGCGAACATG
AACCAGCTCT GGGGCGTGCA ATCGAGCGTC GATCAGGCGG CGCGCCGCGC CTATTCCGGG
GTGGCGGCCG CGACCGCGCT GACGATGATT CCGGAGGTCG ACCCCGGCAA GACGATCGCG
GTCGGGATCG GCGCGGGCAG CTATCAAGGG TATTCGGCGT CCGCGATCGG CGTGTCCGTG
CGGTTCTCCG ACAACCTGAA GGCGAAGCTC GGCGTGGGGA TCAGCGCTCA GGGCAGCACA
TATGGCGCAG GCGTCTCGTA CCAGTGGTAG
 
Protein sequence
MNKIYRKVWN KARGQLVVAS ELASSRSSVG EASVDAGRSG DQTGSAAFTS EERKPGSGRM 
IPLAIAVALM FSPYAWAGVG GADNGVTGTS NNGGVGGSSG GGGVQFSDMG VAFVGDGDCS
MLTSGPGSYA GVYGSGSNYL GGLFGFGAQT SAVGWGTPSN AGANSGIVPY QGAAQTFGNV
TYAGNGTQSG NFTQAFGLNS FAVGCGAHAT GLSATAIGWG TTASGAGSVA LGLYSTASGQ
GSLAFGTSAT ATATDTIALG TLATANAVSG VAIGANTQAS AANATAIGGN SSGANLGAQA
TAAGATAIGG NATAGAAATA TNAIAIGGQS SAKDANDVAV GLGARAGTGS GAGNDLAIGN
GATATGGNSI AQGAGASANA AGAVAIGKSA SAAGGQAVSI GVANTASGNG AVAIGDPNVA
TGTGAVALGN NNTANGQGAV ALGNVSTAVG QGSVALGNSS NAAAAGGVAL GDTASAVMAG
GLALGSLATA SNANDVALGA GSKTAAAVAT STVSVNGANY AVAGSGPAST VSVGAPGSER
TITNVAAGQV SAGSTDAVNG SELYATNQAI TTGLSTANSS IASLSTSTST GLSSANSNIG
SLSTGLSTAN STVASLSSST STGLSSANSA VASLSTSAST GLSSANSNIG SLSTGLSTTN
STVASLSTST VAGLNSLSTG LSTTNSNVAS LSSSTSTGLS SANSAVASLS TSTSTGLSSA
NSNIGSLSTG LSTANSTVAS LSTSTVAGLN SLSTGLSTTN SNVASLSSST STGLSSANSA
VVSLSTSAST GLSSANSNIG SLSTGLSTTN STVASLSSST STGIGSLSTG VANSVQYDSP
AHTSITLGGA SATSPVKITN LAAGANPSDA VNYEQLTSLS TSASTGLSSA NSAITSLSTS
TSTGIGSLST GLSTTNSNVA SLSTSASTGL SSANSAITSL STSTSTGIGS LSTGLSTTNS
NVASLSTSAS TGLSSANSAI TSLSTSTSTG IGSLSTGLST TNSNVASLST SASTGLSSAN
SAITSLSTST STGIGSLSTG LSTTNSNVAS LSTSASTGLS SANSAITSLS TSTSTGIGSL
STGLSTTNSN LSSLSTSSST GLSTANSNIS SLSTGLNSLS TAVNGGGTKY FHANSTQPDS
QALGADSVAV GPAAIAAGAS GIAIGNAANA AANGAVAIGQ AAVAKGGLAV SIGVSNTASG
DGAVAIGDPN VATGTGAVAL GADNSANGQG AVALGNANIA TGTGSLAFGN TSTAAAAGAV
ALGAGAIANN ANDVALGSAS VTAAANPVAS ALIAGQAYSL AGGAPASVVS VGAPGAERQI
INVAAGRISA TSTDAVNGSQ MNAMTQALES LSTSTASALS TAQSGLGSLS TGLSSTQSSV
SSLSTGLSTT SGNVASLSSG LGTMQSGIAS LSTGLSTTNS SLASLSTAVS GGGVRTSSLG
DTSAGNGANA SGGNGTAVGG AASASGTDAT ALGQASNASG NHSTALGQAS SASGSGSTAV
GQGAGAPGDG ASAFGQGALA SGTDSTALGA HSTAAAPNSA AIGANSVASA PNSVSFGSRG
HERRLTNVAP GIDGTDAANM NQLWGVQSSV DQAARRAYSG VAAATALTMI PEVDPGKTIA
VGIGAGSYQG YSASAIGVSV RFSDNLKAKL GVGISAQGST YGAGVSYQW