Gene Information |
Locus tag | BURPS1106A_A1946 |
Symbol | |
ID | 4905803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1904035 |
End bp | 1909785 |
Gene Length | 5751 bp |
Protein Length | 1916 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640145052 |
Product | putative cell surface protein |
Protein accession | YP_001075980 |
Protein GI | 126456867 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats |
| |
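The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 1904035
end_bp = 1909785
gene_length_bp = 5751
protein_length_aa = 1916

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks print `True` for this record.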
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCACCG GGCTGTCGAC CGCGAATAGC ACCGTCGCTT CGCTGTCCAC CTCGACTTCG ACCGGCCTGA GCTCGGCCAC CAGCTCGATC GCTTCGCTGT CGACTTCCAC GTCAACCGGT ATTGGCTCGC TCTCCACCGG CCTGTCGACG ACCAACAGCA ACGTCGCCTC GCTGTCCAGC GGCCTGAGCA GCACGAATAG CTCGCTGACC TCGCTGTCGA CTTCAGCTTC GTCCGGCATC AGCACCGCGC AGAGCGGTGT CAATTCGTTG TCCACCGGGC TGTCGACCGC GAATAGCACC GTCGCTTCGC TGTCCACGTC GACCTCGACC GGTCTGAGCT CGGCCACCAG CTCGATCGCT TCGCTGTCGA CTTCCACGTC GACCGGCATC GGCTCGCTTT CCACCGGCCT GTCGACCGTC GCGACGACGA CCAACAATCT CGGCAGCAGC ACCGCCGCGG CGCTCGGCGG CGGCGCGACG TACAATCCGG CCACCGGCAC GATTTCCGCG CCGTCGTACA CGACGTACAA CGCGAACGGC ACGACCACGG CCAACACCAG TGTCGGCTCT GCGATCGACA GCATCAACGC GAACGGCATC AAGTACTTCC ATGCGAACTC GACTGGGCCG GACAGCGTCG CGACCGGCGC GGACGCCGTC GCGATCGGCA CGGGCGCGAC GGCCGCCACG GCGAACTCGG TCGCGCTCGG CGCGAACTCG GTGACGGCCG CGGCCGTCCC GACCAGCAGC GCGACCGTCG GCTCGACCAC GCTCGGTACG TTCGCGGGCA GCGCGCCCGT GGGCGTCGTG AGCGTCGGGG CGCCGAGCGC GGAACGCCAG ATCACGAACG TCGCCGCCGG CCAGGTCACC GCCAGCAGCA CCGATGCGAT CAACGGCAGC CAGCTCTATG CGGTTGCGTC GAAGCTCGAC TCGGCCTCGA GTTCGATTTC GTCGCTGTCG ACGGGACTGT CGTCGGCCAC CAGTTCGATT ACTTCGCTCT CCACTTCCAC CTCGACGGGC CTGAGCTCGG CGAATAGCTC GATCGCCTCG CTGTCGACCT CCACGTCGAC CGGTATCGGT TCGCTCTCCA CCGGTCTGTC CACGACCGAT AGCGCCGTCG CCTCGCTGTC CACCTCGACT TCGACCGGCC TGAGCTCGGC CACCAGCTCG ATCACTTCGC TGTCGACTTC CACGTCGACC GGCATCGGTT CGCTTTCCAC CGGCCTGTCC ACAACCAACA GCAACGTCGC TTCGCTGTCC ACCTCGACTT CGACCGGTCT GAGCTCGGCC ACCAGCTCGA TCGCTTCGCT GTCGACCTCC ACGTCGACCG GCATCGGCTC GCTTTCCACC GGTCTGTCCA CGACCAATAG CACCGTCGCC TCGCTGTCCA CGTCGACCTC GACCGGCCTG AGCTCGGCGA ATAGCTCGAT CGCCTCGCTG TCGACCTCCA CGTCGACCGG CATCGGCTCG CTTTCCACCG GCCTGTCCAC GACCAACAGC AACGTCGCCT CGCTGTCCAG CGGCCTGAGC AGCACGAACA GCTCGCTGAC CTCGCTTTCG ACTTCAGCTT CGTCCGGCAT CAGCACCGCG CAGAGCGGTG TCAATTCGTT GTCCACCGGG CTGTCGACCA CCAATAGCAC CGTCGCTTCG CTGTCCACCT CGACCTCGAC CGGCTTGAGC TCGGCCACCA GCTCGATCGC TTCGCTGTCG ACTTCCACGT CGACCGGCAT CGGCTCGCTC TCCACCGGTC TGTCCACGAC CGATAGCACC GTCGCTTCGC TGTCCACGTC GACCTCGACC 
GGCCTGAGTT CGGCCACCAG CTCGATCGCT TCGCTGTCGA CTTCCACGTC GACCGGTATC GGCTCGCTTT CCACCGGCCT GTCGACCACC AACAGCAACG TCGCCTCGCT GTCCAGCGGC CTGAGCAGCA CGAATAGCTC GCTGACCTCG CTGTCGACGT CCGCTTCGTC GGGCATCAGC ACCGCGCAGA GCGGCGTCAA TTCGTTGTCC ACCGGGCTGT CGACCGCGAA TAGCACCGTC GCTTCGCTGT CCACGTCGAC TTCGACGGGC CTGAGCTCGG CGAACAGCTC GATCGCTTCG CTGTCGACTT CCACGTCGAC CGGCATCGGC TCGCTTTCCA CCGGCCTGTC CACGACCGAT AGCACCGTCG CCTCGCTGTC CACGTCGACT TCGACCGGTC TGAGCTCGGC GAATAGCTCG ATCGCCTCGC TGTCGACCTC CACGTCGACC GGTATCGGCT CGCTGTCCAC CGGTCTGTCC ACGACCAACA GCAACGTCGC TTCGCTGTCC ACCTCGACTT CGACGGGCCT GAGCTCGGCG AATAGCTCGA TCGCCTCGCT GTCGACTTCC ACGTCGACCG GTATCGGCTC GCTTTCCACC GGCCTGTCCA CGACCAACAG CAACGTCGCT TCGCTGTCCA CCTCGACTTC GACGGGCCTG AACTTGGCGA ATAGCTCGAT CGCCTCGCTG TCGACTTCCA CGTCGACCAG TATCGACTCG CTTTCCACCG GCCTGTCCAC GACCAACAGC AACGTCGCTT CGCTGTCCAC CTCGACTTCG ACGGGCCTGA GCTCGGCGAA TAGCTCGATC GCCTCGCTGT CGACTTCCAC GTCGACCGGT ATCGGCTCGC TTTCCACCGG CCTGTCCACG ACCAACAGCA ACGTCGCTTC GCTGTCCACC TCGACTTCGA CGGGCCTGAA CTTGGCGAAT AGCTCGATCG CCTCGCTGTC GACCTCCACG TCGACCGGTA TCGGCTCGCT TTCCACCGGT CTGTCCACGA CCAACAGCAA CGTCGCTTCG CTGTCCACCT CGACTTCGAC GGGTCTGAGC TCGGCCACCA GCTCGATCGC TTCGCTGTCG ACTTCCACGT CGACCGGCAT CGGCTCGCTC TCCACCGGTC TGTCCACGAC CGATAGCACC GTCGCCTCGC TGTCCACGTC GACTTCGACC GGCCTGAGCT CGGCGAACAG CTCGATCGCC TCGCTGTCGA CTTCCACGTC GACCGGCATC GGCTCGCTGT CCACCGGTCT GTCCACGACC GATAGCACCG TCGCCTCGCT GTCCACGTCG ACTTCGACCG GCCTGAGCTC GGCCACCAGC TCGATCACTT CGCTGTCGAC TTCCACGTCG ACCGGCATCG GTTCGCTTTC CACCGGCCTG TCCACGACCA ACAGCAACGT CGCTTCGCTG TCCACCTCGA CTTCGACCGG TCTGAGCTCG GCCACCAGCT CGATCGCTTC GCTGTCGACC TCCACGTCGA CCGGCATCGG CTCGCTTTCC ACCGGTCTGT CGACGACCAA CAGCAACGTC GCTTCGCTGT CCACCTCGAC TTCGACGGGC CTGAGCTCGG CCACCAGCTC GATCGCTTCG CTGTCGACTT CCACGTCGAC CGGTATCGGC TCGCTTTCCA CCGGCCTGTC CACGACCGAT AGCACCGTCG CTTCGCTGTC CACGTCGACT TCGACGGGCC TGAGCTCGGC CACCAGCTCG ATCGCTTCGC TGTCGACTTC CACGTCGACC GGCATCGGCT CGCTGTCCAC CGGTCTGTCC ACGACCAACA GCACCGTCGC TTCGCTGTCC ACCTCGACTT 
CGACGGGCCT GAGCTCGGCC ACCAGCTCGA TCGCCTCGCT GTCGACCTCC ACGTCGACCG GTATCGGCTC GCTCTCCACC GGCCTGTCCA CGACCGATAG CACCGTCGCT TCGCTGTCCA CGTCGACTTC GACCGGCCTG AGCTCGGCCA CCAGCTCGAT CGCTTCGCTG TCGACCTCCA CGTCGACCGG TATCGGCTCG CTTTCCACCG GTCTGTCCAC GACCAACAGC ACCGTCGCTT CGCTGTCCAC CTCGACTTCG ACGGGCCTGA GCTCGGCCAC CAGCTCGATC GCCTCGCTGT CGACCTCCAC GTCAACCGGC ATCGGCTCGC TTTCCACCGG CCTGTCCACG ACCAACAGCA CCGTCGCCTC GCTGTCCACG TCGACCTCGA CCGGCATCGG CTCGCTCTCC ACCGGCCTGT CCACGACCAA CAGCACCGTC GCGTCGCTGT CCACGTCGAC TTCGACCGGC ATCGGTTCGC TGTCCACCGG CCTGTCGTCG ACCAACAGCA ACCTCGTGTC GCTCTCCACG TCGGCCTCGA CCGGCATCGG CTCGCTGTCG ACCAGCATCG ACACGACCAA CAGCAACCTC GCGTCGCTCT CCACGTCGAG CTCGACCGGC ATCGGCTCGC TGTCGACCAG CATCAACGCG ACCAACAGCA ACCTCGCGTC GCTCTCCACG TCCGCCTCGA CCGGCATCGG CTCGCTGTCG ACGAGCATCA GCTCGATCAC GACGAACACG ACGAACCTCG GCAACAGCAC CGCCGCGGCG CTCGGCGGCG GCGCGACGTA CGATCCGGCC ACCGGCGCGA TCTCCGCGCC GTCGTACACG ACGTACAACG CGAACGGCAC CACCGCGACC AACACCAGCG TCGGCGCCGC GATCGACAAC ATCAACGCGA ACGGCATCAA GTACTTCCAC GCGAACTCGA CCGATCCGGA CAGCGTCGCG ACCGGCACGA ACAGCGTCGC GATCGGCCCG AACGCGGTCG CGAACGTCGA CTACTCGGTC GCGATCGGCA GCGGCGCGAC GACCTCGGCG GCCGTGCCCG TCGCGTCGGC GAGCGTCGGC GGCCTCACGT TCGGCGGCTT CGCGGGCAGC GCGCCCATCG GCGTGTTCAG CGTCGGCGCG CCGGGCGCGG AACGCCAGAT CACGAATGTC GCCGCCGGCC GCATCTCCGC GGCCAGCACC GACGCCGTCA ACGGCAGCCA GCTCTACGCG ACCAACAGCA ATGTCGCGTC GCTGTCGACC GGTCTGAACG CGACGAACAG CAACCTCGCG TCGCTGTCCA CGTCCACCTC GACCGCCGTC GGCTCGCTGT CCACCGGCCT GTCCACGACC AACAGCACCG TCGCCTCGCT GTCCACGTCG ACCTCGACCA GCATCGGCTC GCTGTCCACC GGCCTCTCGA CCGCGAACAG CAACCTCGCG TCGCTGTCCA CGTCCACCTC GACCGGCATC GGCTCGCTGT CCACCGGCCT CGCGACGACC AACAGCAATG TCGCGTCGCT GTCGACGAGC GTGACCAACA TCAACACGCA GCTCACGTCG CTGTCGACGT CGATCACGAA CAACGTGATC CGGTCGCTGC CCGCGAGCAC CGGCGTCGCC GCGGACATGA GCGCGCCGAA GGCGACCTCG CCGTCCGTCA CGGCCGGCTC GAACTCGGTC GCGCTCGGCG CGGGCTCGAA CGACGGCGGT CGCTCGAACG TCGTGTCGGT GGGCAGCGAC ACGCAGCAGC GCCAGATCAC GAACGTCGCG GCCGGCACCG AGGGCACCGA CGCGGTCAAC GTCAACCAGT TGAATACGCT 
GTCGACGTCG ATGTCGCAAT CGCTGTCGAA TCAGCAGACG CAGCTCAACA ATCTCGGCTC GCAACTGAAC CAGACGCAGC AGCAACTGCA GCAGACCGAC ACGATGGCCC GCCAGGGGAT CGCGGCGGTC GCGGCGATGG CGTCGATTCC GCACATGGAC CGCGACTCGA ACTTCGCGAT GGGCGTGGGC ACCTCTTCGT TCCTCGGCCA GAAGGCGATC GCGGTCGGCA TGCAGGCGCG CATCACCGAG AACCTGAAGG CGTCGCTGAA CGGCGGCTTC GCCGGCAATC AGAAGGTCAT CGGCGCGGGC ATGCTCTATC AGTGGAAGTA A
|
Protein sequence | MSTGLSTANS TVASLSTSTS TGLSSATSSI ASLSTSTSTG IGSLSTGLST TNSNVASLSS GLSSTNSSLT SLSTSASSGI STAQSGVNSL STGLSTANST VASLSTSTST GLSSATSSIA SLSTSTSTGI GSLSTGLSTV ATTTNNLGSS TAAALGGGAT YNPATGTISA PSYTTYNANG TTTANTSVGS AIDSINANGI KYFHANSTGP DSVATGADAV AIGTGATAAT ANSVALGANS VTAAAVPTSS ATVGSTTLGT FAGSAPVGVV SVGAPSAERQ ITNVAAGQVT ASSTDAINGS QLYAVASKLD SASSSISSLS TGLSSATSSI TSLSTSTSTG LSSANSSIAS LSTSTSTGIG SLSTGLSTTD SAVASLSTST STGLSSATSS ITSLSTSTST GIGSLSTGLS TTNSNVASLS TSTSTGLSSA TSSIASLSTS TSTGIGSLST GLSTTNSTVA SLSTSTSTGL SSANSSIASL STSTSTGIGS LSTGLSTTNS NVASLSSGLS STNSSLTSLS TSASSGISTA QSGVNSLSTG LSTTNSTVAS LSTSTSTGLS SATSSIASLS TSTSTGIGSL STGLSTTDST VASLSTSTST GLSSATSSIA SLSTSTSTGI GSLSTGLSTT NSNVASLSSG LSSTNSSLTS LSTSASSGIS TAQSGVNSLS TGLSTANSTV ASLSTSTSTG LSSANSSIAS LSTSTSTGIG SLSTGLSTTD STVASLSTST STGLSSANSS IASLSTSTST GIGSLSTGLS TTNSNVASLS TSTSTGLSSA NSSIASLSTS TSTGIGSLST GLSTTNSNVA SLSTSTSTGL NLANSSIASL STSTSTSIDS LSTGLSTTNS NVASLSTSTS TGLSSANSSI ASLSTSTSTG IGSLSTGLST TNSNVASLST STSTGLNLAN SSIASLSTST STGIGSLSTG LSTTNSNVAS LSTSTSTGLS SATSSIASLS TSTSTGIGSL STGLSTTDST VASLSTSTST GLSSANSSIA SLSTSTSTGI GSLSTGLSTT DSTVASLSTS TSTGLSSATS SITSLSTSTS TGIGSLSTGL STTNSNVASL STSTSTGLSS ATSSIASLST STSTGIGSLS TGLSTTNSNV ASLSTSTSTG LSSATSSIAS LSTSTSTGIG SLSTGLSTTD STVASLSTST STGLSSATSS IASLSTSTST GIGSLSTGLS TTNSTVASLS TSTSTGLSSA TSSIASLSTS TSTGIGSLST GLSTTDSTVA SLSTSTSTGL SSATSSIASL STSTSTGIGS LSTGLSTTNS TVASLSTSTS TGLSSATSSI ASLSTSTSTG IGSLSTGLST TNSTVASLST STSTGIGSLS TGLSTTNSTV ASLSTSTSTG IGSLSTGLSS TNSNLVSLST SASTGIGSLS TSIDTTNSNL ASLSTSSSTG IGSLSTSINA TNSNLASLST SASTGIGSLS TSISSITTNT TNLGNSTAAA LGGGATYDPA TGAISAPSYT TYNANGTTAT NTSVGAAIDN INANGIKYFH ANSTDPDSVA TGTNSVAIGP NAVANVDYSV AIGSGATTSA AVPVASASVG GLTFGGFAGS APIGVFSVGA PGAERQITNV AAGRISAAST DAVNGSQLYA TNSNVASLST GLNATNSNLA SLSTSTSTAV GSLSTGLSTT NSTVASLSTS TSTSIGSLST GLSTANSNLA SLSTSTSTGI GSLSTGLATT NSNVASLSTS VTNINTQLTS LSTSITNNVI RSLPASTGVA ADMSAPKATS PSVTAGSNSV ALGAGSNDGG RSNVVSVGSD TQQRQITNVA AGTEGTDAVN 
VNQLNTLSTS MSQSLSNQQT QLNNLGSQLN QTQQQLQQTD TMARQGIAAV AAMASIPHMD RDSNFAMGVG TSSFLGQKAI AVGMQARITE NLKASLNGGF AGNQKVIGAG MLYQWK
|
| |
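The sequences above are displayed in space-separated blocks of ten characters wrapped over several lines, so they need cleaning before any analysis. The sketch below is illustrative only: the helper names `clean_sequence` and `gc_fraction` are hypothetical, the start codon set is a partial list assumed to be sufficient for translation table 11, and the two excerpts are the first and last blocks of the gene sequence as shown in this record, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Assumption: a partial set of initiation codons for translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}

# First and last blocks of the gene sequence, copied from the record.
head = clean_sequence("TTGTCCACCG GGCTGTCGAC CGCGAATAGC")
tail = clean_sequence("ATGCTCTATC AGTGGAAGTA A")

print(head[:3] in TABLE_11_STARTS)  # gene starts with TTG
print(tail[-3:] in STOPS)           # gene ends with TAA
```

Applying `gc_fraction` to the full cleaned gene sequence would allow a comparison with the GC content field. The TTG start codon is consistent with the protein sequence beginning with M, since an initiation codon is translated as methionine regardless of its usual meaning.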