Gene BURPS1106A_A1951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1951 
Symbol 
ID4903449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1913443 
End bp1918269 
Gene Length4827 bp 
Protein Length1608 aa 
Translation table11 
GC content64% 
IMG OID640145057 
Productmembrane-anchored cell surface protein 
Protein accessionYP_001075985 
Protein GI126455469 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats
[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA TCTATAAAAC CATCTGGTGC GAGACCACGC GGAGCTGGGT CGCGGTTTCC 
GAGCACGCAA ACGGCAAGCG CGGCGGCGCG ACGGCGGCCG CGACGACTTC CGCGCGTCCG
ATCTGGACGC GGCTGCGCGG CATCTCGCTC GCCGCGCTGG CAGCGTTCGG CCTGGGGCTC
TTCGCCTCGC CCGCCGCGTT CGCGCAGTCG AACTCAGTGA TGTGCGCGAA CTACAACAAC
GGCATTCTTC CGACCTATAC CGGTTACGGC GCGAGCCCTT CGCTCACCTC GCCCTGCACG
ACGGGCATCG GCTCGTGGGC GGGCGGCGTC ACGCCCGGCT CGACGACCAA CTGGATCGGC
CTGTCCGCCG ACGACACGCA AATCGTCCTG AACGGCAGCA CCGGCAATAT CTATTTCCGC
GCGGGCGGCA CGAACGGCAA CACGCTGACG ATGAGCAACG TCGCGGGCTC GGGCCCGACG
GGAGGCGTGC TGCTGTCGGG CGTCGCGGCG GGCGCCGTGA CGGCGACGAG CTCGCAGGCG
ATCAACGGCA GCCAGCTCTA TTCGCTGTCG ACCTCGGCAT CGACGGGCAT CGGCTCGTTG
TCGAGCAGCA TGTCGACGTT CAACAGCTCG ATCTCGTCGC TGTCCACCGG ACTGTCGTCG
ACAAACAGCG GCCTGACCTC GCTTTCCACT TCGGCGTCGA CGGGCCTGTC GTCGGCGAAC
AGCTCGATTG CTTCGCTGTC GAGCGGGCTG AGCAGCACGA ACAGCTCGCT GACCTCGCTT
TCGACGTCCG CTTCCTCGGG CATCAGCACC GCGCAGAGCG GCGTCAATTC GCTGTCGACC
GGACTGTCGA CGACCAATAG CACGGTCGCT TCGTTGTCCA CGTCGACCTC GACCGGCATC
GGCTCGCTTT CCACCGGCCT GAGCAGCACG AACAGTTCGT TGACCTCGCT TTCAACGTCC
GCCTCGTCGG GCATCAGCTC GGCCAATAGC TCGGTTGCTT CGCTGTCCAC GTCGACCTCG
ACCGGCATCG GCTCGCTTTC CACCGGCCTG AGCAGCACTA ACAGCAGCCT CACGTCGCTG
TCGACCTCCA CGTCGACCGG CCTCTCGTCG GCCAACAGCT CGATCACTTC CCTTTCGAGC
GGGCTGAGCA CGACCAACAG CAACGTCGCC TCGCTGTCCA CCGGCCTGAG CAGCACCAAC
AGCTCGCTGA CCTCGCTTTC AACGTCCGCC TCGTCGGGCA TTAGCTCGGC CAATAGCTCG
GTTGCTTCGC TGTCCACGTC GACCTCGACC GGCATCGGCT CGCTGTCCAC CGGTCTGAGC
ACCACCAACA GCAACCTCGC CTCGCTGTCG ACCTCCACGT CGACGGGGCT CTCGTCGGCC
AACAGTTCGA TCACTTCCCT TTCGAGCGGG CTGAGCACGA CCAACAGCAA CGTCGCCTCG
CTGTCAACCG GCCTGAGCAG CACGAACAGC TCGTTGACCT CGCTTTCAAC GTCCACCTCG
TCGGGCATCA GCTCAGCCAA TAGCTCGGTT GCTTCGTTGT CCACGTCGAC CTCGACCGGC
ATCGGCTCGC TTTCCACCGG TCTGAGCACC ACCAACAGCA ACCTCACGTC GCTGTCGATC
TCCACGTCGA CGGGGCTCTC GTCGGCCAAC AGCTCGATCA CCTCCCTTTC GAGCGGGCTG
AGCACGACCA ACAGCAACGT CACCTCGCTG TCCACCGGCC TGAGCAGCAC GAACAGCTCG
TTGACCTCGC TTTCAACGTC CACCTCGTCG GGCATCAGCT CAGCCAATAG CTCGATTGCT
TCGTTGTCCA CGTCGACCTC GACCGGCATC ACTTCGTTGT CCACCGGCTT GAGTACCACC
GACAGCAATC TCACGTCGCT GTCGACCTCC ACGTCGACGG GGCTCTCGTC GGCCAACAGC
TCGATCACTT CCCTTTCGAG CGGGCTGAGC ACGACCAACA GCAACGTCGC CTCGCTGTCC
AGCGGCTTGA GCGCCACCAA CAGCTCGCTG ACCTCGCTTT CAACGTCCGC CTCGTCGGGC
ATCAGCTCAG CCAATAGCTC GGTTGCTTCG CTGTCCACGT CGACCTCGAC CGGTATCGGC
TCGCTTTCCA CCGGCTTGAG CACCACCAAC AGCAACCTCA CGTCGCTGTC GACCTCCACC
TCGACCAGCC TCTCGTCGGC CAACAGCTCG ATCACTTCCC TTTCGAGCGG GCTGAGCACG
ACCAACAGCA ACGTCGCCTC GCTGTCGACT TCCACGTCGA CCAGCCTCTC ATCCGCAACC
AGCTCGATCG CTTCGCTGTC CACGTCGACC TCGACCGGCA TCAGTTCGTT GTCCACCGGC
TTGAGTACCA CCGACAGCAA TCTCACGTCG CTGTCGACCT CCACCTCGAC CGGCCTCTCG
TCGGCAACCA GCTCGATCAC CTCCCTTTCG AGCGGACTGA GCACGACCAA CAGCAACGTC
GCCTCGCTGT CCACCGGCCT GAACAGCACG AACAGCTCGT TGACCTCGCT TTCAACGTCC
ACCTCGTCGG GCATCAGCTC GGCCAATAGC TCGATTGCTT CATTGTCCAC GTCGACCTCG
ACCGGCATCA GCTCGTTGTC CACCGGCTTG AGTACCACCG ACAGCAATCT CACGTCGCTG
TCGACCTCCA CCTCGACCGG CCTCTCGTCG GCAACCAGCT CGATCACCTC CCTTTCGAGC
GGACTGAGCA CGACCAACAG CAATGTCGCC TCGCTGTCCA CCGGCCTGAG CAGCACGAAC
AGCTCGCTGA CCTCGCTTTC GACGTCCGCT TCCTCCGGCA TCAGCACCGC GCAGAGCGGC
GTCAATTCGC TGTCGACCGG ACTGTCGACG ACCAATAGCA CGGTCGCTTC GTTGTCCACG
TCGACCTCGA CCGGCATCGG CTCGCTGTCC ACCGGTTTGA GCACCATCGA CAGCAACCTC
GCCTCGCTGT CGACTTCCAC GTCGACCGGC CTCTCGTCCG CGACCAGCTC GATCGCTTCG
TTGTCCACGT CGACCTCGAC CGGCATCAGC TCGCTGTCCA CCGGTTTGAG CACCACCGAC
AGCAACCTCG CCTCGCTGTC GACTTCCACG TCGACCGGCC TCTCATCCGC AACCAGCTCG
ATCGCTTCGC TGTCCACGTC GACCTCGACC GGCATCACTT CGTTGTCCAC CGGCTTGAGT
ACCACCGACA GCAATCTCAC GTCGCTGTCG ACCTCCACCT CGACCGGCCT CTCGTCGGCA
ACCAGCTCGA TCACCTCCCT TTCGAGCGGG CTGAGCACGA CCAACAGCAA CGTCGCCTCG
CTGTCCACCG GCCTGAGCAG CACGAACAGC TCGTTGACCT CGCTTTCAAC GTCCACCTCG
TCGGGCATCA GCTCAGCCAA TAGCTCGATT GCTTCGTTGT CCACGTCGAC CTCGACCGGC
ATCACTTCGT TGTCCACCGG CTTGAGTACC ACCGACAGCA ACCTCGCCTC GCTGTCGACT
TCCACGTCGA CCGGCCTTTC GTCCGCGACC AGCTCGATCG CTTCGCTGTC CACGTCGACC
TCGACGAGCG TCGACTCGCT GTCCACCGGC TTGAGCACCA CCAACAGCAG CGTCGCATCG
CTCTCGACCG GCCTGAGCAC CACCAACAGC AGCGTCGCAT CGCTCTCGAC CGGCCTGAGC
ACCACCGACA GCAGCCTCGC CTCGCTGTCG ACCTCCACGT CGACCGGCCT CTCGTCAACA
ACCAGCTCGA TCGCTTCGCT CTCGACGTCG ACCTCGACGA GCTTCTCGTC GGCGCTGAGC
TCGATCGGCT CGCTGTCCAC CGGCCTCGCG ACGACCAACA GCAATCTCGC GTCGCTGTCC
ACGTCGACGC TCACCTCCGT GAGCTCGCTG TCCACCGGCC TGAGCGCCAC CAACAGCAGC
GTCGCGTCGC TGTCGACGTC CGCCTCGACC GGCCTCGCCG CAACCAACAG CACGGTCGCC
TCGCTGTCCA CATCCACCTC GACGGCCGTC GGCTCGCTGT CCACCGGCCT GTCGACCACC
AACAGCAACG TCGCCTCGCT GTCCACGTCC ACCTCGACGG CCGTCGGCTC GCTGTCCACC
AGCCTGTCGA CCACCAACAG CAACGTCGCG TCGCTGTCCA CGTCGACTTC GACAAGCGTG
AACTCGCTGT CCACCGGCCT GTCGACGACG AACACGAGCG TCGCGTCGCT GTCGACGAGC
GTGACCAACC TCAACACGCA GCTCACGTCG CTGTCGACGA CGATCGTCAA CAGCACGAAC
AACGTGATCC GCACGCTGCC CGCGAGCACG GGCATCGCCG CCGACATGAG CGCGCCGAAC
GCCGCCGCGC CGTCCGTCAC GGCCGGCTCG AACTCGGTCG CGCTCGGCGC GAACTCGACC
GACGGCGGCC GCTCGAACGT CGTCTCGGTC GGCAGCGCCA CGCAGCAGCG GCAAATCACG
AACGTCGCCG CCGGCACCGA AGGCACCGAC GCGGTCAACG TCAACCAGTT GAATGCGCTG
TCCACTTCTA TGTCACAATC TCTGGCGGGC CAGCAGGGCC AGATCAACAA TCTGGGCTCG
CAGTTGACCC AGACTCAGCA AGCGCTGCAG CAGACCGATA CGATGGCCCG TCAGGGCATC
GCGGCCGCCA CCGCGCTGAC GATGCTGCCG CAGGTCGAGC CGGGCAAGAC GATCAATGTC
GCCGTCGGCG TCGCCCGCTT CGCAGGTCAG TCGGGGATGG CGTTCGGCGC GAGCGCGCAC
GTGACAACCA ACGGCATTCT CAAACTGGGC ATCGGCGTGT CGGGGCAGAA CAAGACCTTC
GGTGCAGGAT ATGGATATAG CTGGTGA
 
Protein sequence
MNKIYKTIWC ETTRSWVAVS EHANGKRGGA TAAATTSARP IWTRLRGISL AALAAFGLGL 
FASPAAFAQS NSVMCANYNN GILPTYTGYG ASPSLTSPCT TGIGSWAGGV TPGSTTNWIG
LSADDTQIVL NGSTGNIYFR AGGTNGNTLT MSNVAGSGPT GGVLLSGVAA GAVTATSSQA
INGSQLYSLS TSASTGIGSL SSSMSTFNSS ISSLSTGLSS TNSGLTSLST SASTGLSSAN
SSIASLSSGL SSTNSSLTSL STSASSGIST AQSGVNSLST GLSTTNSTVA SLSTSTSTGI
GSLSTGLSST NSSLTSLSTS ASSGISSANS SVASLSTSTS TGIGSLSTGL SSTNSSLTSL
STSTSTGLSS ANSSITSLSS GLSTTNSNVA SLSTGLSSTN SSLTSLSTSA SSGISSANSS
VASLSTSTST GIGSLSTGLS TTNSNLASLS TSTSTGLSSA NSSITSLSSG LSTTNSNVAS
LSTGLSSTNS SLTSLSTSTS SGISSANSSV ASLSTSTSTG IGSLSTGLST TNSNLTSLSI
STSTGLSSAN SSITSLSSGL STTNSNVTSL STGLSSTNSS LTSLSTSTSS GISSANSSIA
SLSTSTSTGI TSLSTGLSTT DSNLTSLSTS TSTGLSSANS SITSLSSGLS TTNSNVASLS
SGLSATNSSL TSLSTSASSG ISSANSSVAS LSTSTSTGIG SLSTGLSTTN SNLTSLSTST
STSLSSANSS ITSLSSGLST TNSNVASLST STSTSLSSAT SSIASLSTST STGISSLSTG
LSTTDSNLTS LSTSTSTGLS SATSSITSLS SGLSTTNSNV ASLSTGLNST NSSLTSLSTS
TSSGISSANS SIASLSTSTS TGISSLSTGL STTDSNLTSL STSTSTGLSS ATSSITSLSS
GLSTTNSNVA SLSTGLSSTN SSLTSLSTSA SSGISTAQSG VNSLSTGLST TNSTVASLST
STSTGIGSLS TGLSTIDSNL ASLSTSTSTG LSSATSSIAS LSTSTSTGIS SLSTGLSTTD
SNLASLSTST STGLSSATSS IASLSTSTST GITSLSTGLS TTDSNLTSLS TSTSTGLSSA
TSSITSLSSG LSTTNSNVAS LSTGLSSTNS SLTSLSTSTS SGISSANSSI ASLSTSTSTG
ITSLSTGLST TDSNLASLST STSTGLSSAT SSIASLSTST STSVDSLSTG LSTTNSSVAS
LSTGLSTTNS SVASLSTGLS TTDSSLASLS TSTSTGLSST TSSIASLSTS TSTSFSSALS
SIGSLSTGLA TTNSNLASLS TSTLTSVSSL STGLSATNSS VASLSTSAST GLAATNSTVA
SLSTSTSTAV GSLSTGLSTT NSNVASLSTS TSTAVGSLST SLSTTNSNVA SLSTSTSTSV
NSLSTGLSTT NTSVASLSTS VTNLNTQLTS LSTTIVNSTN NVIRTLPAST GIAADMSAPN
AAAPSVTAGS NSVALGANST DGGRSNVVSV GSATQQRQIT NVAAGTEGTD AVNVNQLNAL
STSMSQSLAG QQGQINNLGS QLTQTQQALQ QTDTMARQGI AAATALTMLP QVEPGKTINV
AVGVARFAGQ SGMAFGASAH VTTNGILKLG IGVSGQNKTF GAGYGYSW