Gene BURPS668_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2027 
Symbol 
ID4885601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2020672 
End bp2023560 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content68% 
IMG OID640127955 
Productputative outer membrane protein 
Protein accessionYP_001059062 
Protein GI126441648 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3846] Type IV secretory pathway, TrbL components 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.782084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGCGCG GTGGCCGCGC GTCGAGCGTC GTCGCGTCCG CCGGCGGATT GGAGAAAGTG 
CTCAAGCTGT CGATTCTGGG CGCGGCATCG CTGATTGCGA TGGGCGTGGT CGGACCGTTT
GCCGAGGAGG CAATGGCGGC GAATAACACG GGCCTGTGCC TGACGTACAA CGGGGGGAGC
AACAACGTTG CCGGCAGTGG CGGCTTGATC ACCGGCAACG GTTGTAATTC GGCCGGGTGG
ATCAACGGCA TGGTTCCGGG CGGCTCGACG AATTGGATGG GGATGACCGC GGACGACACG
CAGATCGTGC TCGACGGCAG TGCGGGCAGC ATTTACTTCC GGACGGGCGG CATAAACGGC
AACGTGTTGA CGATGTCGAA CGCGACCGGC GGCGTATTGC TCAGCGGCCT CGCGGCCGGC
GTCAAGCCGA CCGATGCGGT CAACATGTCC CAGTTGACCT CGTTGTCGAC GTCGACGGCA
ACCGGCATCA CCTCGCTTTC GACGTCGACG GCAACCAGCA TCGCTTCGCT TTCGACGAGC
ATGCTGTCGC TCGGCGTGGG CGTCGTGACG CAAGACGCCT CGACCGGCGC GATCAGCGTC
GGCGCCAATT CGCCGGGCCT GACGGTGGAT TTCGCGGGGG GCCAGGGCCC GCGCACGCTG
ACGGGCGTCG CCGCGGGCGT CAACGCTACG GACGCGGTCA ATGTCGGCCA GTTGGCGTCG
CTGTCGACGA GCACGGCAGC GGGACTTTCC ACCGCCGCGA GCGGCGTCGC GTCGCTGTCG
ACGTCGCTGC TCGGCGCGGC GGGCGATCTG GCGTCACTGT CGACGAGCGC ATCGACGGGA
CTCGCCACTG CGGATAGCGG CATCGCGTCG TTGTCCACGT CGCTGCTCGG CACCACGAAC
AACGTGACGT CGCTGTCGAC GAGCCTCAGC ACGGTCAACG CGAATCTGGC CGGCCTGCAG
ACCTCGGTGG ACAACGTCGT GTCATACGAC GATCCGTCGA AGTCGGCGAT CACCCTCGGC
GGCGCGGGCG TCGCGACGCC CGTCCTGCTG ACGAACGTGG CTGCGGGGAA GATCGCCGCG
ACCAGCACGG ACGCGGTGAA CGGTTCGCAG CTTTACACGC TCCAGCAGGA GTTCTCGCAG
CAGTACGATC TGCTGACGTC GCAAGTCTCG TCGCTCAGCA CTTCGGTGTC GGGTCTCCAA
GGCAGCGTCT CGGCAAATAC GGGAACCGCG TCGGGTGATA ACAGCACGGC AAGCGGCACG
AATGCGTCGG CGAGCGGTGA GAACAGCACG GCGACGGGTA CAGACTCGAC CGCATCGGGC
AGCAACAGCA CAGCCAACGG GACGAACTCG ACCGCGTCGG GTGATAACAG CACGGCGAGC
GGGACGAACG CATCGGCGAC GGGCGAGAAC AGCACGGCGA CGGGCACGGA TTCGACCGCA
TCGGGCACGA ATAGCACGGC CAACGGGACG AACTCGACCG CGTCGGGCGA TAACAGCACG
GCGAGCGGCA CGAACGCATC GGCGACGGGT GAGAACAGCA CGGCGACGGG CACGGATTCG
ACCGCGTCGG GCAGCAACAG CACGGCGAGC GGCACGAACG CATCGGCGAC GGGCGAGAAC
AGCACGGCGA CAGGCACGGA TTCGACCGCG TCGGGCACGA ACAGCACGGC CAACGGGACG
AACTCGACTG CGTCGGGCGA TAACAGCACG GCGAGCGGCA CGAACGCATC GGCGACGGGC
GAGAACAGCA CGGCGACGGG CACGGCTTCG ACCGCATCGG GCAGCAATAG CACGGCCAAC
GGCACGAATT CGACCGCGTC GGGCGATAAC AGCACGGCGA GCGGCACGAA TGCATCGGCG
ACCGGTGAGA ACAGCACGGC GACGGGCACG GCTTCGACCG CATCGGGCAG CAATAGCACG
GCCAACGGCA CGAACTCGAC CGCGTCGGGC GATAACAGCA CGGCGAGCGG CACGAACGCG
TCGGCGACGG GTGAAAACAG CACGGCGACG GGTACGGCTT CGACTGCGTC GGGCAGCAAC
AGCACGGCCA ACGGTGCGAA CTCGACGGCA TCCGGCGCGG GGGCGACGGC AACGGGTGAA
AACGCCGCAG CCACGGGCGC GGGCGCGACG GCGACCGGCA ACAATGCATC GGCATCGGGC
ACGAGCAGCA CGGCCGGCGG TGCGAACGCA ATCGCGTCGG GCGAGAACAG CACGGCCAAC
GGTGCGAACT CGACGGCATC CGGCAACGGC AGCTCGGCGT TCGGCGAGAG CGCGGCGGCA
GCCGGCGACG GCAGCACGGC GCTGGGTGCA AACGCTGTCG CATCGGGTGT CGGCAGCGTC
GCGACGGGCG CGGGTTCGGT CGCGTCCGGC GCGAACAGTT CGGCGTACGG TACGGGCTCG
AACGCGGCGG GCGCGGGCAG CGTCGCCATC GGTCAGGGCG CGACGGCCTC GGGATCGAAC
TCGGTCGCGC TTGGCACCGG TTCTGTCGCG TCGGAGGACA ACACGGTATC GGTCGGCTCC
GCAGGCAGCG AGCGCAGGAT CACCAACGTC GCCGCCGGCG TCAATGCCAC CGACGCCGTC
AACGTCGGCC AGTTGAACAG CGCCGTGTCG GGCATCCAGC ATCAGATGGA CGGCATGCAA
GGTCAGATCG ATACGCTTGC ACGCGACGCG TATTCCGGTA TCGCGGCCGC GACCGCGTTG
ACGATGATTC CGGACGTGGA TCCGGGCAAG ACGCTGGCCG TGGGCATCGG CACGGCCAAT
TTCAAGGGCT ACCAGGCTTC CGCGCTCGGC GCGACCGCAC GTATCACCCA GAACCTCAAG
GTGAAGACGG GCGTGAGCTA CAGCGGCAGC AACTACGTGT GGGGCGCAGG CATGTCGTAT
CAATGGTAA
 
Protein sequence
MARGGRASSV VASAGGLEKV LKLSILGAAS LIAMGVVGPF AEEAMAANNT GLCLTYNGGS 
NNVAGSGGLI TGNGCNSAGW INGMVPGGST NWMGMTADDT QIVLDGSAGS IYFRTGGING
NVLTMSNATG GVLLSGLAAG VKPTDAVNMS QLTSLSTSTA TGITSLSTST ATSIASLSTS
MLSLGVGVVT QDASTGAISV GANSPGLTVD FAGGQGPRTL TGVAAGVNAT DAVNVGQLAS
LSTSTAAGLS TAASGVASLS TSLLGAAGDL ASLSTSASTG LATADSGIAS LSTSLLGTTN
NVTSLSTSLS TVNANLAGLQ TSVDNVVSYD DPSKSAITLG GAGVATPVLL TNVAAGKIAA
TSTDAVNGSQ LYTLQQEFSQ QYDLLTSQVS SLSTSVSGLQ GSVSANTGTA SGDNSTASGT
NASASGENST ATGTDSTASG SNSTANGTNS TASGDNSTAS GTNASATGEN STATGTDSTA
SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTDS TASGSNSTAS GTNASATGEN
STATGTDSTA SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTAS TASGSNSTAN
GTNSTASGDN STASGTNASA TGENSTATGT ASTASGSNST ANGTNSTASG DNSTASGTNA
SATGENSTAT GTASTASGSN STANGANSTA SGAGATATGE NAAATGAGAT ATGNNASASG
TSSTAGGANA IASGENSTAN GANSTASGNG SSAFGESAAA AGDGSTALGA NAVASGVGSV
ATGAGSVASG ANSSAYGTGS NAAGAGSVAI GQGATASGSN SVALGTGSVA SEDNTVSVGS
AGSERRITNV AAGVNATDAV NVGQLNSAVS GIQHQMDGMQ GQIDTLARDA YSGIAAATAL
TMIPDVDPGK TLAVGIGTAN FKGYQASALG ATARITQNLK VKTGVSYSGS NYVWGAGMSY
QW