Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2027 |
Symbol | |
ID | 4885601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2020672 |
End bp | 2023560 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640127955 |
Product | putative outer membrane protein |
Protein accession | YP_001059062 |
Protein GI | 126441648 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3846] Type IV secretory pathway, TrbL components |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.782084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGCGCG GTGGCCGCGC GTCGAGCGTC GTCGCGTCCG CCGGCGGATT GGAGAAAGTG CTCAAGCTGT CGATTCTGGG CGCGGCATCG CTGATTGCGA TGGGCGTGGT CGGACCGTTT GCCGAGGAGG CAATGGCGGC GAATAACACG GGCCTGTGCC TGACGTACAA CGGGGGGAGC AACAACGTTG CCGGCAGTGG CGGCTTGATC ACCGGCAACG GTTGTAATTC GGCCGGGTGG ATCAACGGCA TGGTTCCGGG CGGCTCGACG AATTGGATGG GGATGACCGC GGACGACACG CAGATCGTGC TCGACGGCAG TGCGGGCAGC ATTTACTTCC GGACGGGCGG CATAAACGGC AACGTGTTGA CGATGTCGAA CGCGACCGGC GGCGTATTGC TCAGCGGCCT CGCGGCCGGC GTCAAGCCGA CCGATGCGGT CAACATGTCC CAGTTGACCT CGTTGTCGAC GTCGACGGCA ACCGGCATCA CCTCGCTTTC GACGTCGACG GCAACCAGCA TCGCTTCGCT TTCGACGAGC ATGCTGTCGC TCGGCGTGGG CGTCGTGACG CAAGACGCCT CGACCGGCGC GATCAGCGTC GGCGCCAATT CGCCGGGCCT GACGGTGGAT TTCGCGGGGG GCCAGGGCCC GCGCACGCTG ACGGGCGTCG CCGCGGGCGT CAACGCTACG GACGCGGTCA ATGTCGGCCA GTTGGCGTCG CTGTCGACGA GCACGGCAGC GGGACTTTCC ACCGCCGCGA GCGGCGTCGC GTCGCTGTCG ACGTCGCTGC TCGGCGCGGC GGGCGATCTG GCGTCACTGT CGACGAGCGC ATCGACGGGA CTCGCCACTG CGGATAGCGG CATCGCGTCG TTGTCCACGT CGCTGCTCGG CACCACGAAC AACGTGACGT CGCTGTCGAC GAGCCTCAGC ACGGTCAACG CGAATCTGGC CGGCCTGCAG ACCTCGGTGG ACAACGTCGT GTCATACGAC GATCCGTCGA AGTCGGCGAT CACCCTCGGC GGCGCGGGCG TCGCGACGCC CGTCCTGCTG ACGAACGTGG CTGCGGGGAA GATCGCCGCG ACCAGCACGG ACGCGGTGAA CGGTTCGCAG CTTTACACGC TCCAGCAGGA GTTCTCGCAG CAGTACGATC TGCTGACGTC GCAAGTCTCG TCGCTCAGCA CTTCGGTGTC GGGTCTCCAA GGCAGCGTCT CGGCAAATAC GGGAACCGCG TCGGGTGATA ACAGCACGGC AAGCGGCACG AATGCGTCGG CGAGCGGTGA GAACAGCACG GCGACGGGTA CAGACTCGAC CGCATCGGGC AGCAACAGCA CAGCCAACGG GACGAACTCG ACCGCGTCGG GTGATAACAG CACGGCGAGC GGGACGAACG CATCGGCGAC GGGCGAGAAC AGCACGGCGA CGGGCACGGA TTCGACCGCA TCGGGCACGA ATAGCACGGC CAACGGGACG AACTCGACCG CGTCGGGCGA TAACAGCACG GCGAGCGGCA CGAACGCATC GGCGACGGGT GAGAACAGCA CGGCGACGGG CACGGATTCG ACCGCGTCGG GCAGCAACAG CACGGCGAGC GGCACGAACG CATCGGCGAC GGGCGAGAAC AGCACGGCGA CAGGCACGGA TTCGACCGCG TCGGGCACGA ACAGCACGGC CAACGGGACG AACTCGACTG CGTCGGGCGA TAACAGCACG GCGAGCGGCA CGAACGCATC GGCGACGGGC GAGAACAGCA CGGCGACGGG CACGGCTTCG ACCGCATCGG GCAGCAATAG CACGGCCAAC GGCACGAATT CGACCGCGTC GGGCGATAAC AGCACGGCGA GCGGCACGAA TGCATCGGCG ACCGGTGAGA ACAGCACGGC GACGGGCACG GCTTCGACCG CATCGGGCAG CAATAGCACG GCCAACGGCA CGAACTCGAC CGCGTCGGGC GATAACAGCA CGGCGAGCGG CACGAACGCG TCGGCGACGG GTGAAAACAG CACGGCGACG GGTACGGCTT CGACTGCGTC GGGCAGCAAC AGCACGGCCA ACGGTGCGAA CTCGACGGCA TCCGGCGCGG GGGCGACGGC AACGGGTGAA AACGCCGCAG CCACGGGCGC GGGCGCGACG GCGACCGGCA ACAATGCATC GGCATCGGGC ACGAGCAGCA CGGCCGGCGG TGCGAACGCA ATCGCGTCGG GCGAGAACAG CACGGCCAAC GGTGCGAACT CGACGGCATC CGGCAACGGC AGCTCGGCGT TCGGCGAGAG CGCGGCGGCA GCCGGCGACG GCAGCACGGC GCTGGGTGCA AACGCTGTCG CATCGGGTGT CGGCAGCGTC GCGACGGGCG CGGGTTCGGT CGCGTCCGGC GCGAACAGTT CGGCGTACGG TACGGGCTCG AACGCGGCGG GCGCGGGCAG CGTCGCCATC GGTCAGGGCG CGACGGCCTC GGGATCGAAC TCGGTCGCGC TTGGCACCGG TTCTGTCGCG TCGGAGGACA ACACGGTATC GGTCGGCTCC GCAGGCAGCG AGCGCAGGAT CACCAACGTC GCCGCCGGCG TCAATGCCAC CGACGCCGTC AACGTCGGCC AGTTGAACAG CGCCGTGTCG GGCATCCAGC ATCAGATGGA CGGCATGCAA GGTCAGATCG ATACGCTTGC ACGCGACGCG TATTCCGGTA TCGCGGCCGC GACCGCGTTG ACGATGATTC CGGACGTGGA TCCGGGCAAG ACGCTGGCCG TGGGCATCGG CACGGCCAAT TTCAAGGGCT ACCAGGCTTC CGCGCTCGGC GCGACCGCAC GTATCACCCA GAACCTCAAG GTGAAGACGG GCGTGAGCTA CAGCGGCAGC AACTACGTGT GGGGCGCAGG CATGTCGTAT CAATGGTAA
|
Protein sequence | MARGGRASSV VASAGGLEKV LKLSILGAAS LIAMGVVGPF AEEAMAANNT GLCLTYNGGS NNVAGSGGLI TGNGCNSAGW INGMVPGGST NWMGMTADDT QIVLDGSAGS IYFRTGGING NVLTMSNATG GVLLSGLAAG VKPTDAVNMS QLTSLSTSTA TGITSLSTST ATSIASLSTS MLSLGVGVVT QDASTGAISV GANSPGLTVD FAGGQGPRTL TGVAAGVNAT DAVNVGQLAS LSTSTAAGLS TAASGVASLS TSLLGAAGDL ASLSTSASTG LATADSGIAS LSTSLLGTTN NVTSLSTSLS TVNANLAGLQ TSVDNVVSYD DPSKSAITLG GAGVATPVLL TNVAAGKIAA TSTDAVNGSQ LYTLQQEFSQ QYDLLTSQVS SLSTSVSGLQ GSVSANTGTA SGDNSTASGT NASASGENST ATGTDSTASG SNSTANGTNS TASGDNSTAS GTNASATGEN STATGTDSTA SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTDS TASGSNSTAS GTNASATGEN STATGTDSTA SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTAS TASGSNSTAN GTNSTASGDN STASGTNASA TGENSTATGT ASTASGSNST ANGTNSTASG DNSTASGTNA SATGENSTAT GTASTASGSN STANGANSTA SGAGATATGE NAAATGAGAT ATGNNASASG TSSTAGGANA IASGENSTAN GANSTASGNG SSAFGESAAA AGDGSTALGA NAVASGVGSV ATGAGSVASG ANSSAYGTGS NAAGAGSVAI GQGATASGSN SVALGTGSVA SEDNTVSVGS AGSERRITNV AAGVNATDAV NVGQLNSAVS GIQHQMDGMQ GQIDTLARDA YSGIAAATAL TMIPDVDPGK TLAVGIGTAN FKGYQASALG ATARITQNLK VKTGVSYSGS NYVWGAGMSY QW
|
| |