Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1822 |
Symbol | |
ID | 4886691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1778810 |
End bp | 1780240 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640131760 |
Product | putative outer membrane protein TolC |
Protein accession | YP_001062817 |
Protein GI | 126442528 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAAA CAGAGAATTT GGGTGCCTCG TCTAAATTTC GGCATGCGGG TTTTCACTTA ACGACAAAAT CTTTTCGGCT TCGCGCCAAT GTGGCCTACG TTACCGTTTG CGCGGTGCTT GCCGCATTCG GCACCGGTTC CGAATCCGCA CGTGCGTTCT GTCTCGACGA GGCTTATCAG CACGCAATAT CGAACGATCC GAAATTTCTC CAGGCGCGGG CGGAATACGA CGCCGCGCGG CAGAAGTTTC CGCAGGCGCG CGCGCAGATG CTGCCGCAGG TGAGCGCGCA GCTCGAATGG GGCCGCTACG GCACGCACGC GAACCTGTTC GGCATCGACG TGAGCGGCAA CAGCACCGCC GCGTACGGCG CCGCGCAGCT CTCGCAGGCG CTGTTCAACA TGCCGTATTT GTACGACATG AGCCGCGCGA AGGAATTCGA GGAATCCGCG CGGCAGCAGC TCGAAGTCGC GAAGCAGGAG CTGATCATGC GCGTCGCGAA CGCGTGCTTC GATCTGCTGT CCGCGCGCGA GAAGCTGCAG CTCGCCGACG ACGAGGTCGG CGCGCTCACG CGCCTGGAGA GCGATACCCG CCGCATGGCG CAGCTCGGCA TGAAGACCAT CGGCGACACG GCCGAGATCG AGGCGCGCAG GAGCCTCGCG CAGTCGGACG AGGCGCTCGC GCGCACCGAC GTCGAGGCGC GGCGCGCCCG CTACGAGACG CTGCTCGGCT CCGCGATCGA CTTCACGCGC TGGCCGCGGC TCGCGATGCA CGGCACGTCG CCGCGCATTC CGACGGGCGA CTACCAGCCG CAGGACAACC CGTCGTACCA GCAGGCGTAT CGCGACCTGC GCGTCGCGCG GCTCGCGTCC AAGCGCATCA ACGCGGAGCA CCTGCCGAGC GTCGACCTGT TCGCGACGTA CTCGCGCGGC CTCAATCCGA ACCTGCGCGG CCTCACCGAC AAGAACGACT TCCACCAGAG CGCGGTCGGC GTGCAGGTGA CGATTCCGAT CTTCTCGGGC GGCAGCGTGC ACTACCGGAA GATCGAGGCC GACCACGTCG CGACGCAGTA CCAGAACCGG CTGCGCGAGG TCGAGCAGCA ACTGAGCACC GATCATCGCG AGACGCTCGC GGCGCTGCAG TCGATCGGCA CGCGGATCCG CGCGCTGCAG CAATCGCTGC AGGCGGCGCG GCTCGCGTAC GATTCGTCGA TGAAGGCGCA CCAGGTCGGC TACAGTACGA CGTACGAGAC GCTGAACCTG CGCACCGACA TCTCGAACAT CCGCCAGAAG CTGTTCGAGA GCTACCTCGA CGCGCTGAAG CTCCAGCTGA AGCTCAAGGG CATTCTCGGC ACGCTGGACG AGCAGTCGCT CGTCGCGGTC GACAGCTTCC TCGCGAGCAA CGCGGCGCCC GCCGATCAGA AGAGCGAATG A
|
Protein sequence | MFETENLGAS SKFRHAGFHL TTKSFRLRAN VAYVTVCAVL AAFGTGSESA RAFCLDEAYQ HAISNDPKFL QARAEYDAAR QKFPQARAQM LPQVSAQLEW GRYGTHANLF GIDVSGNSTA AYGAAQLSQA LFNMPYLYDM SRAKEFEESA RQQLEVAKQE LIMRVANACF DLLSAREKLQ LADDEVGALT RLESDTRRMA QLGMKTIGDT AEIEARRSLA QSDEALARTD VEARRARYET LLGSAIDFTR WPRLAMHGTS PRIPTGDYQP QDNPSYQQAY RDLRVARLAS KRINAEHLPS VDLFATYSRG LNPNLRGLTD KNDFHQSAVG VQVTIPIFSG GSVHYRKIEA DHVATQYQNR LREVEQQLST DHRETLAALQ SIGTRIRALQ QSLQAARLAY DSSMKAHQVG YSTTYETLNL RTDISNIRQK LFESYLDALK LQLKLKGILG TLDEQSLVAV DSFLASNAAP ADQKSE
|
| |