Gene Information |
Locus tag | BURPS1106A_1792 |
Symbol | |
ID | 4900643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1753644 |
End bp | 1754651 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640135022 |
Product | type II secretion system protein |
Protein accession | YP_001066061 |
Protein GI | 126453046 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4965] Flp pilus assembly protein TadB |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0984165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCG CGGCGCTCTG GGCGCTCGCG CTCGCGCTGC TGTGCGTCGC CGGGGCGTTC GCGCTATGGC GGCGCGGCGA GGCGAACAGG GAGCGCGCGC ATGCGGCGCG CTACATCGAC AGCCGGCTCG AGCCCGGCGC GCGCGCGAGC GCGCAGCCGA AGATGCCGGC CGCGGCCGAG CCCAAGCGCG CGGCGCCCAT GCCGGCCGCG GGCGCGGCGG GCGGCGCGCG CGCGGAGAAG CCCGCCGAAG GGCTCGCGCG CTGGCGTGAG CGCGCGGCCG ACGCATGGCT GAACGTGTCG AACCGCGCGG GCGTGTCCGA GATCCGCGCG CCGCTCGCCG CGCTCGCCGC GACGACGGCC GTCGCCACGC TGTGGGCGGG CCTGCGCGGC GGGCTGCTCG CCGCCTGCGC GGCGCTCGTC GCGGGCGCGA CGCTCGCGGT CTTCTGGCTC GTGTCGCGGA TGCAGAAGCG GCGGCTGCGG ATCGTGCGCC AACTGCCGTC GTTCCTCGAC GGCATCGTGC GTCTCGTCAC GCTCGGCAAC AGCGTGCCGG CCGCGTTCCA GGCGACGCTG CAGACGACCG AGGCGCCGCT GCGCGGCTGT CTCGATCACG TGTCGCGGAT GCTGCGCTCG GGCGTCGAGA TCGACCGTGC GATGGTGTCC ATCGCGGCGC TCTACCGGAT CAAGGAATTC GAGCTCGTCG GCTCGGTGCT GCGGTTGTCC GTCAAGTACG GCGGCCGCGC CGACGTGATG CTCGACCGAA TGGCCGTGTT CATGCGCGAT CTCGAGCAGG CCGAGCGCGA GCTCGTCGCG ATGTCGGCGG AGACGCGGCT GTCGGCATGG GTGCTCGGCG CGCTGCCCGT GGGCATCGGC AGCTTCGTGA TCGCGACGAA TCCGAAATAT TTCAGCGCGA TGTGGCTTGA CCCGACGGGC CGCCAGCTCG TGTATCTCGC ATTCATCCTG CAAATCGCCG GCGGCTACTG GCTGTACCGG CTCGCCCGAT TGAGGTGA
Protein sequence | MSSAALWALA LALLCVAGAF ALWRRGEANR ERAHAARYID SRLEPGARAS AQPKMPAAAE PKRAAPMPAA GAAGGARAEK PAEGLARWRE RAADAWLNVS NRAGVSEIRA PLAALAATTA VATLWAGLRG GLLAACAALV AGATLAVFWL VSRMQKRRLR IVRQLPSFLD GIVRLVTLGN SVPAAFQATL QTTEAPLRGC LDHVSRMLRS GVEIDRAMVS IAALYRIKEF ELVGSVLRLS VKYGGRADVM LDRMAVFMRD LEQAERELVA MSAETRLSAW VLGALPVGIG SFVIATNPKY FSAMWLDPTG RQLVYLAFIL QIAGGYWLYR LARLR
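The lengths listed under Gene Information follow from the coordinates and the sequences above. The sketch below is illustrative only and is not part of the source record: it recomputes the gene and protein lengths from the start and end positions, then translates the first 30 bases and the last three codons of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The names `CODON_TABLE`, `translate`, `first_block`, and `last_codons` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates copied from the record (1-based, inclusive).
start_bp, end_bp = 1753644, 1754651
gene_length = end_bp - start_bp + 1      # 1008 bp
protein_length = gene_length // 3 - 1    # 336 codons minus the stop codon = 335 aa

# First three 10-base blocks and last three codons of the gene sequence above.
first_block = "ATGTCTAGCG CGGCGCTCTG GGCGCTCGCG"
last_codons = "TTGAGGTGA"

print(gene_length, protein_length)
print(translate(first_block))   # compare with the start of the protein sequence
print(translate(last_codons))   # compare with the end of the protein sequence
```

The translated fragments can be compared against the first ten and last two residues of the protein sequence in the record.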