Gene Information |
Locus tag | BURPS1710b_2229 |
Symbol | |
ID | 3690239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 2489664 |
End bp | 2492870 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637728684 |
Product | Hep_Hag family protein |
Protein accession | YP_333623 |
Protein GI | 76811430 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3846] Type IV secretory pathway, TrbL components |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
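
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon. Neither is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2489664
end_bp = 2492870
gene_length_bp = 3207
protein_length_aa = 1068


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon, which encodes no residue


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Strand is not used here: the gene lies on the minus strand, but that changes only which strand is read, not the length of the span.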
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGGA TTTTCAAATC GATCTGGTGC GAACAGACGC GTACGTGGGT TGCGGCATCG GAGCATGCCG TGGCGCGCGG TGGCCGCGCG TCGAGCGTCG TCGCGTCCGC CGGCGGATTG GAGAAGGTGC TCAAGCTGTC GATTCTGGGC GCGGCATCGC TGATTGCGAT GGGCGTGGTC GGACCGTTTG CCGAGGAGGC AATGGCGGCG AATAACGCCG GTGTGTGTTT GACGTACAAC GGTAGTAGCA ACAATACATC AGGTACTGGC GGCTGGTTCG CTGATGGTTG TAAATCGGCC GGCTGGGTGC AGGGCATGGT TACGAATAGC AAGACGGATT GGGTCGGGCT GACCGCGGAC GACACGCAGA TCGTGCTCGA CGGTAGCGCG GGCAGCATTT ACTTCCGGAC GGGCGGCATA AACGGCAACG TGTTGACGAT GTCGAACGCG ACCGGCGGCG TATTGCTCAG CGGCCTCGCG GCCGGCGTCA ATCCGACCGA TGCGGTCAAC ATGTCCCAGT TGACCTCGTT GTCGACGTCG ACGGCAACCG GCATCACCTC GCTTTCGACG TCGACGGCAA CCAGCATCGC TTCGCTTTCG ACGAGCATGC TGTCGCTCGG CGTGGGCGTC GTGACGCAAG ACGCCTCGAC CGGCGCGATC AGCGTCGGCG CCAATTCGCC GGGCCTGACG GTGGATTTCG CGGGGGGCCA GGGCGCGCGC ACGCTGACGG GCGTCGCCGC GGGCGTCAAC GCTACGGACG CGGTCAATGT CGGCCAGCTG GCGTCGCTGT CGACGAGCAC GGCAGCGGGG CTTTCCACCG CCGCGAGCGG CGTCGCGTCG CTGTCGACGT CGCTGCTCGG CGCGGCGGGC GATCTGGCGT CACTGTCGAC GAGCGCGTCG ACGGGGCTCG CCACTGCGGA TAGCGGCATC GCGTCGTTGT CCACGTCGCT GCTCGGTACC GCGGACAACG TGACGTCGCT GTCGACGAGC CTCAGCACGG TCAACGCGAA TCTGGCCGGC CTGCAGACCT CGGTGGACAA CGTCGTGTCA TACGACGATC CGTCGAAGTC GGCGATCACC CTCGGCGGCG CGGGCGTCAC GACGCCCGTC CTGCTGACGA ACGTGGCTGC GGGGAAGATC GCCGCGACCA GCACGGACGC GGTGAACGGT TCGCAGCTTT ACACGCTCCA GCAGGAATTC TCGCAGCAGT ACGATCTGCT GACGTCGCAA GTCTCGTCGC TCAGCACTTC GGTGTCGGGT CTCCAAGGCA GCGTCTCGGC AAATACGGGA ACCGCGTCGG GTGACAACAG CACGGCGAGC GGTGACAACG CGACCGCGTC GGGCACGAAC AGCACGGCCA ACGGCACGAA CTCGACCGCG TCGGGTGATA ACAGCACGGC AAGCGGGACG AATGCGTCGG CGAGCGGCGA GAACAGCACG GCGACGGGTA CGGACTCGAC CGCATCGGGT AGCAATAGCA CGGCCAACGG GACGAACTCG ACTGCGTCGG GTGATAACAG CACGGCAAGC GGGACGAATG CGTCGGCGAC GGGCGAGAAC AGCACGGCGA GCGGCACGAA CGCATCGGCG ACGGGCGAGA ACAGCACGGC GACGGGCACG GCTTCGACCG CATCGGGCAG TAACAGCACG GCGAACGGCA CGAACTCGAC CGCGTCGGGC GAGAACAGCA CGGCGACGGG TACGGACTCG ACCGCATCGG GCAGCAACAG CACGGCCAAC GGGACGAACT CGACCGCGTC GGGTGATAAC AGCACGGCGA GCGGCACGAA CGCATCGGCG 
ACCGGTGAGA ACAGCACGGC GACAGGCACG GATTCGACCG CATCGGGCAG CAACAGCACA GCCAACGGCA CGAACTCGAC CGCGTCGGGC GATAACAGCA CGGCGAGCGG CACGAACGCA TCAGCGACCG GTGAGAACAG CACGGCGACG GGTACGGACT CGACCGCATC GGGCAGCAAC AGCACGGCCA ACGGGGCGAA CTCGACCGCA TCGGGCGATA ACAGCACGGC GAGCGGCACG AACGCATCGG CGACGGGCGA GAACAGCACG GCGACAGGCA CGGATTCGAC CGCATCGGGC AGCAACAGCA CGGCCAACGG AACGAACTCG ACTGCGTCGG GCAACAACAG CACGGCGAGC GGGACGAACG CGTCGGCGAC GGGTGAAAAC AGCACGGCGA CGGGTACGGA CTCGGCCGCA TCCGGTACGA ATAGCACGGC CAACGGCACG AACTCGACCG CGTCGGGCGA TAACAGCACG GCGAGCGGGA CGAATGCGTC GGCGACCGGT GAGAACAGCA CGGCGACGGG TACGGCTTCG ACCGCGTCGG GCAGCAACAG CACGGCCAAC GGTGCGAACT CGACGGCATC CGGCGCGGGG GCGACGGCAA CGGGTGAAAA CGCCGCAGCC ACGGGCGCGG GCGCGACGGC GACCGGCAAC AACGCATCGG CATCGGGCAC GAGCAGCACG GCCGGCGGTG CGAATGCAAT CGCGTCGGGC GAGAACAGCA CGGCCAACGG CGCGAACTCG ACCGCATCCG GCAACGGCAG CTCGGCGTTC GGCGAGAGCG CGGCGGCAGC CGGCGACGGC AGCACGGCGC TGGGTTCAAA CGCTGTCGCG TCGGGTGTCG GCAGCGTCGC GACGGGCGCG GGTTCGGTCG CGTCCGGCGC GAACAGTTCG GCGTACGGTA CGGGCTCGAA CGCGACGGGC GCGGGCAGCG TCGCCATCGG TCAGGGCGCG ACGGCCTCGG GATCGAACTC GGTCGCGCTT GGCACCGGTT CTGTCGCGTC GGAGGACAAC ACGGTATCGG TCGGCTCCGC AGGCAGCGAG CGCAGGATCA CCAACGTCGC CGCCGGCGTC AATGCCACCG ACGCCGTCAA CGTCGGCCAG TTGAACAGCG CCGTGTCGGG CATCCGGAAT CAGATGGACG GCATGCAAGG CCAGATCGAT ACGCTTGCAC GCGATGCGTA TTCCGGTATC GCGGCCGCGA CCGCGTTGAC GATGATTCCG GACGTGGATC CGGGCAAGAC GCTGGCCGTG GGCATCGGCA CGGCCAATTT CAAGGGCTAC CAAGCCTCCG CGCTCGGCGC GACCGCACGT ATCACCCAGA ACCTCAAGGT GAAGACGGGC GTGAGCTACA GCGGCAGCAA CTACGTGTGG GGCGCGGGCA TGTCGTATCA ATGGTAA
|
Protein sequence | MNRIFKSIWC EQTRTWVAAS EHAVARGGRA SSVVASAGGL EKVLKLSILG AASLIAMGVV GPFAEEAMAA NNAGVCLTYN GSSNNTSGTG GWFADGCKSA GWVQGMVTNS KTDWVGLTAD DTQIVLDGSA GSIYFRTGGI NGNVLTMSNA TGGVLLSGLA AGVNPTDAVN MSQLTSLSTS TATGITSLST STATSIASLS TSMLSLGVGV VTQDASTGAI SVGANSPGLT VDFAGGQGAR TLTGVAAGVN ATDAVNVGQL ASLSTSTAAG LSTAASGVAS LSTSLLGAAG DLASLSTSAS TGLATADSGI ASLSTSLLGT ADNVTSLSTS LSTVNANLAG LQTSVDNVVS YDDPSKSAIT LGGAGVTTPV LLTNVAAGKI AATSTDAVNG SQLYTLQQEF SQQYDLLTSQ VSSLSTSVSG LQGSVSANTG TASGDNSTAS GDNATASGTN STANGTNSTA SGDNSTASGT NASASGENST ATGTDSTASG SNSTANGTNS TASGDNSTAS GTNASATGEN STASGTNASA TGENSTATGT ASTASGSNST ANGTNSTASG ENSTATGTDS TASGSNSTAN GTNSTASGDN STASGTNASA TGENSTATGT DSTASGSNST ANGTNSTASG DNSTASGTNA SATGENSTAT GTDSTASGSN STANGANSTA SGDNSTASGT NASATGENST ATGTDSTASG SNSTANGTNS TASGNNSTAS GTNASATGEN STATGTDSAA SGTNSTANGT NSTASGDNST ASGTNASATG ENSTATGTAS TASGSNSTAN GANSTASGAG ATATGENAAA TGAGATATGN NASASGTSST AGGANAIASG ENSTANGANS TASGNGSSAF GESAAAAGDG STALGSNAVA SGVGSVATGA GSVASGANSS AYGTGSNATG AGSVAIGQGA TASGSNSVAL GTGSVASEDN TVSVGSAGSE RRITNVAAGV NATDAVNVGQ LNSAVSGIRN QMDGMQGQID TLARDAYSGI AAATALTMIP DVDPGKTLAV GIGTANFKGY QASALGATAR ITQNLKVKTG VSYSGSNYVW GAGMSYQW
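
The sequences above are printed in space-separated blocks of 10 characters, so the spacing must be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, so its result is not the 67% reported for the whole gene.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = clean_sequence("ATGAACAGGA TTTTCAAATC")
print(len(demo), gc_fraction(demo))
```

Passing the complete Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field.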