Gene Information |
Locus tag | BURPS1106A_0881 |
Symbol | |
ID | 4900371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 864234 |
End bp | 865535 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640134111 |
Product | hypothetical protein |
Protein accession | YP_001065162 |
Protein GI | 126453437 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAG GACGTCGACA CTTCGTGCGC TCGGTTGCGA GCGCCTCGGC CGCGCTCGCG GCCGCCGCAT GGTCCCCGGC GCGCGCCGCA ATCGACGCGC CCGCCTCGCC CGCGACCGCG CTGTCGCTCA CGCCCGGGCG CTGGTCGCCG AACAACGTCG CGCGGCTGCG CGCGGTGCTC GCCGGGCACG GCGCGTCGAG CCCGCGCTAC CGCCCCGAGC ACCGCCCGTA CGCGGTGTTC GACTGGGACA ACACGAGCAT CATGAACGAC TGCGAAGAAG CGCTGCTGAT GCACCAGATC GACGGGCTGC ATTACCGGCT CACGCCCGAG CAGTTCTCGG CGATCCTGCG CCAGGGCGTG CCCGACGGCC CGTTCGACGC GAAGCTCGGC TATACGAGCG TCGACGGCAA GCCCGTGCGG ATGGAGGACA TCGCGGCCGA CGTCGACGCC GACTACCGGT GGCTGCATGC GAACTATCGC GGCCTCGCGG GCGACAAGCC GCTCGACGAG ATCCACCGCA GCGAGCAGTT CCGGGATTTC CGCGCGAAGC TGTACTTCAT GTACGACGCG ATCTGCGACA CGTATCCGGT CGAGATCGGC TACAAGTGGA TCATGTACTG GTACGCGGGC ATGACGCGCG ACGAGTTGCA GGCGATGGCG TTCGACAGCA ACGTCGCGAA CCTCGGCGAC GCGCTGCGCA AGGTGACCTA CGAAAGCTCG CGCGCGCTGC CGGGCAAGGC GGGCGTCATC GCCGCGACGC ACTTCCACGG CATCCGCATC CACGAGGAGA TCCGCGCGGT GATGGACACG CTGCGCTCGA ACGGCATCGA CGTGTACGTC AGCACCGCAT CGCTCGACGA CGTCGTGCGC GTGTTCGCGG GCCATCCGGC GTTCGGCTAC GGCGTGCCCG CCGAAAACGT GATCGGCATG CGGCTCACGA TGGCGGACGG CAAGTACATG AACGAATACC TGCCGAACTG GCACTTCAAC TACGGGCCGG GCAAGACGGT CGGCATCCGC CGCGAGCTCG AATCGAAGAA GGGCTACGGG CCGCTGCTCG TGTTCGGCGA CAGCGACGGC GACGCGTGGA TGCTGCGCGA CTTCGCCGAT ACCGCGGTCG GCGTGATCGT CAACCGGATG AAGAAAGGCG AGATCGGTAT CGACAGCCGC AAGGCGGCCG AGCAGATCGG CGCGAAGGAC GCGCGGCTCG TGCTGCAAGG GCGCGACGAG AACACCGGGC TGATGGTCGC CGACGAGCGC TCGATCAAGT ACGGCAAGCG CGATCCCAAA CTGCTCGCGT GA
|
Protein sequence | MKTGRRHFVR SVASASAALA AAAWSPARAA IDAPASPATA LSLTPGRWSP NNVARLRAVL AGHGASSPRY RPEHRPYAVF DWDNTSIMND CEEALLMHQI DGLHYRLTPE QFSAILRQGV PDGPFDAKLG YTSVDGKPVR MEDIAADVDA DYRWLHANYR GLAGDKPLDE IHRSEQFRDF RAKLYFMYDA ICDTYPVEIG YKWIMYWYAG MTRDELQAMA FDSNVANLGD ALRKVTYESS RALPGKAGVI AATHFHGIRI HEEIRAVMDT LRSNGIDVYV STASLDDVVR VFAGHPAFGY GVPAENVIGM RLTMADGKYM NEYLPNWHFN YGPGKTVGIR RELESKKGYG PLLVFGDSDG DAWMLRDFAD TAVGVIVNRM KKGEIGIDSR KAAEQIGAKD ARLVLQGRDE NTGLMVADER SIKYGKRDPK LLA
|
| |
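The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 864234, 865535

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1302 433"
```

Under these assumptions the computed values agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields listed in the record.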