Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1949 |
Symbol | |
ID | 4903862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1910791 |
End bp | 1912587 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640145055 |
Product | TPR repeat-containing protein |
Protein accession | YP_001075983 |
Protein GI | 126455773 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.411479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTTC ATTCCGATCG TTTCGCCGCC ATCCAGATGC AACTGAAGCA AGGCGACCTG ATTGCCGCCG CCGATGCGAT CGACGCGTGG CGCGCGGCCG AGCCCGCGTC CGCCGACGCG CTCGCCTGCC GCGCGCACTG GCTGCGCCTG CTCGGCCGCT TCGACGAAGC GGCGGCCGCG CTCGAGCCGG CGCTCGCCGC GACGCCGCCG TGCGCGACCG CGTGGGCCGA GCGCGCGCGC CTCGACCGGC TCGCCGGGCA AGCCGAGCGC GCGCACGCCG CGTTCGACGC CGCGCATCGC GCCGATCCGG CCGCGACGGC ATGGCTCGCC GAATGGATCG AACTGCTGCA CCCGCTCCAT CGTCCCGCGC TCGCGCTGCC GGTTGCGCAG GCGCTGTGCG AGCACGCGCC GGACAGTGCG CAGTCTTGGT TTCTGCTCGG CCTCACGCAC CACTACGCGG GCGACTACGC GGCGGCGGCC GCTGCATACC GTCGCGCGGA TGCACTCGAT CCGGCCTATC CGATGTTGCG CAACAATCTC GCCGCGCTTC GCTATCAGAC CGGCATGACC GCCGAGGCGC TCGCGCTGGC GGAAGCGGCG ATTCGCGCGG AGCCGGACAA CCAGATGGCG TGGTGCAACT GCTCGAATGC GTGGCTCGCG CTGCGCGAGC CGGCACGCGC GCTGATCGCG GGCGAGCGCG CCTGCGCGCT CGGGCCGAAC TACGCGATCG CGCAACTCGC ACGCGCGAAC GCGCTGAAAG AGCTGCAGCG CTGGCCGGAC GCGCTCGCCG CCGCGGCGCA CGCGCACCGC AGCGCGCCCG ACGATCCCGT CATGCAGTGG TCGCTCGCGA TGCTGCAACT GCTGCACGGC GACTACGCGA ACGGCTGGGC GAACCATGAG GCGCGGTGGA ACGGCTCGCG CGAGCTCGGC GACCGCCCGC GCCCCTCGCC GCAGCAGCAG TGGCGCGGCG AGCCGCTCGC CGGCAAGACA TTGATGCTGT GGGGCGAGCA GGGCTTCGGC GATGCGCTGC AGTTCGCGCG CTTCGCGCCG ATCATCGCCG AGCAGGCGAC GCGCGCGGGC GCGCAGGTCG TCTTCGCGTG CTTCGCGGGC CTCGAGCCGC TTTTCGCGCG CAGCTTCGCC GGCGCGCCGA TGCGGATCGT GCGGCACGAC GCGCCGCAAT TGCCCGCATT CGACCATCAC CTGCCCGTCG GCAGCGCGCC CCTGTTGCTC GGCGTGCTGC CCGACACGAT CCCGGCCGCG GGCGGCTACC TGCGCGCGGA TCCGGCGCGC GCCGCGCAAT GGGCGGCGCG GCGGCCGGCC GACGGCCGGC TGCGCGTCGG GCTCGTCTGG AGCGGCAGCC GCACGCACCA GCGCAACCCG CTGCGCGCGA TCGATCCGGC GGCGTGCGCG CGCGCATGGC GCGACCTGAC GGGCGTCGCG TTCCACAGCC TGCAGATCGA CGGCGCCGCC GACGTCGCGA CAATGCGCGC GGCGGGCCTC GACGTGATCG ACCATACGGC CGAGTTGCCG AGCTTCGACG ACACGGCTGC GTATCTGTCG AGCCTCGACC TCGTCGTCAC CGTCTGCACG TCGGTCGCGC ACCTCGCGGG CGCGCTCGGC CGGCCGACGC GGCTGCTGCT CGACGTCAAT CCGCACTGGG TCTGGATGAT CGACCGCGAA GACAGCCCGT GGTACGGCTC GCTCCGGCTC TACCGGCAGC CCCGGTACCG CGACTGGACG ACGGTGCTCG ACCGCGTGCG CGACGAACTG GCCGCGCTCG CAGCCGCGCG CGCGTAG
|
Protein sequence | MSVHSDRFAA IQMQLKQGDL IAAADAIDAW RAAEPASADA LACRAHWLRL LGRFDEAAAA LEPALAATPP CATAWAERAR LDRLAGQAER AHAAFDAAHR ADPAATAWLA EWIELLHPLH RPALALPVAQ ALCEHAPDSA QSWFLLGLTH HYAGDYAAAA AAYRRADALD PAYPMLRNNL AALRYQTGMT AEALALAEAA IRAEPDNQMA WCNCSNAWLA LREPARALIA GERACALGPN YAIAQLARAN ALKELQRWPD ALAAAAHAHR SAPDDPVMQW SLAMLQLLHG DYANGWANHE ARWNGSRELG DRPRPSPQQQ WRGEPLAGKT LMLWGEQGFG DALQFARFAP IIAEQATRAG AQVVFACFAG LEPLFARSFA GAPMRIVRHD APQLPAFDHH LPVGSAPLLL GVLPDTIPAA GGYLRADPAR AAQWAARRPA DGRLRVGLVW SGSRTHQRNP LRAIDPAACA RAWRDLTGVA FHSLQIDGAA DVATMRAAGL DVIDHTAELP SFDDTAAYLS SLDLVVTVCT SVAHLAGALG RPTRLLLDVN PHWVWMIDRE DSPWYGSLRL YRQPRYRDWT TVLDRVRDEL AALAAARA
|
| |