Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2044 |
Symbol | |
ID | 4888294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1976758 |
End bp | 1978554 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640131982 |
Product | TPR domain-containing protein |
Protein accession | YP_001063039 |
Protein GI | 126443819 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTTC ATTCCGATCG TTTCGCCGCC ATCCAGATGC AACTGAAGCA AGGCGACCTG ATTGCCGCCG CCGACGCGAT CGACGCGTGG CGCGCGGCCG AGCCCGCGTC CGCCGACGCG CTCGCCTGCC GCGCGCACTG GCTGCGCCTG CTCGGTCGCT TCGACGAAGC GGCGGCCGCG CTCGAGCCGG CGCTCGCCGC GACGCCGCCG TGCGCGGCCG CGTGGGCCGA GCGCGCGCGC CTCGACCGGC TCGCCGGGCA GGCCGAGCGC GCGCACGCCG CGTTCGACGC CGCGCATCGC GCCGATCCAG CCGCCACGGC ATGGCTCGCC GAATGGATCG AACTGCTGCA CCCGCTCCAT CGTCCCGCGC TCGCGCTGCC GGTTGCGCAG GCGCTGTGCG AGCACGCGCC GGACAGTGCG CAGTCTTGGT TTCTGCTCGG CCTCACGCAC CACTACGCGG GCGACTACGC GGCGGCGGCC GCTGCATATC GCCGCGCGGA TGCACTCGAT CCGGCCTATC CGATGTTGCG CAACAATCTC GCCGCGCTTC GCTATCAGAC CGGCATGACC GCCGAGGCGC TCGCGCTGGC GGAAGCGGCG ATTCGCGCGG AGCCGGACAA CCAGATGGCG TGGTGCAACT GCTCGAATGC GTGGCTCGCG CTGCGCGAGC CGGCACGCGC GCTGATCGCG GGCGAGCGCG CCTGCGCGCT CGGGCCGAAC TACGCGATCG CGCAACTCGC GCGCGCGAAC GCGCTGAAAG AGCTGCAGCG CTGGCCGGAC GCGCTCGCCG CCGCGGCGCA CGCGCACCGC AGCGCGCCCG ACGATCCCGT CATGCAGTGG TCGCTCGCGA TGCTGCAACT GCTGCACGGC GACTACGCGA ACGGCTGGGC GAACCATGAG GCGCGGTGGA ACGGCTCGCG CGAGCTCGGC GACCGCCCGC GCCCCTCGCC GCAGCAGCAG TGGCGCGGCG AGCCGCTCGC CGGCAAGACA TTGATGCTGT GGGGCGAGCA GGGCTTCGGC GATGCGCTGC AGTTCGCGCG CTTCGCGCCG ATCATCGCCG AGCAGGCGAC GCGCGCGGGC GCGCAGGTCG TCTTCGCGTG CTTCGCGGGC CTCGAGCCGC TTTTCGCGCG CAGCTTCGCC GGCGCGCCGA TGCGGATCGT GCGGCACGAC GCGCCGCAAT TGCCCGCATT CGACCATCAC CTGCCCGTCG GCAGCGCGCC CCTGTTGCTC GGCGTGCGGC CCGACACGAT CCCGGCCGCG GGCGGCTACC TGCGCGCGGA TCCGGCGCGC GCCGCGCAAT GGGCGGCGCG GCGGCCGGCC GACGGCCGGC TGCGCGTCGG GCTCGTCTGG AGCGGCAGCC GCACGCACCA GCGCAACCCG CTGCGCGCGA TCGATCCGGC GGCGTGCGCG CACGCATGGC GCGACCTGAC GGGCGTCGCG TTCCACAGCC TGCAGATCGA CGGCGCCGCC GACGTCGCGA CAATGCGCGC GGCGGGCCTC GACGTGATCG ACCATACGGC CGAGTTGCCG AGCTTCGACG ACACGGCTGC GTATCTGTCG AGCCTCGACC TCGTCGTCAC CGTCTGCACG TCGGTCGCGC ACCTCGCGGG CGCGCTCGGC CGGCCGACGC GGCTGCTGCT CGACGTCAAT CCGCACTGGG TCTGGATGAT CGACCGCGAA GACAGCCCGT GGTACGGCTC GCTCCGGCTC TACCGGCAGC CCCGGTACCG CGACTGGACG ACGGTGCTCG ACCGCGTGCG CGACGAACTG GCCGCGCTCG CAGCCGCGCG CGCGTAG
|
Protein sequence | MSVHSDRFAA IQMQLKQGDL IAAADAIDAW RAAEPASADA LACRAHWLRL LGRFDEAAAA LEPALAATPP CAAAWAERAR LDRLAGQAER AHAAFDAAHR ADPAATAWLA EWIELLHPLH RPALALPVAQ ALCEHAPDSA QSWFLLGLTH HYAGDYAAAA AAYRRADALD PAYPMLRNNL AALRYQTGMT AEALALAEAA IRAEPDNQMA WCNCSNAWLA LREPARALIA GERACALGPN YAIAQLARAN ALKELQRWPD ALAAAAHAHR SAPDDPVMQW SLAMLQLLHG DYANGWANHE ARWNGSRELG DRPRPSPQQQ WRGEPLAGKT LMLWGEQGFG DALQFARFAP IIAEQATRAG AQVVFACFAG LEPLFARSFA GAPMRIVRHD APQLPAFDHH LPVGSAPLLL GVRPDTIPAA GGYLRADPAR AAQWAARRPA DGRLRVGLVW SGSRTHQRNP LRAIDPAACA HAWRDLTGVA FHSLQIDGAA DVATMRAAGL DVIDHTAELP SFDDTAAYLS SLDLVVTVCT SVAHLAGALG RPTRLLLDVN PHWVWMIDRE DSPWYGSLRL YRQPRYRDWT TVLDRVRDEL AALAAARA
|
| |