Gene BURPS1106A_A1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1949 
Symbol 
ID4903862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1910791 
End bp1912587 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content74% 
IMG OID640145055 
ProductTPR repeat-containing protein 
Protein accessionYP_001075983 
Protein GI126455773 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.411479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGTTC ATTCCGATCG TTTCGCCGCC ATCCAGATGC AACTGAAGCA AGGCGACCTG 
ATTGCCGCCG CCGATGCGAT CGACGCGTGG CGCGCGGCCG AGCCCGCGTC CGCCGACGCG
CTCGCCTGCC GCGCGCACTG GCTGCGCCTG CTCGGCCGCT TCGACGAAGC GGCGGCCGCG
CTCGAGCCGG CGCTCGCCGC GACGCCGCCG TGCGCGACCG CGTGGGCCGA GCGCGCGCGC
CTCGACCGGC TCGCCGGGCA AGCCGAGCGC GCGCACGCCG CGTTCGACGC CGCGCATCGC
GCCGATCCGG CCGCGACGGC ATGGCTCGCC GAATGGATCG AACTGCTGCA CCCGCTCCAT
CGTCCCGCGC TCGCGCTGCC GGTTGCGCAG GCGCTGTGCG AGCACGCGCC GGACAGTGCG
CAGTCTTGGT TTCTGCTCGG CCTCACGCAC CACTACGCGG GCGACTACGC GGCGGCGGCC
GCTGCATACC GTCGCGCGGA TGCACTCGAT CCGGCCTATC CGATGTTGCG CAACAATCTC
GCCGCGCTTC GCTATCAGAC CGGCATGACC GCCGAGGCGC TCGCGCTGGC GGAAGCGGCG
ATTCGCGCGG AGCCGGACAA CCAGATGGCG TGGTGCAACT GCTCGAATGC GTGGCTCGCG
CTGCGCGAGC CGGCACGCGC GCTGATCGCG GGCGAGCGCG CCTGCGCGCT CGGGCCGAAC
TACGCGATCG CGCAACTCGC ACGCGCGAAC GCGCTGAAAG AGCTGCAGCG CTGGCCGGAC
GCGCTCGCCG CCGCGGCGCA CGCGCACCGC AGCGCGCCCG ACGATCCCGT CATGCAGTGG
TCGCTCGCGA TGCTGCAACT GCTGCACGGC GACTACGCGA ACGGCTGGGC GAACCATGAG
GCGCGGTGGA ACGGCTCGCG CGAGCTCGGC GACCGCCCGC GCCCCTCGCC GCAGCAGCAG
TGGCGCGGCG AGCCGCTCGC CGGCAAGACA TTGATGCTGT GGGGCGAGCA GGGCTTCGGC
GATGCGCTGC AGTTCGCGCG CTTCGCGCCG ATCATCGCCG AGCAGGCGAC GCGCGCGGGC
GCGCAGGTCG TCTTCGCGTG CTTCGCGGGC CTCGAGCCGC TTTTCGCGCG CAGCTTCGCC
GGCGCGCCGA TGCGGATCGT GCGGCACGAC GCGCCGCAAT TGCCCGCATT CGACCATCAC
CTGCCCGTCG GCAGCGCGCC CCTGTTGCTC GGCGTGCTGC CCGACACGAT CCCGGCCGCG
GGCGGCTACC TGCGCGCGGA TCCGGCGCGC GCCGCGCAAT GGGCGGCGCG GCGGCCGGCC
GACGGCCGGC TGCGCGTCGG GCTCGTCTGG AGCGGCAGCC GCACGCACCA GCGCAACCCG
CTGCGCGCGA TCGATCCGGC GGCGTGCGCG CGCGCATGGC GCGACCTGAC GGGCGTCGCG
TTCCACAGCC TGCAGATCGA CGGCGCCGCC GACGTCGCGA CAATGCGCGC GGCGGGCCTC
GACGTGATCG ACCATACGGC CGAGTTGCCG AGCTTCGACG ACACGGCTGC GTATCTGTCG
AGCCTCGACC TCGTCGTCAC CGTCTGCACG TCGGTCGCGC ACCTCGCGGG CGCGCTCGGC
CGGCCGACGC GGCTGCTGCT CGACGTCAAT CCGCACTGGG TCTGGATGAT CGACCGCGAA
GACAGCCCGT GGTACGGCTC GCTCCGGCTC TACCGGCAGC CCCGGTACCG CGACTGGACG
ACGGTGCTCG ACCGCGTGCG CGACGAACTG GCCGCGCTCG CAGCCGCGCG CGCGTAG
 
Protein sequence
MSVHSDRFAA IQMQLKQGDL IAAADAIDAW RAAEPASADA LACRAHWLRL LGRFDEAAAA 
LEPALAATPP CATAWAERAR LDRLAGQAER AHAAFDAAHR ADPAATAWLA EWIELLHPLH
RPALALPVAQ ALCEHAPDSA QSWFLLGLTH HYAGDYAAAA AAYRRADALD PAYPMLRNNL
AALRYQTGMT AEALALAEAA IRAEPDNQMA WCNCSNAWLA LREPARALIA GERACALGPN
YAIAQLARAN ALKELQRWPD ALAAAAHAHR SAPDDPVMQW SLAMLQLLHG DYANGWANHE
ARWNGSRELG DRPRPSPQQQ WRGEPLAGKT LMLWGEQGFG DALQFARFAP IIAEQATRAG
AQVVFACFAG LEPLFARSFA GAPMRIVRHD APQLPAFDHH LPVGSAPLLL GVLPDTIPAA
GGYLRADPAR AAQWAARRPA DGRLRVGLVW SGSRTHQRNP LRAIDPAACA RAWRDLTGVA
FHSLQIDGAA DVATMRAAGL DVIDHTAELP SFDDTAAYLS SLDLVVTVCT SVAHLAGALG
RPTRLLLDVN PHWVWMIDRE DSPWYGSLRL YRQPRYRDWT TVLDRVRDEL AALAAARA