Gene BURPS668_A2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2044 
Symbol 
ID4888294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1976758 
End bp1978554 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content74% 
IMG OID640131982 
ProductTPR domain-containing protein 
Protein accessionYP_001063039 
Protein GI126443819 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGTTC ATTCCGATCG TTTCGCCGCC ATCCAGATGC AACTGAAGCA AGGCGACCTG 
ATTGCCGCCG CCGACGCGAT CGACGCGTGG CGCGCGGCCG AGCCCGCGTC CGCCGACGCG
CTCGCCTGCC GCGCGCACTG GCTGCGCCTG CTCGGTCGCT TCGACGAAGC GGCGGCCGCG
CTCGAGCCGG CGCTCGCCGC GACGCCGCCG TGCGCGGCCG CGTGGGCCGA GCGCGCGCGC
CTCGACCGGC TCGCCGGGCA GGCCGAGCGC GCGCACGCCG CGTTCGACGC CGCGCATCGC
GCCGATCCAG CCGCCACGGC ATGGCTCGCC GAATGGATCG AACTGCTGCA CCCGCTCCAT
CGTCCCGCGC TCGCGCTGCC GGTTGCGCAG GCGCTGTGCG AGCACGCGCC GGACAGTGCG
CAGTCTTGGT TTCTGCTCGG CCTCACGCAC CACTACGCGG GCGACTACGC GGCGGCGGCC
GCTGCATATC GCCGCGCGGA TGCACTCGAT CCGGCCTATC CGATGTTGCG CAACAATCTC
GCCGCGCTTC GCTATCAGAC CGGCATGACC GCCGAGGCGC TCGCGCTGGC GGAAGCGGCG
ATTCGCGCGG AGCCGGACAA CCAGATGGCG TGGTGCAACT GCTCGAATGC GTGGCTCGCG
CTGCGCGAGC CGGCACGCGC GCTGATCGCG GGCGAGCGCG CCTGCGCGCT CGGGCCGAAC
TACGCGATCG CGCAACTCGC GCGCGCGAAC GCGCTGAAAG AGCTGCAGCG CTGGCCGGAC
GCGCTCGCCG CCGCGGCGCA CGCGCACCGC AGCGCGCCCG ACGATCCCGT CATGCAGTGG
TCGCTCGCGA TGCTGCAACT GCTGCACGGC GACTACGCGA ACGGCTGGGC GAACCATGAG
GCGCGGTGGA ACGGCTCGCG CGAGCTCGGC GACCGCCCGC GCCCCTCGCC GCAGCAGCAG
TGGCGCGGCG AGCCGCTCGC CGGCAAGACA TTGATGCTGT GGGGCGAGCA GGGCTTCGGC
GATGCGCTGC AGTTCGCGCG CTTCGCGCCG ATCATCGCCG AGCAGGCGAC GCGCGCGGGC
GCGCAGGTCG TCTTCGCGTG CTTCGCGGGC CTCGAGCCGC TTTTCGCGCG CAGCTTCGCC
GGCGCGCCGA TGCGGATCGT GCGGCACGAC GCGCCGCAAT TGCCCGCATT CGACCATCAC
CTGCCCGTCG GCAGCGCGCC CCTGTTGCTC GGCGTGCGGC CCGACACGAT CCCGGCCGCG
GGCGGCTACC TGCGCGCGGA TCCGGCGCGC GCCGCGCAAT GGGCGGCGCG GCGGCCGGCC
GACGGCCGGC TGCGCGTCGG GCTCGTCTGG AGCGGCAGCC GCACGCACCA GCGCAACCCG
CTGCGCGCGA TCGATCCGGC GGCGTGCGCG CACGCATGGC GCGACCTGAC GGGCGTCGCG
TTCCACAGCC TGCAGATCGA CGGCGCCGCC GACGTCGCGA CAATGCGCGC GGCGGGCCTC
GACGTGATCG ACCATACGGC CGAGTTGCCG AGCTTCGACG ACACGGCTGC GTATCTGTCG
AGCCTCGACC TCGTCGTCAC CGTCTGCACG TCGGTCGCGC ACCTCGCGGG CGCGCTCGGC
CGGCCGACGC GGCTGCTGCT CGACGTCAAT CCGCACTGGG TCTGGATGAT CGACCGCGAA
GACAGCCCGT GGTACGGCTC GCTCCGGCTC TACCGGCAGC CCCGGTACCG CGACTGGACG
ACGGTGCTCG ACCGCGTGCG CGACGAACTG GCCGCGCTCG CAGCCGCGCG CGCGTAG
 
Protein sequence
MSVHSDRFAA IQMQLKQGDL IAAADAIDAW RAAEPASADA LACRAHWLRL LGRFDEAAAA 
LEPALAATPP CAAAWAERAR LDRLAGQAER AHAAFDAAHR ADPAATAWLA EWIELLHPLH
RPALALPVAQ ALCEHAPDSA QSWFLLGLTH HYAGDYAAAA AAYRRADALD PAYPMLRNNL
AALRYQTGMT AEALALAEAA IRAEPDNQMA WCNCSNAWLA LREPARALIA GERACALGPN
YAIAQLARAN ALKELQRWPD ALAAAAHAHR SAPDDPVMQW SLAMLQLLHG DYANGWANHE
ARWNGSRELG DRPRPSPQQQ WRGEPLAGKT LMLWGEQGFG DALQFARFAP IIAEQATRAG
AQVVFACFAG LEPLFARSFA GAPMRIVRHD APQLPAFDHH LPVGSAPLLL GVRPDTIPAA
GGYLRADPAR AAQWAARRPA DGRLRVGLVW SGSRTHQRNP LRAIDPAACA HAWRDLTGVA
FHSLQIDGAA DVATMRAAGL DVIDHTAELP SFDDTAAYLS SLDLVVTVCT SVAHLAGALG
RPTRLLLDVN PHWVWMIDRE DSPWYGSLRL YRQPRYRDWT TVLDRVRDEL AALAAARA