Gene BTH_I0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I0654 
Symbol 
ID3847760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp752086 
End bp753855 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content72% 
IMG OID637840327 
ProductM48 family peptidase 
Protein accessionYP_441210 
Protein GI83719332 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGT TGCGTCGAAC CATTACGCCA CGGCCGCCCA GATCCTTGTC CATGCGCGTC 
AAACCGTCGT TTGCCGTCCT GCTGTCCGCG GCGCTCGCAC TGCCGCCCGG CAGCCACGCG
CAGTCGCGCA GCGATGCGCC GCCGCTCGAG CCCGCGCTTT CCGCCGGTGC CGCGGACGCG
GCGCCGCTCG CGCGCGAGGC GTTGTCCACG GTGCCGTCCG GCATTGCGGC CGGCGTGTTC
GGCACGTACG GCGGCGCGGA CAGCCGGCTC GCCGGTCCGG CGTCGGGCAC GCCCAGTTTG
CGCGCGCCGC TTCGCTCGTT GCAACTGCCC GATCTCGGCG ACGGCTCGGG CGGCTCGCTG
ACGCCGCAGG CGGAGCGCCG TCTCGGCGAG CGCGTGATGC GCGAGGTGCG GCGCGACCCC
GACTATCTCG ACGACTGGCT CGTGCGCGAC TACCTGAATT CGGTCGCGGC GAAGCTCTCG
GCGGCGGCCG CCGCGCAGTT CATCGGCGGC TACATGCCCG ATTTCGATCT GTTCGCGATG
CGCGATTCGC AGATCAACGC CTTTTCGCTG CCGGGCGGCT TCATCGGCAT CAACAGCGGG
CTCGTCGCGG CGACGCAGAC GGAATCCGAG CTCGCGTCGG TGATCGGCCA CGAGATGGGG
CACGTGCTGC AGCGGCACAT CGCGCGGATG ATCGGCGCAA GCGAGAAGAG CGGTTACGCG
GCGCTCGCGA CGATGCTGTT CGGCGTGCTC GCGGGCATTC TCGCGCGCAG CGGCGATCTC
GGCAGCGCGA TCGCGATGGG CGGCCAGGCG TTCGCGGTCG ACAGCCAGCT CCGGTTCTCG
CGCTCGGCCG AGCGCGAGGC CGACCGCGTC GGCTTTCAAC TGCTCGCGGG CGCCGGCTAC
GATCCGTATG GAATGCCGAG CTTTTTCGAG CGGCTCGAGC GCGCGTCGAT GGGCGACGCG
GGCGTGCCCG CGTATGCGCG CACGCACCCG CTGACGGGCG AGCGGATCGC CGACATGGAC
GATCGCGCGC GGCGCGCGCC GTACCGGCAG CCGCGACAGT CGCCGGAATA CGGTTTCGTG
CGCGCGCGCC TGCGGATGCT GCAGAACCGC TCGGCGACCG ATTACGCGAA CGAGGTGAGG
CGAATGCGCG CGGAACTCGA CGACAGGGTC GCGCCGAACG TCGCGGCGAA CTGGTACGGG
ATCGCGCTCG GCGAGATGCT GGGCGGCCGC TACGACGACG CGGACCGCGC GCTTGCGGCG
GCGCGCGATG CGTTCGCGCG CACGGCCGCG CGCGAGGGCG AGGCGGCGCG CAGCTCGCCG
AGCCTCGATG TGCTCGCGGC GGAGATCGCG CGTCGCGCCG GCCGCGAGGA TGACGCGGTG
CGGCTCGCCG CCGCCGCGCA AGCGCGGTGG CCTGGCTCGC ACGCGGCCAT CGCCGCGCAT
CTGCAGGCGC TTCTTGCCGC GCGGCGCTAC GGGCAGGCGC AGACGCTCGC GCAATCCGAA
GCGGGCAGCG ATCCCGGACA ACCCGACTGG TGGCGCTACC TCGCGCAGGC GAGCCTTGGC
GAGGGCGATG CGGTCACGCA GCGCCGCGCG CTCGCGGAGA AGTTCGCACT TGAAGGTGCG
TGGCCGTCGG CGATCCGGCA ATTGCGCGAG GCGCGCGACC TCAAGTCGGC CAGCTTCTAC
GAGCAGTCGA TCATCAGCGC GCGGCTGCAC GAGTTCGAGG CGCGCTACAA GGAAGAACGG
GAAGAGGACA ATGACAACGG GCGCGGGTGA
 
Protein sequence
MMPLRRTITP RPPRSLSMRV KPSFAVLLSA ALALPPGSHA QSRSDAPPLE PALSAGAADA 
APLAREALST VPSGIAAGVF GTYGGADSRL AGPASGTPSL RAPLRSLQLP DLGDGSGGSL
TPQAERRLGE RVMREVRRDP DYLDDWLVRD YLNSVAAKLS AAAAAQFIGG YMPDFDLFAM
RDSQINAFSL PGGFIGINSG LVAATQTESE LASVIGHEMG HVLQRHIARM IGASEKSGYA
ALATMLFGVL AGILARSGDL GSAIAMGGQA FAVDSQLRFS RSAEREADRV GFQLLAGAGY
DPYGMPSFFE RLERASMGDA GVPAYARTHP LTGERIADMD DRARRAPYRQ PRQSPEYGFV
RARLRMLQNR SATDYANEVR RMRAELDDRV APNVAANWYG IALGEMLGGR YDDADRALAA
ARDAFARTAA REGEAARSSP SLDVLAAEIA RRAGREDDAV RLAAAAQARW PGSHAAIAAH
LQALLAARRY GQAQTLAQSE AGSDPGQPDW WRYLAQASLG EGDAVTQRRA LAEKFALEGA
WPSAIRQLRE ARDLKSASFY EQSIISARLH EFEARYKEER EEDNDNGRG