Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I0654 |
Symbol | |
ID | 3847760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 752086 |
End bp | 753855 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637840327 |
Product | M48 family peptidase |
Protein accession | YP_441210 |
Protein GI | 83719332 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCGT TGCGTCGAAC CATTACGCCA CGGCCGCCCA GATCCTTGTC CATGCGCGTC AAACCGTCGT TTGCCGTCCT GCTGTCCGCG GCGCTCGCAC TGCCGCCCGG CAGCCACGCG CAGTCGCGCA GCGATGCGCC GCCGCTCGAG CCCGCGCTTT CCGCCGGTGC CGCGGACGCG GCGCCGCTCG CGCGCGAGGC GTTGTCCACG GTGCCGTCCG GCATTGCGGC CGGCGTGTTC GGCACGTACG GCGGCGCGGA CAGCCGGCTC GCCGGTCCGG CGTCGGGCAC GCCCAGTTTG CGCGCGCCGC TTCGCTCGTT GCAACTGCCC GATCTCGGCG ACGGCTCGGG CGGCTCGCTG ACGCCGCAGG CGGAGCGCCG TCTCGGCGAG CGCGTGATGC GCGAGGTGCG GCGCGACCCC GACTATCTCG ACGACTGGCT CGTGCGCGAC TACCTGAATT CGGTCGCGGC GAAGCTCTCG GCGGCGGCCG CCGCGCAGTT CATCGGCGGC TACATGCCCG ATTTCGATCT GTTCGCGATG CGCGATTCGC AGATCAACGC CTTTTCGCTG CCGGGCGGCT TCATCGGCAT CAACAGCGGG CTCGTCGCGG CGACGCAGAC GGAATCCGAG CTCGCGTCGG TGATCGGCCA CGAGATGGGG CACGTGCTGC AGCGGCACAT CGCGCGGATG ATCGGCGCAA GCGAGAAGAG CGGTTACGCG GCGCTCGCGA CGATGCTGTT CGGCGTGCTC GCGGGCATTC TCGCGCGCAG CGGCGATCTC GGCAGCGCGA TCGCGATGGG CGGCCAGGCG TTCGCGGTCG ACAGCCAGCT CCGGTTCTCG CGCTCGGCCG AGCGCGAGGC CGACCGCGTC GGCTTTCAAC TGCTCGCGGG CGCCGGCTAC GATCCGTATG GAATGCCGAG CTTTTTCGAG CGGCTCGAGC GCGCGTCGAT GGGCGACGCG GGCGTGCCCG CGTATGCGCG CACGCACCCG CTGACGGGCG AGCGGATCGC CGACATGGAC GATCGCGCGC GGCGCGCGCC GTACCGGCAG CCGCGACAGT CGCCGGAATA CGGTTTCGTG CGCGCGCGCC TGCGGATGCT GCAGAACCGC TCGGCGACCG ATTACGCGAA CGAGGTGAGG CGAATGCGCG CGGAACTCGA CGACAGGGTC GCGCCGAACG TCGCGGCGAA CTGGTACGGG ATCGCGCTCG GCGAGATGCT GGGCGGCCGC TACGACGACG CGGACCGCGC GCTTGCGGCG GCGCGCGATG CGTTCGCGCG CACGGCCGCG CGCGAGGGCG AGGCGGCGCG CAGCTCGCCG AGCCTCGATG TGCTCGCGGC GGAGATCGCG CGTCGCGCCG GCCGCGAGGA TGACGCGGTG CGGCTCGCCG CCGCCGCGCA AGCGCGGTGG CCTGGCTCGC ACGCGGCCAT CGCCGCGCAT CTGCAGGCGC TTCTTGCCGC GCGGCGCTAC GGGCAGGCGC AGACGCTCGC GCAATCCGAA GCGGGCAGCG ATCCCGGACA ACCCGACTGG TGGCGCTACC TCGCGCAGGC GAGCCTTGGC GAGGGCGATG CGGTCACGCA GCGCCGCGCG CTCGCGGAGA AGTTCGCACT TGAAGGTGCG TGGCCGTCGG CGATCCGGCA ATTGCGCGAG GCGCGCGACC TCAAGTCGGC CAGCTTCTAC GAGCAGTCGA TCATCAGCGC GCGGCTGCAC GAGTTCGAGG CGCGCTACAA GGAAGAACGG GAAGAGGACA ATGACAACGG GCGCGGGTGA
|
Protein sequence | MMPLRRTITP RPPRSLSMRV KPSFAVLLSA ALALPPGSHA QSRSDAPPLE PALSAGAADA APLAREALST VPSGIAAGVF GTYGGADSRL AGPASGTPSL RAPLRSLQLP DLGDGSGGSL TPQAERRLGE RVMREVRRDP DYLDDWLVRD YLNSVAAKLS AAAAAQFIGG YMPDFDLFAM RDSQINAFSL PGGFIGINSG LVAATQTESE LASVIGHEMG HVLQRHIARM IGASEKSGYA ALATMLFGVL AGILARSGDL GSAIAMGGQA FAVDSQLRFS RSAEREADRV GFQLLAGAGY DPYGMPSFFE RLERASMGDA GVPAYARTHP LTGERIADMD DRARRAPYRQ PRQSPEYGFV RARLRMLQNR SATDYANEVR RMRAELDDRV APNVAANWYG IALGEMLGGR YDDADRALAA ARDAFARTAA REGEAARSSP SLDVLAAEIA RRAGREDDAV RLAAAAQARW PGSHAAIAAH LQALLAARRY GQAQTLAQSE AGSDPGQPDW WRYLAQASLG EGDAVTQRRA LAEKFALEGA WPSAIRQLRE ARDLKSASFY EQSIISARLH EFEARYKEER EEDNDNGRG
|
| |