Gene Bphy_4589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_4589 
Symbol 
ID6246107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010623 
Strand
Start bp1652451 
End bp1653749 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content62% 
IMG OID642596339 
Productpeptidase U32 
Protein accessionYP_001860746 
Protein GI186473404 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.92087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCC CACATCTCCT CAGCCCCGCC GGTTCTCTGC GCGCCTTGAA CTACGCTCTT 
GCCTACGGCG CCGATGCGGT GTACGCAGGT CTGCCGCGCT ATAGCCTGCG TGCGCGCAAC
AACGAATTCC GCAACACCGG CTTCCTCGGC GAAGGTATCG AAGCGTGCCG GGCCGCGGGC
AAGCGCTTCT ACCTGACCGT CAATCTGATG GCGCACAACC GCAAGGTCGA CACCTTCATT
AAAGACTTGC GCGATGCGGT GGCCCTTGGG CCGGATGCGT TGATCATGGC CGACCCCGGT
TTGATCATGA TGGTCCGCGA GACATGGCCC GACATGCCGA TCCACCTGTC CGTGCAGGCC
AATACTGTCA ATTACGCCAG CGTCCGTTTC TGGAATTCGA TCGGCGTATC ACGGGTGATC
CTGTCGCGCG AACTCTCGCT TGACGAGATC GCTGAGATCC GCCAACGCTG CCCCGACATC
GAACTCGAAG TCTTTGTGCA TGGCGCGCTG TGCATCGCGT ATTCGGGCCG CTGCCTGCTC
TCCGGCTACT TCAATCACCG CGACTCGAAT CAGGGCACCT GCACCAACGC ATGCCGCTGG
AGCTATCGCA TGACCGAGTC CGCTCTCACG GCCGAAGGTG AACATGTGCC GCGCCATACG
GCAGCGGACC GCGTGTATCT GCTGGAAGAA GAAGCGCGCC GCAGCGGTGA ATGGATGGAG
ATATCCGAAG ACGAACATGG CACGTATCTG ATGAATTCCC GCGACCTGCG CGCGATCGAG
CATGTTCAGC AACTGGTCGA TATCGGCGTC GATTGTCTGA AAATCGAAGG CCGCACCAAG
TCACACTACT ATGTCGCGCG CACCGCGCAG CTTTACGCAC GCGCCATCGA CGACGCGACC
GCGGGCCGCC CGTTCGACAC GGGGCTCATC GGCCGCCTCG ACGGCCTCGC CAATCGCGGC
TATACGGGCG GCTTCCTGCA ACGGCATCGC GCCGATGACT ACCAGAACTA TGCCAGCGGC
GCCTCGGGTA ACGCGCGCCA ACAGTTCGTC GGCGAATGGC TGGCTTATGA TGCAGTGACC
GGCCTGACCA CACTCGAAGT GAAGAATCGC TTTGCCGTTG GGGATGCACT GGAGATCGTT
ACGCCCGACG GCTCGCGGGA CATCACCTTG ACTCGCATGG AAAACATCGA AGGCCAACCC
TGTGACGTCG CGTCCGGCAG CGGTCACACG GTGCGGGCCG ACCTCGGAGA AACCGGGCCG
ATGACGCTGG TGGCTCGATA TGTGACGGAC GCGCAATAG
 
Protein sequence
MKRPHLLSPA GSLRALNYAL AYGADAVYAG LPRYSLRARN NEFRNTGFLG EGIEACRAAG 
KRFYLTVNLM AHNRKVDTFI KDLRDAVALG PDALIMADPG LIMMVRETWP DMPIHLSVQA
NTVNYASVRF WNSIGVSRVI LSRELSLDEI AEIRQRCPDI ELEVFVHGAL CIAYSGRCLL
SGYFNHRDSN QGTCTNACRW SYRMTESALT AEGEHVPRHT AADRVYLLEE EARRSGEWME
ISEDEHGTYL MNSRDLRAIE HVQQLVDIGV DCLKIEGRTK SHYYVARTAQ LYARAIDDAT
AGRPFDTGLI GRLDGLANRG YTGGFLQRHR ADDYQNYASG ASGNARQQFV GEWLAYDAVT
GLTTLEVKNR FAVGDALEIV TPDGSRDITL TRMENIEGQP CDVASGSGHT VRADLGETGP
MTLVARYVTD AQ