Gene Bphyt_4779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_4779 
Symbol 
ID6280182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp885006 
End bp886667 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content62% 
IMG OID642615861 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001888514 
Protein GI187919483 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.138285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00136432 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGAATA CCGAGTCAGC TTCTGTTAGT CCGCAGCCGT CGCACAATTC GCATGCTTCA 
GGCGCGCCCG CGCCCGCGCC GGCCAGCGCG CCGCACAAGG CCAAAGATCA GCCGTTCTTC
GACCCGGTTG CTTACGGCAA CGGGCCGGAT GATTCCGTGA CCGAAACCGA TGAAAGCGCG
GCGATCACGC ACCACTCGGT CACCATCGGC GGGCACAAGA TCGACTACAC GGCGACGGCG
GGCCACCTCG TCATCGTCGA CCCGAGTAGT TCAAAGGCGG AAGCCCGCAT GTTCTACGTG
GCGTTCACGC AGGACAATCA GAAGGAAGAA GCCCGGCCGG TCACGTTCTT CTATAACGGC
GGGCCTGGGT CGTCGTCCGT TTTCGTGCTG CTGGGCTCGT TCGCGCCGCG CCGCATCAAG
ACGTCGATGC CGAGCTTCAC GCCACCCGCG CCGTATTCGA TGGAAGACAA CCCGGACAGC
CTGCTCGACA AGAGCGACCT CGTCTTCATC AACCCGGTCG GCACCGGCTA CTCGGCGGCG
ATCGCGCCGA AGAAGAACCG CGACTTCTGG GGCGTCGACC AGGACGCGGA CTCGATCAAG
CAGTTCATCA AGCGTTTTCT GACCAAGAAC AACCGCTGGA ATTCGCCGAA GTACCTGTTC
GGCGAGTCGT ATGGCACGGC GCGCAGTTGC GTGCTGGCGT ACCGTTTGCA CGAAGATGGC
GTGGATCTGA ACGGCATCAC GCTGCAATCG TCGATTCTCG ATTACACGCA GGCCGGCAAC
CCGGTGGGCG CGCTGCCCAC CGCGGCCGCG GACGCGTGGT ATCACAAGAA GCTCGGCATC
GCGCCGCGGC CGACCGACCT CGGCACGTTC GCCGAAGAAG TCGCGCAGTT CGCGCGCACG
GACTATCTGG CCGCGCTGCG TAAGTTTCCG ACCACCGACG CGGCAACGGT CGAAAAGCTC
AGCGAATACA CGGGCATCGA CAAAACGACC TTGCTCGCGT GGAGTCTGGA TGTCGCCTCG
TACGACAGCC GGGGCAATTC GTTGTTCCTC ACCACCCTGC TGAAATCCAA GGGTCTTGCG
CTCGGCGAGT ACGACGGCCG TGTGACGGCG ATAGGCACGG GCATTGCCGG CAAGATCGAC
CCGAATTCCG GCGGCAACGA CCCGACCATG ACGGCGGTGA CCGGCGTCTA CACGACGATG
TGGAACGTCT ATCTGAACGA GCAACTGAAG TACACGTCGA ATTCGTCGTT CACGGATCTG
AACGACCAGG CCTTCAAGTA CTGGGACTTC AGTCACATCG ATCCGACCGG CGCGCAGAAG
GGCGTCGACT CGAAGGGCAA CATCATTCTG TACACCGCCG GGGACCTGGC TGCCGTGATG
GCGCTCAATC CCGACCTGAA GGTGCTGTCG GCCAACGGCT TCTTCGATTT CGTCACGCCG
TTTTATCAGA CCGTGCTCGA TTTACAGCAA ATGCCGCTGC TCAGCCAGCA GGTCCGGCAG
AATCTGTCGG CGCGCTTTTA TCCGTCGGGG CATATGGTTT ATCTCGACGG CGGATCGCGC
ACCGCGCTGA AGGCCGATCT CGCGAAGATG TACGACACGA CGGTGTCCAA TACGCAGGCG
CTGCTTCGTA TCCGCGCATT GCAGGCGCGT GTGGCGCAGT AG
 
Protein sequence
MTNTESASVS PQPSHNSHAS GAPAPAPASA PHKAKDQPFF DPVAYGNGPD DSVTETDESA 
AITHHSVTIG GHKIDYTATA GHLVIVDPSS SKAEARMFYV AFTQDNQKEE ARPVTFFYNG
GPGSSSVFVL LGSFAPRRIK TSMPSFTPPA PYSMEDNPDS LLDKSDLVFI NPVGTGYSAA
IAPKKNRDFW GVDQDADSIK QFIKRFLTKN NRWNSPKYLF GESYGTARSC VLAYRLHEDG
VDLNGITLQS SILDYTQAGN PVGALPTAAA DAWYHKKLGI APRPTDLGTF AEEVAQFART
DYLAALRKFP TTDAATVEKL SEYTGIDKTT LLAWSLDVAS YDSRGNSLFL TTLLKSKGLA
LGEYDGRVTA IGTGIAGKID PNSGGNDPTM TAVTGVYTTM WNVYLNEQLK YTSNSSFTDL
NDQAFKYWDF SHIDPTGAQK GVDSKGNIIL YTAGDLAAVM ALNPDLKVLS ANGFFDFVTP
FYQTVLDLQQ MPLLSQQVRQ NLSARFYPSG HMVYLDGGSR TALKADLAKM YDTTVSNTQA
LLRIRALQAR VAQ