Gene Bphyt_4031 details

Gene Information

Locus tag: Bphyt_4031
Symbol:
ID: 6280192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phytofirmans PsJN
Kingdom: Bacteria
Replicon accession: NC_010676
Strand:
Start bp: 45686
End bp: 47119
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 60%
IMG OID: 642615134
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_001887787
Protein GI: 187918756
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
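
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 45686
END_BP = 47119


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1434 477
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.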


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.636683
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAGTG TGCAGAGCCA ACGTCGGCCA GTGAATTGGC AAAGCCGGCC TTTTTGCGCG 
GATGCCCGTT CAAAGCGTCG TATGGCAAAG CGCGACTTCC CGGCGGGATT CGTGCCGTTC
GGGTTGCAGA CCGCCGGCTT CATCGTACGA GGCCCGAAAC AATTGTGGCC GTATGTTTTG
CCGTGGGTGT TCGCGGTGCT CACCATAGGT TTTTTCTGCC TCGTCGTGCG CGGCGTACTC
ACGCGGCAAT TGCTGGTGCC GCCCGTCGAA ACGCCGCAGT CGCTGGTGGA ACGCGGTTAT
ACATCGAATT TTCTCGCGGA ACGCATCATG TCTTCTATGC GAGCGATCGG TCAGGACGCC
GAGTCGATTC CGCACGACAC CCTGACGGAC AACGACGCAC AACCGGACAT TCAGATCCCC
GGTCAGGAGA TGTCGTATGC AACCACCGTG CGCTTCATTA AAGGCGTCAT CAAACGCACG
GACGTGGCCG TTCATGTCGG CATCACGAAG GTTAACGACA GCGCGGACTC CTATGTGGCG
CATGTGCAGA TTGAAGGCGG TCCGTTTAAT TCCCGCGAAA GCACGGTGTC GTTCGAAGGA
CGCGATCTGG AAAAGTTCGT GCACGACATT GCAGTCAAAG CCATGCGTCT TGCGGAGCCG
AATATTCTCG CCAGTCATCT CTACTCGCAA GTGCAGAAAA CCAAGTGTTC CCTCGCGCAC
TGCGACTACG GCGAGATCGT CGCGATCTAC GACGAAGTGC TCGCACTGCC CGCCTCTGAG
CAAGGCGAAT GGGCGCTCGC CGGCAAAGCG TGGCTGCTTG CCAATCAGGG GCTGTCCAAA
GAGGCCGAGC AGCAGACTCG CGAAGCGTTG ACCGTGTACC AGCATTCAGC GGTGCTGCAC
GCAAGCCTTG GCATTGCACT CGAACAGCAG CATCGCATCG ACGACGCACT CGATGCCTTG
CGAGCCGGCG CACGCGAGAA ATCGAAGACG GCGGAAAATC TGCGTCTTCT CGGCGACGTG
CTATTGCATG CGAATCGCTA TTCCGAAGCG CTCGATGCAT TCAAGCAGGC CGACAAGATG
AAGCCCGACT CGGTCGACAA CCTGCACGAT TGGGGCGAAG CGCTCGTCAC CGTCGGCCGC
TATGACGAAG CAATCGAAAA GCTCTCGCGC GCCGTCGCGC TACGCCCGGA TCTCGCGCCC
TCTTACGCGG AATGGGGCCG GGCACTGGAT CGCAAAGGCG ATCGGTCCGG CGCCTCGCGC
AAATTCGCTC AGGCGTTGCA ACTCGATGCG GGGACTTTGT CGGTGCGCGA GAGTCATCTG
GCGCGGCTTG CCAGAGCGGC GCAGGGAGGC AACCGCCCAG ACGCGTCGAA GCCCGATGAC
GGAGTCAGGC CCGTTTCGAA TCCACCCGCC CCGCTTGCGT TGCAAACCGC GTAA
 
Protein sequence
MSSVQSQRRP VNWQSRPFCA DARSKRRMAK RDFPAGFVPF GLQTAGFIVR GPKQLWPYVL 
PWVFAVLTIG FFCLVVRGVL TRQLLVPPVE TPQSLVERGY TSNFLAERIM SSMRAIGQDA
ESIPHDTLTD NDAQPDIQIP GQEMSYATTV RFIKGVIKRT DVAVHVGITK VNDSADSYVA
HVQIEGGPFN SRESTVSFEG RDLEKFVHDI AVKAMRLAEP NILASHLYSQ VQKTKCSLAH
CDYGEIVAIY DEVLALPASE QGEWALAGKA WLLANQGLSK EAEQQTREAL TVYQHSAVLH
ASLGIALEQQ HRIDDALDAL RAGAREKSKT AENLRLLGDV LLHANRYSEA LDAFKQADKM
KPDSVDNLHD WGEALVTVGR YDEAIEKLSR AVALRPDLAP SYAEWGRALD RKGDRSGASR
KFAQALQLDA GTLSVRESHL ARLARAAQGG NRPDASKPDD GVRPVSNPPA PLALQTA
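
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. This is a minimal sketch, not the method used to produce the record: the helper names are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons, and this gene starts with ATG). Only the first row of the gene sequence is embedded here; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table() -> dict:
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    table = {}
    index = 0
    for b1 in BASES:
        for b2 in BASES:
            for b3 in BASES:
                table[b1 + b2 + b3] = AMINO_ACIDS[index]
                index += 1
    return table


CODON_TABLE = build_codon_table()


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGTCGAGTG TGCAGAGCCA ACGTCGGCCA GTGAATTGGC AAAGCCGGCC TTTTTGCGCG"
print(translate(clean(first_row)))  # MSSVQSQRRPVNWQSRPFCA
```

The printed residues match the first 20 residues of the protein sequence above. Running gc_percent on the full cleaned gene sequence gives a value to compare against the GC content field.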