Gene Bphyt_5839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_5839 
Symbol 
ID6278849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp2046504 
End bp2051213 
Gene Length4710 bp 
Protein Length1569 aa 
Translation table11 
GC content67% 
IMG OID642616903 
Productcellulose synthase operon C domain protein 
Protein accessionYP_001889546 
Protein GI187920514 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0252988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.398453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGA GCGCCCTCGC GTTGTCCTTG CGCCTCGCGC TGAGCCTCAC GGCCGTCTGC 
TGCGTGGCGA CGGTTTCGCC GGATGCGCTG GCGCAGGCGT CGAAAGATCC CTTGAACGTG
CTGATCGATC AGGGCAAGTA CTGGCAATCG CATCAGCGCG GCGACCTCGC CGAGCAGGCA
TGGCAGAAGG TGCTGCGCAT CGATCCCAAG CAGCCCGATG CGCTGTTCGG CATGGGCATG
GTGCTCGCCG ACCGCAAGGA CGGCGCCGGC GCGCAGCAAT ATCTGGCCCG TCTGAAGGCA
GCCGCGCCGA ACTATCCGAA TCTGGATGAG CTCGGCCGCC GTCTCGGCGA ATCGAGCCTG
CGCGACCAGA CCGTGAACGA CGCGCGGCGG CTTGCGCAAA GCGGTCAAAG CGCGAGCGCG
GTGCAGGAAT ACCAGCGCGC GCTGACTGGC AAACCGGCCA CGCCCGAATT GCAGCTCGAG
TACTACCAGG CGCTCTCGGC CACGCCGCAA GGCTGGGATC AGGCGCGGCG CGGTCTCGAA
CAGCTCGCGC GCGACAATCC CGACGATCCG CGTTACGCCC TCGCCTATGC GCAGCATCTG
ACCTATCGCG ACGTGACGCG CCGCGACGGC ATTGCGCGTC TGCAGAAACT GGCGGGCGAC
AGCACGGTCG GCGCGTCGGC GAAAAAGAGC TGGCGTCAGG CGCTGTTGTG GCTCGACGCG
CGGCCCTCCG ACGCCCCGCT CTATCAGGCC TATCTGCAAA CCGCGACCGA TGACGCCGCG
GTCAAGGCGC GCTTCGATTC GATGGTGCAG CAGGACAGCG CGGCGCGCGC ACGCTCGCAG
GAAAATGCCG CGACCGACGC ACGCGGCCGC ACCATCGCCG AGGGTTTCAG CGCGCTCGAC
CGCAACGACG TGGCCACGGC GCGCGCGAAG TTTTCCTCGG TGCTGGCCAC CAGCCCGAAC
GACACCGACG CGCTCGGCGG CATGGGCATC GCCGCGTTGA AGCAGGAGCA CTTCGCGGAA
GCGCGCAACT ATCTGGAACG CGCGTCGCGC AACGGCAATC CCGCGCGCTG GAAAACCGCG
CTCGACAGCG CGAGCTATTG GACCTACACG AGCGACGCGA TCGGCGCGCG CAGCAACGGC
GAATTCGCCA AGGCGAAGTC GCTGTTCGAA CGCGCGATCG CGCTGAATCC GTCCGACGTC
ACCGCGCAGG TGCTGCTCGG CGAAATGCTG CTCGCGAACG GCGACCCGAG CGGCGCGGAA
CAGGCGTACC GGATGGCGCT GCGCCGTCAG GCCGACAATC CCGACGCGGT GCGCGGCCTG
GTCGGCGCAC TCGCCGCGCA AGGCCGCGGC GACGAGGCGC TGCAGTTCGC CAATCAGCTC
AACACCGAAC AGCAATCGAA GGCCGGCGGC ATCAACCGTC TGCGTGGCGA AGCGCAGGCC
GCGCAAGCGC GTGCCGCCGA AGCGCGCGGC GATCTCGGCA GCGCGCGCAG TCTCTTCGAA
GACGCACTGC TGAACACGCC CGACGACCCG TGGCTGCGTC TCGACCTCGC GCGCATTTAC
GTGCGCCAAG GCGCGGTCGC CAACGCGCGC AGCATGATGG ACGGTCTGCT CGCCGCGCAT
CCCGACATGA CCGACGCGCT GTACGCGAGC GCGTTGTTGT CGGCGGAAAC GCAGGACTGG
TCGGCGGGTC TCGCGCAACT CGAACGCATT CCGGTCGCGC AGCGCACCGA CGCGATGACG
ACCTTGCAAC ACCGCTTGTG GGTGCATCAG CAAGCCGACC TCGCCACGCG GATGGCGCGC
AACGGGCAGA CGCAACAGGC GCTCGCCACG TTGCAAGCCG CCGAGCCGGT CGCGGGCAAT
AGTCCCGAAC TGATCGGCGT GATCGCCGCC GCTTACAGCC AGGCCGGCGA CCCGAACCGC
GCGCTCGGAC TCGTGCGCGC CGCCATGAAC GCGGCGCCCG GCAACACCGA TCTGCTGCTG
CAATACGCGG GCATTCTGTC GGCCACGCAG CAGGAAGCCG AACTCGGCAT GGTGATGCGC
CGGCTCGCGT CGATGCAGTT GACGCCGCAG CAACGCACCG ACTTCGGCAA TCTGAATCTC
GGCATCGTCA TCAAGCAGTG CGATGCGGTG CGCCAGCGCG GCGACCTCGC CAGCGCCTAC
GACGTGATCG CGCCGTGGCT CGCAGCCATG CCCGACAATC CCGATCTGCA AGCCGCGCTC
GGCCGCCTCT ATTCGACTGC CGGCGACGAT CGCAACGCGC TCGCCAGCTA TCGCGTCGCG
CTGCAACGCA AGCCCGACGA TCTGAACCTG CTGCAAGCGA CCATCTCCGC CGCGGCCGGC
GCGAAGCAGT TCAGCTACGC CGAAACACTC GCGAAACAGG CCTTGACCGC GGCGCCTGCC
GATCCTGGCG TGCTCGCCAC CGTCGGCCGC ATGTATCGCG CGGAAGGCAA GCTGTCGCTC
GCCTCGACTT ATCTGCAGCG CTCGCTGGTC GCCGCCAACA CGCCGCTGAT GGCCAATGCG
CCGCGCAGTC CGGCGAGCAA CGTGCCGCGC GGATGGGAAG TGGCGATGCG CCGGATCGGC
GCGACGCCGC TGCCGGGCAC CAATCCCTTT GAAGGCAAAA CCGCCACGGT CACACCGACC
GACGCCGACA ACGCCGCACT CGCCGGCGGC AGTTATAACG CCGCGCGTTC AGCGCTTCCG
TATTCGCAGT CTTCCTTGCC GACACAGACC GTGCCGAACT ATCCGCCGCC TACGCAGCCC
GCGCCTTACG TCGCGCCTTA TACAGCGCCC GCGCAGCCTT ATGCGCCGAA CCGTGGGCCT
GCCGCCCTGC CGTATAACGC GCCGCGTCCA GGCGCCGAGG CCGGTGGCTA CGGCCAGGAG
ACCTACGGGT CGAGCCAGTC GGGCGCGCCG TTGCAGCCTT ATCCGGGGCA AGGGCAGCCG
CCGATGCAGC AGCAAGCGCA GATGCCGCCG ATGCAGCAGG CTCCGCAATA CAACCCGGCT
TATCAGCAGC AGGCGCCTTA CTCGCAGCAG CAATACCCGC AGCAGCAGCC ATACGGCGCG
CAGGATGGCT ATGCGGCGAC GCCCTGGCCG ATGTCGCCGG CTGCGCGCCA GGCGCAGGCC
AGTGCCGGTT CGATGCAAGC GCAACCGTAT GGCGCGCCGA GCACGAAGCG CGCGGCCGGC
AAGAAGCAAA GTGCGTCGAA ACATGGCGGC AACGCGGCCG CTTATGCGCA GCAGCCGTAT
GGGCAACAGC AAGGCTATCC ACAGCAGCCT TACTACGGTC AACAGGCGTA TCCGCAACAA
CAGCAGGGCT ACGCGCAACA GCCATATCAA CCGCAACCGC CGCAGCAGCA GGCTTACGCG
AATCAGGGCT ACTACGCGCA GCAGCAGCCG TACATTCCGC AGCCGCCCAC CGGCTACGCC
CAACCGTACT ACCCGGCGCA ACCCGGCGCG AACGGCAACG GCAACACCTA CGCGCAGCCG
AACGTGGCGA ACGCGCAAAC GCTCGGCGTC GCGGAAGAAC TGGCGCAGGT CAATCGCGAG
CAGTCGAGCA GCATCTCGGG CGGCATCGTG TTCCGTAACC GCACCGGCGA GGACGGCCTG
TCCAACCTCA CCGATATCGA AGCGCCGATC CAGGGGCGCA TCAGGGCGGG CAATGGACAC
GTTGTCGTCA CGGCGACGCC TGTCACGCTA GACGCCGGCA CTCCCGCCAA CAATGTCTCG
ACGCTCGCGC GTTTCGGTGC CGGCCTATCG AACGGCACAT CGGTCTCGGC AGCCAATGTC
GGCAGCCAGA CGGCGAGCGG CGTGGGCTTG TCGGTCGGCT ACGAAGGCCG CAGCCTCAGC
GGCGACATCG GCGTGACGCC GCTCGGGTTC CGCGAGACCA ACATCGTGGG CGGCGCGCAA
TACAACGGCG GCGTCACCGA CAAGGTCTCG TATTCGCTGG CCGTCGCGCG CCGCGCGGTG
AGCGACAGCC TGTTGTCCTA TGCCGGCGCG CGGGATTCCG GTTCCGGTCT CGAATGGGGC
GGCGTCACTT CAACCGGCGG CCTCGGCAGC CTCGCATGGG ACGACGGCAC GAGCGGCCTG
TATGTGAACG CGGCGTTCCA GTATTACGAC GGCAACAACG TACTGAGCAA TACTGCCGTC
AAAGGCGGCG GCGGCATCTA TACGCGTCTG CTGAAAGACG CTGATCAGAC ACTCACGATC
GGCGTGAATA CGACGCTGAT GCGTTACGAC AAGAACCAGT CGTACTTCAC CTATGGGCAG
GGCGGCTATT TCAGCCCCCA ACAGTACGTG ATCCTGAACT TGCCGGTGGA GTGGTCCGGG
CGCAACGGCG CGTTCACGTA CGACGTGAAG GGTTCGATCG GCGTGCAGCA TTATCGTCAG
GATGCGTCGA ACTACTTCCC GCTCAACGAT GGTTCAGGCC GGCAAGGTGC GGCTGAGCAG
AACGCGGCGA ATGTCGGTAC GAGCGTGCAG AGCGGCGCGC AGTATCCAGG CCAAAGCAAG
ACCGGCGTGT CGTATTCGCT CAGCGCAGTG GGCGAATATC AACTCGCGCC GCAACTGGCC
TTCGGGGCGA CCGCTTCGCT AGGCAATGCT TATGAATATC GCGAGTATCT CGCGGCGGTT
TATGTGCGGT ATAGCTTCAG CAAGCAGACC GGCTCGCAGC CGTTCCCGCC CACGCCGCTT
GTTTCGCCTT ATCTGTCGTT GTCGAATTGA
 
Protein sequence
MRLSALALSL RLALSLTAVC CVATVSPDAL AQASKDPLNV LIDQGKYWQS HQRGDLAEQA 
WQKVLRIDPK QPDALFGMGM VLADRKDGAG AQQYLARLKA AAPNYPNLDE LGRRLGESSL
RDQTVNDARR LAQSGQSASA VQEYQRALTG KPATPELQLE YYQALSATPQ GWDQARRGLE
QLARDNPDDP RYALAYAQHL TYRDVTRRDG IARLQKLAGD STVGASAKKS WRQALLWLDA
RPSDAPLYQA YLQTATDDAA VKARFDSMVQ QDSAARARSQ ENAATDARGR TIAEGFSALD
RNDVATARAK FSSVLATSPN DTDALGGMGI AALKQEHFAE ARNYLERASR NGNPARWKTA
LDSASYWTYT SDAIGARSNG EFAKAKSLFE RAIALNPSDV TAQVLLGEML LANGDPSGAE
QAYRMALRRQ ADNPDAVRGL VGALAAQGRG DEALQFANQL NTEQQSKAGG INRLRGEAQA
AQARAAEARG DLGSARSLFE DALLNTPDDP WLRLDLARIY VRQGAVANAR SMMDGLLAAH
PDMTDALYAS ALLSAETQDW SAGLAQLERI PVAQRTDAMT TLQHRLWVHQ QADLATRMAR
NGQTQQALAT LQAAEPVAGN SPELIGVIAA AYSQAGDPNR ALGLVRAAMN AAPGNTDLLL
QYAGILSATQ QEAELGMVMR RLASMQLTPQ QRTDFGNLNL GIVIKQCDAV RQRGDLASAY
DVIAPWLAAM PDNPDLQAAL GRLYSTAGDD RNALASYRVA LQRKPDDLNL LQATISAAAG
AKQFSYAETL AKQALTAAPA DPGVLATVGR MYRAEGKLSL ASTYLQRSLV AANTPLMANA
PRSPASNVPR GWEVAMRRIG ATPLPGTNPF EGKTATVTPT DADNAALAGG SYNAARSALP
YSQSSLPTQT VPNYPPPTQP APYVAPYTAP AQPYAPNRGP AALPYNAPRP GAEAGGYGQE
TYGSSQSGAP LQPYPGQGQP PMQQQAQMPP MQQAPQYNPA YQQQAPYSQQ QYPQQQPYGA
QDGYAATPWP MSPAARQAQA SAGSMQAQPY GAPSTKRAAG KKQSASKHGG NAAAYAQQPY
GQQQGYPQQP YYGQQAYPQQ QQGYAQQPYQ PQPPQQQAYA NQGYYAQQQP YIPQPPTGYA
QPYYPAQPGA NGNGNTYAQP NVANAQTLGV AEELAQVNRE QSSSISGGIV FRNRTGEDGL
SNLTDIEAPI QGRIRAGNGH VVVTATPVTL DAGTPANNVS TLARFGAGLS NGTSVSAANV
GSQTASGVGL SVGYEGRSLS GDIGVTPLGF RETNIVGGAQ YNGGVTDKVS YSLAVARRAV
SDSLLSYAGA RDSGSGLEWG GVTSTGGLGS LAWDDGTSGL YVNAAFQYYD GNNVLSNTAV
KGGGGIYTRL LKDADQTLTI GVNTTLMRYD KNQSYFTYGQ GGYFSPQQYV ILNLPVEWSG
RNGAFTYDVK GSIGVQHYRQ DASNYFPLND GSGRQGAAEQ NAANVGTSVQ SGAQYPGQSK
TGVSYSLSAV GEYQLAPQLA FGATASLGNA YEYREYLAAV YVRYSFSKQT GSQPFPPTPL
VSPYLSLSN