Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bxe_B2043 |
Symbol | |
ID | 4007461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia xenovorans LB400 |
Kingdom | Bacteria |
Replicon accession | NC_007952 |
Strand | - |
Start bp | 1112827 |
End bp | 1117593 |
Gene Length | 4767 bp |
Protein Length | 1588 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637950662 |
Product | putative cellulose synthase operon protein C |
Protein accession | YP_553292 |
Protein GI | 91778084 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.360789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATTGA GCGCCCTCGC GTTGTCATTG CGCCTCGCGC TGAGTCTCAC GGCGGTCTGT TGCGTGGCGA CGGTGTCGCC CGATGCGTTG GCACAGGCCT CGAAAGATCC GCTGAACGTG CTGATCGATC AGGGCAAGTA CTGGCAATCG CATCAGCGCG GCGACCTCGC CGAACAGGCG TGGCAGAAAG TGCTGCGGAT CGATCCCAAA CAACCGGATG CGCTCTTCGG CATGGGCATG GTGCTCGCCG ACCGTAAAGA CGGCGCCGGC GCGCAGCAAT ATCTGGCACG CCTGAAGGCG GTCGCGCCGA ACTATCCGAA TCTGGATGAA CTCGGCCGGC GCCTCGGCGA ATCGAGCCTG CGCGACCAGA GCGTGAACGA CGCGCGGCGG CTCGCGCAAA GCGGCCAGAG CGCGAGCGCG GTGCAGGAAT ATCAGCGCGC GCTGACCGGC AAACCGGCCA CGCCCGAATT GCAGCTCGAG TACTACCAGG CGCTCTCGGC CACGCCGCAA GGCTGGGATC AGGCGCGCCG CGGACTGGAG CAACTCGCGC GCGACAACCC CGACGACCCG CGCTATGCCC TCGCTTACGC ACAGCATCTG ACCTATCGCG ACGTGACGCG CCGCGACGGC ATCGCGCGGC TGCAAAAACT CGCGGGTGAC AGCACGGTGG GCGCGTCGGC GAAAAAGAGC TGGCGTCAGG CGCTCCTGTG GCTGGACGCG CGGCCCTCCG ACGCCGCGCT CTATCAGGCC TATCTGCAAA CCGCGACGGA CGACGCCGCG GTCAAGGCGC GCTTCGATTC GATGGTCCAG CAGGACAGCA CGGCCCGCGC GCGCGCGCAG GAAAACGCCG CCGTCGATGC GCGCGGCCGC GCCATCGCCG ACGGCTTCGC CGCGCTCGAT CGCAACGACG TGGCAACGGC GCGCGCGAAG TTTTCCTCCG TGCTGGCCAC CAGCCCGAAC GACACCGACG CGCTCGGCGG CATGGGCATC GCCGCATTGA AGCAGGAGCG TTTCGCCGAA GCCCGCAACG ATCTGGAGCG CGCGTCGCGC AACGGCAATC CGGCGCGCTG GAAAACCGCG CTCGACAGCG CGACCTACTG GACCTATACG AGCGACGCGA TCGGCGCGCG CAGCAACGGC GAGTTCGCGA AGGCGAAGTC GCTGTTCGAG CGCGCGATCG CGCTGAATCC GTCCGACGTC ACCGCCCAGG TCCTGCTCGG CGAAATGCTG CTCGCCAATG GCGACCCGGT CGGCGCCGAG CAGGCGTACC GGATGGCGCT GCGCCGTCAG GCCGACAATC CCGACGCGGT GCGCGGCCTG GTCGGCGCGC TCGCCGCGCA AGGCCGCGGC GACGAGGCGC TGCAGTTCGC CAATCAGCTG AATGCCGAAC AGCAGTCGAA GGCCGGCGGC ATCAACCGTC TGCGTGGCGA AGCACAGGCG GCGCAGGCGC GCGCGGCCGA AGCGCGCGGC GATCTCGGCA GTGCGCGCAG CCTCTTCGAA GACGCGCTGC TGAACACGCC CGACGACCCG TGGCTGCGTC TGGACCTCGC GCGCATCTAC GTGCGCCAGG GCGCGGTGGC CAACGCGCGC AGCATGATGG ACGGCCTGCT CGCCGCGCAT CCCGACATGA CCGACGCGCT GTACGCGAGC GCGTTGCTGT CGGCGGAAAC GCAGGACTGG TCGACGGGTC TCGCGCAACT GGACCGCATT CCGCTCGCGC AGCGCACCGA CGCGATGACG ACCCTGCAAC ACCGCTTGTG GGTCCACCAG CAAGCCGACC TCGCCACGCG GATGGCGCGC AGCGGCCAGA CCCAACAGGC GCTCGCCACG CTGCACGCGG CGGAGCCGGT TGCGGGCAAC AGTCCCGAAC TGATCGGCGT GATCGCCGCC GCCTACCAGC AGGCCGGCGA CCCGAACCGC GCGCTCGGAC TCGTGCGCGC CGCCATGAAC GCGGCGCCCG GCAACACCGA TCTGCTGCTG CAATACGCGG GCATTCTGTC CGCCACTCAG CAGGAAGCCG AACTCGGCAT GGTGATGCGC CGGCTCGCGT CGATGCAATT GACGCCGCAG CAGCGCACCG ACTTCGGCAA TCTGAATCTC GGCATCGTCA TCAAGCAGTG CGACGCGGTG CGTCAGCGCG GCGACCTCGC CGGCGCCTAC GACGTGATCG CGCCGTGGCT CGCGGCCATG CCCGACAATC CCGATCTGCA AGCGGCGCTC GGCCGCCTCT ATTCGACCGC CGGCGACGAT CGCAACGCGG TGGCCAGCTA TCGCGTCGCG CTGCAACGCA AGCCCGACGA CCTGAGCCTG CTGCAGGCGA CGATTTCCGC CGCGGCCGGC GCGAAGCAGT TCAGCTACGC CGAGTCGCTC GCCACGCAGG CCTTGAACGC GGCTCCAGGC GATCCGGGCG TGCTCGCCAC GGTTGGCCGC ATGTATCGCG CGGAAGGCAA GCTGTCGCTC GCGTCGACTT TTCTGCAACG CTCGCTCGTG GCCGCCAACA CGCCGCTGAT GGCCAACGCC CCGCGCAACC CGGCGAGCAA CGTGCCGCGC GGCTGGGAAG TGGCGATGCG CCGGATCGGC GCGACACCGC TGCCGGGCAC CAATCCTTTC GAGGGCAAGA CCGCGACGGT CACGCCGACC GACGCCGACA ACGCCGCGCT TGCGGGCGGC AGCTACAATG CCGCGCGCGC CTCGCTTCCG TATTCGCAGT CTTCCTTGCC GACACAGACC GTGCCGAACT ATCCGCCGCC TACGCAACCC GCGCCTTACG TCGCGCCTTA TACGGCGCCC GCGCAGCCTT ACGCGCCGAA CCGGGCGCCC GCCGCTCTTC CATACAACGC GCCGCGTCAG GGCGCCGATT CGGGCGGCTA TGGTCAGGAA GCCTACGGAT CGAGCCAGTC GGGCGCGCCG TTGCAGCCTT ATCCCGGCCA GGGCCAGCCG CCGATGCAAC CGCAGGCGCA GATGCCACCG ATGCAGCAGG CTCCGCAATA CAACCCGGCT TATCCGCAGC AGGCGCCTTA TCCGCAAGCG CAAGCGCCCT ATCCGCAGCA GGCGCAATAC GGCGCGCCGG ATGGCTATGC CGCCACGCCC TGGCCGATGT CGCCGGCCGC GCGCGAAGCG CAGACCAACG CCGGCTCGAT GCAACAGCAG CCGTACGGCG CACCGGGCGC GAGCACCAAA CGTCCGGCCG GCAGAAAGCA AAGCGCGTCG AAGAACAGCC GCAACGCGCC CGCCTATGCG CAGGCGCCGT ATGGGCAGCA GCAAGGCTAT CCGCAGCAAC AGCAGGCTTA CTACGGCCAG CCGGCGTATG CGCAGCAGCA ACAGCAGGGC TACCCGCAGC AACCTTATCA ACCCTATCAA CCGTATCCGC CGCAGCAGCA AGGCTACGAG AATCAAGGCT CGTATCAGGG CTATTACGCA CAACAGCAGC CGTACATTCC GCAGCCGCCC ACCGGTTATG CGCAACCGTA CTATCCGGCG CAACCCGGCG CGAACGGCGG CGGCAACACG TATGCGCCGT CGAACGTGGC AAGCGCGCAA ACGCTCGGCG TCGCCGAAGA ACTGGCGCAG GTCAACCGCG AGCAGTCGAG CTCGATCTCC GGCGGCATCG TATTCCGCAA CCGCAGCGGC GAGGACGGCC TGTCCAACCT CACCGATATC GAAGCGCCGA TCCAGGGGCG CATCAGGGCG GGCAACGGCC ACGTCGTCGT CACGGCAACG CCTGTCACGC TCGATGCCGG CAGTGCCGCC GGCAACGTCT CCACGCTCGC GCGCTTCGGC TCGGGTCTGT CGAACAGCAC GTCGGAAACC GCGGCTATCG CGGGCAACAA CACCTATGGC AGCCAGACGG CGAGCGGCGT GGGCCTCTCA GTCGGCTATG AAGGCCGCCA GCTCAGCGGC GATATCGGCG TGACGCCGAT CGGTTTCCGC GAGACCAATA TCGTGGGCGG CGCGCAGTAC AACGGTGGCA TCACCGACAA GGTGTCGTAT TCGCTCGCCA TTGCACGGCG CGCGGTGACC GACAGTCTGC TGTCCTACGC CGGCGCGCGC GACGCCGGCT CCGGCCTCGA ATGGGGCGGC GTCACCTCGA ACGGCGGCCT CGGCAGCCTC GCATGGGACG ACGGCACGAG CGGCCTGTAT GTGAACGCGG CGTTCCAGTA TTTCGACGGC ACCAACGTGC CGGGCAATAC CGCCGTCAAG GGCGGCGGCG GAGTCTATAC GCGCCTGCTG AAAGACGCCG ACCAGACGCT CACCGTCGGC GTGAATACCA CGCTGATGCG CTACGACAAG AACCTGTCGT ACTTCACGTA TGGCCAGGGC GGCTATTTCA GCCCGCAACA ATACGTGATC CTGAACCTGC CGGTCGAATG GACCGGCCGC AACGGGGCGT TCACGTACGA CGTGAAGGGC TCGATCGGCG TGCAGCACTA CCGCCAGGAT TCGTCGAACT ACTTCCCGCT CAACGACGGT TCCAACCGCC AGAGTTCGGC GGCGGCGAAT GCGGGTTTCG TCGGCACCGG CGTGGATAGC GGCGCGGTAT ATCCGGGGCA GAGCAAAACG GGCGTGTCGT ATTCGCTCAG CGCGGTGGGC GAATATCAAC TGGCGCCGCA ACTCGCGTTC GGCGCGACCG CTTCGCTCGG CAATGCTTAT GAATATCGGG AGTGGCTCGC GGCGGTTTAT GTGAGGTATA GCTTCAGCAA GCAGACAGGC TTGCAGCCGT TCCCGCCCGC GCCGCTCACT TCGCCTTACC TGTCGTTGTC GAACTGA
|
Protein sequence | MRLSALALSL RLALSLTAVC CVATVSPDAL AQASKDPLNV LIDQGKYWQS HQRGDLAEQA WQKVLRIDPK QPDALFGMGM VLADRKDGAG AQQYLARLKA VAPNYPNLDE LGRRLGESSL RDQSVNDARR LAQSGQSASA VQEYQRALTG KPATPELQLE YYQALSATPQ GWDQARRGLE QLARDNPDDP RYALAYAQHL TYRDVTRRDG IARLQKLAGD STVGASAKKS WRQALLWLDA RPSDAALYQA YLQTATDDAA VKARFDSMVQ QDSTARARAQ ENAAVDARGR AIADGFAALD RNDVATARAK FSSVLATSPN DTDALGGMGI AALKQERFAE ARNDLERASR NGNPARWKTA LDSATYWTYT SDAIGARSNG EFAKAKSLFE RAIALNPSDV TAQVLLGEML LANGDPVGAE QAYRMALRRQ ADNPDAVRGL VGALAAQGRG DEALQFANQL NAEQQSKAGG INRLRGEAQA AQARAAEARG DLGSARSLFE DALLNTPDDP WLRLDLARIY VRQGAVANAR SMMDGLLAAH PDMTDALYAS ALLSAETQDW STGLAQLDRI PLAQRTDAMT TLQHRLWVHQ QADLATRMAR SGQTQQALAT LHAAEPVAGN SPELIGVIAA AYQQAGDPNR ALGLVRAAMN AAPGNTDLLL QYAGILSATQ QEAELGMVMR RLASMQLTPQ QRTDFGNLNL GIVIKQCDAV RQRGDLAGAY DVIAPWLAAM PDNPDLQAAL GRLYSTAGDD RNAVASYRVA LQRKPDDLSL LQATISAAAG AKQFSYAESL ATQALNAAPG DPGVLATVGR MYRAEGKLSL ASTFLQRSLV AANTPLMANA PRNPASNVPR GWEVAMRRIG ATPLPGTNPF EGKTATVTPT DADNAALAGG SYNAARASLP YSQSSLPTQT VPNYPPPTQP APYVAPYTAP AQPYAPNRAP AALPYNAPRQ GADSGGYGQE AYGSSQSGAP LQPYPGQGQP PMQPQAQMPP MQQAPQYNPA YPQQAPYPQA QAPYPQQAQY GAPDGYAATP WPMSPAAREA QTNAGSMQQQ PYGAPGASTK RPAGRKQSAS KNSRNAPAYA QAPYGQQQGY PQQQQAYYGQ PAYAQQQQQG YPQQPYQPYQ PYPPQQQGYE NQGSYQGYYA QQQPYIPQPP TGYAQPYYPA QPGANGGGNT YAPSNVASAQ TLGVAEELAQ VNREQSSSIS GGIVFRNRSG EDGLSNLTDI EAPIQGRIRA GNGHVVVTAT PVTLDAGSAA GNVSTLARFG SGLSNSTSET AAIAGNNTYG SQTASGVGLS VGYEGRQLSG DIGVTPIGFR ETNIVGGAQY NGGITDKVSY SLAIARRAVT DSLLSYAGAR DAGSGLEWGG VTSNGGLGSL AWDDGTSGLY VNAAFQYFDG TNVPGNTAVK GGGGVYTRLL KDADQTLTVG VNTTLMRYDK NLSYFTYGQG GYFSPQQYVI LNLPVEWTGR NGAFTYDVKG SIGVQHYRQD SSNYFPLNDG SNRQSSAAAN AGFVGTGVDS GAVYPGQSKT GVSYSLSAVG EYQLAPQLAF GATASLGNAY EYREWLAAVY VRYSFSKQTG LQPFPPAPLT SPYLSLSN
|
| |