Gene Bcep18194_B2862 details


Gene Information

Locus tag: Bcep18194_B2862
Symbol:
ID: 3754629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 3234631
End bp: 3237642
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 68%
IMG OID: 637767710
Product: filamentous haemagglutinin-like protein
Protein accession: YP_373617
Protein GI: 78063709
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
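
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3234631, 3237642)
print(length)                  # 3012
print(protein_length(length))  # 1003
```

Both values match the Gene Length and Protein Length fields of this record.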


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGA CCTATGCATT GGTATGGAAC GGCGCCCAGC GATGCTGGAC CGCGGCCGGG 
GAAACCGCGC GCCGCCGCGG CAAGGCAACC GGCGGCAAGC GCGCCGCCGT GACCGCCGTC
TCGCTGCTCG GCTTCGCCGC GCTGCCCGCT TTTGCGCTGC CCACCGGCGA GACAATCATG
TCCGGCCAGG CCGACATCGT GCGCACCGAC GGCGGCCGCA CGATGAACAT CAACCAGCAC
ACCGACAAGC TCATCACGAA CTGGCAGGAC TTCAGCGTGG GCGGCGGCGA ACGCGTCAAC
TTCCACCAGC CGAACAGCCA GTCCCTCGCA CTCAACCGCG TGATCGGCAC CAACGGTAGC
CGTATCGACG GCCAGATTTC CGCCAACGGC CGCGTGTTCC TCGTCAACCC GAACGGCGTG
CTGTTCGGCT CCGGCGCACA GGTCAACGTC GGCGGCCTCG TCGCGTCCAC GCAGAACCTG
TCCGATGCAG ACTTCCTCGC CGGCAACTAC CGCTTCTCCG GCTCCTCGAC GCAAGCCGTC
ACCAACGACG GCACGATCAC CGCTGCCGAC GGCGGCAGCG TTGCGCTGCT CGGCGCGCGC
GTCGCCAACA ACGGCACGAT CCAGGCGAAA CTCGGTAGCG TCGCGCTCGC CGCCGGCAAC
GCGTTCACGG TGAATTTCGA CGGCAGCGGC CTGCTGAACC TGCAGGTCGA CGGCGGCGCA
GTCGATGCGC AGGCGTCCAA CGGCGGCCTG CTGAAAGCCG ACGGCGGCGA GGTGCTGATG
ACTGCCCGCG CGGCTGACAA CCTGCTCGGC GCCGTGGTCA ACAACACCGG CACGATCGAA
GCGCGCGGCC TCAGCTCGCG CGGCGGCAAG ATCACGCTCG ATGGCGGCAC CGTGAACGTC
GGCGGCAAGC TCGACGCGAG CACGGCCGAC GCGGGCGCAC CGGCCGGCGC GGTCACGACG
CGCGGCGAAC GCGTGAAAGT CGCGAACGAC GCGCAAGTCG ATACGCGTGC GGGCAACACG
GCCGGCACGT GGACGATCGA AGCCGCCAAC GCGGGCGTGA ACGGGGCGAA TGTCAACGGT
CAAGCAATCG ACGCCGACAC GCTGTCGCGC AACCTCGGCA CGACGAACGT CGCGCTGACG
AACACGAAGG GCGACCTGAC GGTCGGCGGC CCGGTCGCGT GGACGAGCGA CAACGCACTG
ACGCTCACGT CGCAGAAGGG CAACGTCGAC CTGAATCAAA CGCTGTCGGC CACCGGCGCG
AATGCGAGCC TGGCCCTCAA CGCCGCGAAC CGGATCCGCG TAAACGACGC CGTGACGCTC
ACCGGCCGTA ACGCGCACCT CGAGCTGAAT TCGACCAATG GCCACACGCT TGCGAACGAC
AAGGGCGTCG TTACGCTGTC GGGCGACAAC GCGTCGTACA GCTCGAATGG CGAAGGCTAC
AAGGTGCTGC ACACGCTTGC CGACCTGCGC AACGTCGACG CGGACCTGAA CGGCCGCTAC
GTGCTCGGCA ACGGCATCGA CGGTGCGAAC GCCGGCTTCA ACAGCATCGG CGGCAGCAAG
ACGTTCAACG GTACGTTCGA CGGCCTGGGC AACACCGTCC GTCGCCTGAC CGTCAGCAAC
CCCGGCAACA CCAGGGTCGG CCTGTTCTCG GCGAACTTTG GATCGATCGG CAACCTGAAA
CTCGATTCGC TCAACGTCAA CAGCGCGTCG ACGTCCCCCA ACGCATTCAT GGGCGGGCTC
GTCGGCATCA ACTACGGCGG CCGGATCCAC GACGTCGCGG CCACGAACAT GAGCGTCGTC
CACAACGGCA AGGGGATCGC CGTGATCGGC GGGATCGTCG GCGTGAACTA CGACGGTGCA
ATCGACAACG CCCACTTCCG CGGTCGGATT GACGGCACCC GCGACACGAT CAGCATCGGC
GGCATCGCCG GCCAAAACGA AGGCACGCGT GCAACGATCG AGCGCAGCAG CGCGAGCGCC
GACATCAAGA TTGCACGGAC CTACCGCTTC CCCGTGTATG GCCAGGGCGC GGGCATGCTG
GTCGGCCGAA ACACCGGGAC CATAGCAAAT TCGTCCGCCA GCGGCCGCAT CGCAGCCGGC
GAAGGCTTGA ACGTCGGCGG GCTCGTTGGC ATGAACGACG GCGGCACGCT GCGCAACGTG
TCGGCGGTCA CGACGATCTC GGCGGGCGAG GGCAGCAATG TCGGCGGGCT GGTCGGCCGG
GCCCTCGGCG GCTCGATCGA GCACGCGTCG GCCAGCGGCT CGATCAAGAC GATGCATGCC
GCGGCAACGG GCGGTCTCGT CGGACTGAAC GAACGCGGCC GGATTGCCAA CGCATCGTCC
GAGGTCGAGA TCGATGCATT AGGCGGCGGC CCGGTGGGCG GCCTCGTCGG CCGCAACGAC
CGCGGCGCTA TCGAAAATGT GAGCGCAGCC GGCAACGTGC AAGCCTACGT CGCGGCACCC
GTGGGCGGGC TGGTCGGCCA CAATACGGGC ACGATCGAGA ATGCGTCCGC CAGCGGCAAC
GTGACCGCCG GCACACGCTC GAACGCGGGG GGGCTGGTCG GGACCAACGG GGGTACGATC
GCGCAAGCGT CGGCCAGCGG CAACGTCACG GCCGGCAGGG AATCGAACGC GGGCGGGCTC
GTCGGCCTGA ACGACTTCAA CGGCGCGATC CGCCAGTCGT CGTCGTCCGG CACCGTCACC
GCGGACCTTT CGTGGGTTGG CGGCTTGGTC GGCACCAACG TCAACGTGAT CGAAAACAGC
CAGTCATCGG GTTCGATCGA CGGCGTGAAC TCGGATCTCG GTGGCCTGGT AGCGCTGAAC
ATGGGCACCA TCCGGTCGTC GCAGTCGAGC ACCCGGATCG GCACCGGTCC GTTGCCGGTA
CCGATCCTCC GCGGCAGCCT GGTCGCCCTG AACTTCGGGA GCATCGAGTC GAGCACCGCG
TCAGGTCCGT CGGCGGGCAT GCAGCTCGTC GGCGATAACT GGGGTACGGT TGACGGCAAG
ACCGGCTGGT AA
 
Protein sequence
MNKTYALVWN GAQRCWTAAG ETARRRGKAT GGKRAAVTAV SLLGFAALPA FALPTGETIM 
SGQADIVRTD GGRTMNINQH TDKLITNWQD FSVGGGERVN FHQPNSQSLA LNRVIGTNGS
RIDGQISANG RVFLVNPNGV LFGSGAQVNV GGLVASTQNL SDADFLAGNY RFSGSSTQAV
TNDGTITAAD GGSVALLGAR VANNGTIQAK LGSVALAAGN AFTVNFDGSG LLNLQVDGGA
VDAQASNGGL LKADGGEVLM TARAADNLLG AVVNNTGTIE ARGLSSRGGK ITLDGGTVNV
GGKLDASTAD AGAPAGAVTT RGERVKVAND AQVDTRAGNT AGTWTIEAAN AGVNGANVNG
QAIDADTLSR NLGTTNVALT NTKGDLTVGG PVAWTSDNAL TLTSQKGNVD LNQTLSATGA
NASLALNAAN RIRVNDAVTL TGRNAHLELN STNGHTLAND KGVVTLSGDN ASYSSNGEGY
KVLHTLADLR NVDADLNGRY VLGNGIDGAN AGFNSIGGSK TFNGTFDGLG NTVRRLTVSN
PGNTRVGLFS ANFGSIGNLK LDSLNVNSAS TSPNAFMGGL VGINYGGRIH DVAATNMSVV
HNGKGIAVIG GIVGVNYDGA IDNAHFRGRI DGTRDTISIG GIAGQNEGTR ATIERSSASA
DIKIARTYRF PVYGQGAGML VGRNTGTIAN SSASGRIAAG EGLNVGGLVG MNDGGTLRNV
SAVTTISAGE GSNVGGLVGR ALGGSIEHAS ASGSIKTMHA AATGGLVGLN ERGRIANASS
EVEIDALGGG PVGGLVGRND RGAIENVSAA GNVQAYVAAP VGGLVGHNTG TIENASASGN
VTAGTRSNAG GLVGTNGGTI AQASASGNVT AGRESNAGGL VGLNDFNGAI RQSSSSGTVT
ADLSWVGGLV GTNVNVIENS QSSGSIDGVN SDLGGLVALN MGTIRSSQSS TRIGTGPLPV
PILRGSLVAL NFGSIESSTA SGPSAGMQLV GDNWGTVDGK TGW
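
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the method the database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and it assumes the sequence is pasted as plain text with the 10-base spacing shown above. The names `clean`, `translate`, and `gc_content` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACAAGA CCTATGCATT GGTATGGAAC"))  # MNKTYALVWN
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the protein sequence and the GC content field of this record. Under table 11 an alternative start codon such as GTG or TTG would still be read as methionine in the first position; this sketch does not handle that case, which does not arise here because the gene begins with ATG.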