Gene Information |
Locus tag | Bcep18194_B2862 |
Symbol | |
ID | 3754629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007511 |
Strand | + |
Start bp | 3234631 |
End bp | 3237642 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637767710 |
Product | filamentous haemagglutinin-like protein |
Protein accession | YP_373617 |
Protein GI | 78063709 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
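The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; both conventions are assumptions, although they are consistent with the listed values. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3234631
END_BP = 3237642
GENE_LENGTH_BP = 3012
PROTEIN_LENGTH_AA = 1003


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3237642 - 3234631 + 1 gives 3012 bp, and 3012 / 3 - 1 gives 1003 aa, matching the Gene Length and Protein Length fields.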
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA CCTATGCATT GGTATGGAAC GGCGCCCAGC GATGCTGGAC CGCGGCCGGG GAAACCGCGC GCCGCCGCGG CAAGGCAACC GGCGGCAAGC GCGCCGCCGT GACCGCCGTC TCGCTGCTCG GCTTCGCCGC GCTGCCCGCT TTTGCGCTGC CCACCGGCGA GACAATCATG TCCGGCCAGG CCGACATCGT GCGCACCGAC GGCGGCCGCA CGATGAACAT CAACCAGCAC ACCGACAAGC TCATCACGAA CTGGCAGGAC TTCAGCGTGG GCGGCGGCGA ACGCGTCAAC TTCCACCAGC CGAACAGCCA GTCCCTCGCA CTCAACCGCG TGATCGGCAC CAACGGTAGC CGTATCGACG GCCAGATTTC CGCCAACGGC CGCGTGTTCC TCGTCAACCC GAACGGCGTG CTGTTCGGCT CCGGCGCACA GGTCAACGTC GGCGGCCTCG TCGCGTCCAC GCAGAACCTG TCCGATGCAG ACTTCCTCGC CGGCAACTAC CGCTTCTCCG GCTCCTCGAC GCAAGCCGTC ACCAACGACG GCACGATCAC CGCTGCCGAC GGCGGCAGCG TTGCGCTGCT CGGCGCGCGC GTCGCCAACA ACGGCACGAT CCAGGCGAAA CTCGGTAGCG TCGCGCTCGC CGCCGGCAAC GCGTTCACGG TGAATTTCGA CGGCAGCGGC CTGCTGAACC TGCAGGTCGA CGGCGGCGCA GTCGATGCGC AGGCGTCCAA CGGCGGCCTG CTGAAAGCCG ACGGCGGCGA GGTGCTGATG ACTGCCCGCG CGGCTGACAA CCTGCTCGGC GCCGTGGTCA ACAACACCGG CACGATCGAA GCGCGCGGCC TCAGCTCGCG CGGCGGCAAG ATCACGCTCG ATGGCGGCAC CGTGAACGTC GGCGGCAAGC TCGACGCGAG CACGGCCGAC GCGGGCGCAC CGGCCGGCGC GGTCACGACG CGCGGCGAAC GCGTGAAAGT CGCGAACGAC GCGCAAGTCG ATACGCGTGC GGGCAACACG GCCGGCACGT GGACGATCGA AGCCGCCAAC GCGGGCGTGA ACGGGGCGAA TGTCAACGGT CAAGCAATCG ACGCCGACAC GCTGTCGCGC AACCTCGGCA CGACGAACGT CGCGCTGACG AACACGAAGG GCGACCTGAC GGTCGGCGGC CCGGTCGCGT GGACGAGCGA CAACGCACTG ACGCTCACGT CGCAGAAGGG CAACGTCGAC CTGAATCAAA CGCTGTCGGC CACCGGCGCG AATGCGAGCC TGGCCCTCAA CGCCGCGAAC CGGATCCGCG TAAACGACGC CGTGACGCTC ACCGGCCGTA ACGCGCACCT CGAGCTGAAT TCGACCAATG GCCACACGCT TGCGAACGAC AAGGGCGTCG TTACGCTGTC GGGCGACAAC GCGTCGTACA GCTCGAATGG CGAAGGCTAC AAGGTGCTGC ACACGCTTGC CGACCTGCGC AACGTCGACG CGGACCTGAA CGGCCGCTAC GTGCTCGGCA ACGGCATCGA CGGTGCGAAC GCCGGCTTCA ACAGCATCGG CGGCAGCAAG ACGTTCAACG GTACGTTCGA CGGCCTGGGC AACACCGTCC GTCGCCTGAC CGTCAGCAAC CCCGGCAACA CCAGGGTCGG CCTGTTCTCG GCGAACTTTG GATCGATCGG CAACCTGAAA CTCGATTCGC TCAACGTCAA CAGCGCGTCG ACGTCCCCCA ACGCATTCAT GGGCGGGCTC GTCGGCATCA ACTACGGCGG CCGGATCCAC GACGTCGCGG CCACGAACAT GAGCGTCGTC 
CACAACGGCA AGGGGATCGC CGTGATCGGC GGGATCGTCG GCGTGAACTA CGACGGTGCA ATCGACAACG CCCACTTCCG CGGTCGGATT GACGGCACCC GCGACACGAT CAGCATCGGC GGCATCGCCG GCCAAAACGA AGGCACGCGT GCAACGATCG AGCGCAGCAG CGCGAGCGCC GACATCAAGA TTGCACGGAC CTACCGCTTC CCCGTGTATG GCCAGGGCGC GGGCATGCTG GTCGGCCGAA ACACCGGGAC CATAGCAAAT TCGTCCGCCA GCGGCCGCAT CGCAGCCGGC GAAGGCTTGA ACGTCGGCGG GCTCGTTGGC ATGAACGACG GCGGCACGCT GCGCAACGTG TCGGCGGTCA CGACGATCTC GGCGGGCGAG GGCAGCAATG TCGGCGGGCT GGTCGGCCGG GCCCTCGGCG GCTCGATCGA GCACGCGTCG GCCAGCGGCT CGATCAAGAC GATGCATGCC GCGGCAACGG GCGGTCTCGT CGGACTGAAC GAACGCGGCC GGATTGCCAA CGCATCGTCC GAGGTCGAGA TCGATGCATT AGGCGGCGGC CCGGTGGGCG GCCTCGTCGG CCGCAACGAC CGCGGCGCTA TCGAAAATGT GAGCGCAGCC GGCAACGTGC AAGCCTACGT CGCGGCACCC GTGGGCGGGC TGGTCGGCCA CAATACGGGC ACGATCGAGA ATGCGTCCGC CAGCGGCAAC GTGACCGCCG GCACACGCTC GAACGCGGGG GGGCTGGTCG GGACCAACGG GGGTACGATC GCGCAAGCGT CGGCCAGCGG CAACGTCACG GCCGGCAGGG AATCGAACGC GGGCGGGCTC GTCGGCCTGA ACGACTTCAA CGGCGCGATC CGCCAGTCGT CGTCGTCCGG CACCGTCACC GCGGACCTTT CGTGGGTTGG CGGCTTGGTC GGCACCAACG TCAACGTGAT CGAAAACAGC CAGTCATCGG GTTCGATCGA CGGCGTGAAC TCGGATCTCG GTGGCCTGGT AGCGCTGAAC ATGGGCACCA TCCGGTCGTC GCAGTCGAGC ACCCGGATCG GCACCGGTCC GTTGCCGGTA CCGATCCTCC GCGGCAGCCT GGTCGCCCTG AACTTCGGGA GCATCGAGTC GAGCACCGCG TCAGGTCCGT CGGCGGGCAT GCAGCTCGTC GGCGATAACT GGGGTACGGT TGACGGCAAG ACCGGCTGGT AA
|
Protein sequence | MNKTYALVWN GAQRCWTAAG ETARRRGKAT GGKRAAVTAV SLLGFAALPA FALPTGETIM SGQADIVRTD GGRTMNINQH TDKLITNWQD FSVGGGERVN FHQPNSQSLA LNRVIGTNGS RIDGQISANG RVFLVNPNGV LFGSGAQVNV GGLVASTQNL SDADFLAGNY RFSGSSTQAV TNDGTITAAD GGSVALLGAR VANNGTIQAK LGSVALAAGN AFTVNFDGSG LLNLQVDGGA VDAQASNGGL LKADGGEVLM TARAADNLLG AVVNNTGTIE ARGLSSRGGK ITLDGGTVNV GGKLDASTAD AGAPAGAVTT RGERVKVAND AQVDTRAGNT AGTWTIEAAN AGVNGANVNG QAIDADTLSR NLGTTNVALT NTKGDLTVGG PVAWTSDNAL TLTSQKGNVD LNQTLSATGA NASLALNAAN RIRVNDAVTL TGRNAHLELN STNGHTLAND KGVVTLSGDN ASYSSNGEGY KVLHTLADLR NVDADLNGRY VLGNGIDGAN AGFNSIGGSK TFNGTFDGLG NTVRRLTVSN PGNTRVGLFS ANFGSIGNLK LDSLNVNSAS TSPNAFMGGL VGINYGGRIH DVAATNMSVV HNGKGIAVIG GIVGVNYDGA IDNAHFRGRI DGTRDTISIG GIAGQNEGTR ATIERSSASA DIKIARTYRF PVYGQGAGML VGRNTGTIAN SSASGRIAAG EGLNVGGLVG MNDGGTLRNV SAVTTISAGE GSNVGGLVGR ALGGSIEHAS ASGSIKTMHA AATGGLVGLN ERGRIANASS EVEIDALGGG PVGGLVGRND RGAIENVSAA GNVQAYVAAP VGGLVGHNTG TIENASASGN VTAGTRSNAG GLVGTNGGTI AQASASGNVT AGRESNAGGL VGLNDFNGAI RQSSSSGTVT ADLSWVGGLV GTNVNVIENS QSSGSIDGVN SDLGGLVALN MGTIRSSQSS TRIGTGPLPV PILRGSLVAL NFGSIESSTA SGPSAGMQLV GDNWGTVDGK TGW
|
| |
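The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence with translation table 11. The helper names are hypothetical, the amino-acid string follows the NCBI codon ordering (T, C, A, G at each position), and only the first three and the last two blocks of the gene sequence are used as sample input rather than the full 3012 bp.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(blocked)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(blocked: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input.

    Alternative start codons are not handled; this gene starts with ATG.
    """
    seq = clean(blocked)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence shown above.
GENE_START = "ATGAACAAGA CCTATGCATT GGTATGGAAC"
GENE_END = "ACCGGCTGGT AA"

if __name__ == "__main__":
    print(translate(GENE_START))  # first ten residues of the protein
    print(translate(GENE_END))    # last three residues, stop codon dropped
```

Translating the first 30 bases yields MNKTYALVWN and the last 12 bases yield TGW, which agree with the start and end of the listed protein sequence. Running `gc_fraction` on the full gene sequence would be the way to compare against the GC content field.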