Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C7171 |
Symbol | |
ID | 3734816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | - |
Start bp | 750798 |
End bp | 755153 |
Gene Length | 4356 bp |
Protein Length | 1451 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637760872 |
Product | hemagluttinin/autotransporter adhesin |
Protein accession | YP_366859 |
Protein GI | 78060284 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.339179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGGA AGCAGATTTC GGCGCTTGCC GCCGCCATGT TCACGGGTGC AGGTGTCCTG ATCTCGGGCG CCGCGCATGC GGACAATTTC GTCGACCGCG GCAACCCGGA CAACGCGCTG AGCGGTCAGT GCATCGACGG CACGAACCCG CTGTGCGTGT CGACCAAGAG CGGGACCTAC GCCGCGGTGA GCTCGACGAA CGTCACGATG GGAACGAATG CCCGGGCCGG TACCAGCGGC ATCGCGATCG GCGACCAGTC GAATGCGGCC AGCAAGGGCG GCACCAGCAG CGGCGGTTCG ATTGCTGTCG GCGTCGGCGC GCAGGCGCTG GCGAACTCGG CGACGGCGAT CGGCACGGTG GCCGTCGCGC AAGGCAATAC GGCGCTCGCC ATTGGCCGCC AGTCGGCCGC GGTCGGCGAC TTCTCGATGG CAGTCGGCAA CGTCGCGGAC GCACGCGGCA CGAGCTCGAT CGCACTCGGC CACTCGGCGC TGGCAAGCGG CGATCGCTCG GTTGCGATCG GCGGCGCCAA CCCGACGACG AGCGACGGCG TGTCGGCCGG CGCGTCGTAT GACGCGGCCA CCCAGACGCG CGCCGGCGGC ACGCAATCCG TGGCGATCGG CGCGGGCGCG CAGACGAACG ACAACAACCA GGTGGCGATC GGTTCGGGCA GCGCCGGTGC GAACAATGGC GGCACGCCGG TGTTCGGCGG CACGGCCGCG CCGGTTGGCG GTGCGGTGTC GTTCGGCGCA ATCGGCAAGG AACGCCAGCT CAAGAACGTC GCGGCAGGCG CGGCCGATAC GGACGCCGTC AACGTCCAGC AGTTGAAGAA CGTGAACGGC ACGCTCAGCA CGAGCATCGC CACGGTCGAC GCGCGCGTGA CGTCGGTCGG CAATTCGCTG TCGACCACGA TCTCGAACGT CGACCAGCGC GTGACCAACG TGGGCAACAG CCTCAGCACC AGCATCGTGA CCGCAACGAA GAACGTCGTG AAATACACCG ACGATTCGCA TGCGGCGATC GCACTCGACG GCAGCAACGG CACGACGATC AACGGCGTCG CGGCCGGTGT CGCCGACACC GATGCGGTCA ACGTCGGTCA GCTGAAGGGC ACCGTCGCAC CGCTGCAGAC GTCCATTTCG ACCGCCGCGT CGAACATCAC GAACCTGCAG ACCAGCGTCA CCGGCATCAA CACGTCGCTG AGCACGGCGA CCACGAACAT CAGCAACCTG CAGGCGGCCG ACGCACGCAA CGTGAAGTAC GACGGCCAGA GCGGATTCGA CTCGGTGACG TTCGCCGGCA CGAACGGCAC GACGCTGCAC AACGTCGCAG CCGGTGTCGC GAACACGGAT GCCGTGAACG TCGGGCAACT GACCGGCGGC CTGTCGTCGC TCAGCACGAG CGTGACGAAC AACATCAGCA CGACGCTCGG CAGCCTGAGC ACGTCGATCA ACAACCAGAT CGGCAATGCG ACGAAGAACG CGGTGCAGTA CGACGACGAT GCCCACAGCG GCGTGACGCT CGGCGGCAAG GGCGCGCAGT CACCGGTCGC GCTGCACAAC GTCGCGGACG GTGTTGCGAC GGGCGACGCC GTGAACGTCG GCCAGCTCGG CAAGGCCACC GACACGCTCA ACCAGTCGAT CACGAATGTC GGCAACAGCG TGACGACGCT CGGCAACCAG GTTACGACGA ATACGGGCAA CATCGCCGCG CTTCAGCAGG ATGCGCTCCT GTGGAACACG AACCTCGGTG CGTACGATGC GAGCCACGGC GGCAATGGCC CGCAACGCAT CGGCAACGTC GCGGCCGGCG TCGCCGACAC CGATGCGGTC AACGTCGGTC AGTTGACGAG CACCGTCGCA CCGCTACAGA CGTCCATCTC GACGGCAGCA TCGAACATCA CGAACCTGCA GGGCAACGTC ACCAGCATCA ACACGTCGCT GAGCACGGCG ACCACCAATA TCAGCGACCT GAAGGCAGCC GACGCACGCA ACGTGAAGTA CGACGGCCAG AGCGGATTCG ACTCGGTGAC GTTCGCCGGC ACGAACGGCA CGACGCTGCA CAACGTCGCG GCCGGTGTCG CGAACACCGA TGCCGTGAAC GTCGGGCAAC TGAATGGCGG CCTGTCGTCG CTCAGCACGA GCGTGACGAA CAACATCAAC ACGACGCTCG GCAGCCTCAG CACGTCGATC AACAACCAGA TCGGCAACGC GACGAAGAAC GCGGTGCAGT ACGACGATGA CGCGCACAGC GGCGTGACGC TCGGCGGCAA GGGTGCGCAG TCGCCGGTCG CGCTGCACAA CGTCGCGGAC GGTGTTGCGA CGGGCGACGC CGTGAACGTC GGCCAGCTCG GCAAGGCCAC CGACACGCTG AACCAGTCGA TCACGAACGT CAGCAACAGC GTGACCACGC TCGGCAACCA GGTGACGACG AACACGGGCA ACATCGCGGC GCTCCAGCAG GACGCGCTCC AGTGGAACGC GAACCTCGGC ACCTACGATG CGAGCCACGG CGGCAACGGC CCGCAACGCA TCGGCAATGT CGCGGCCGGC AAGAACGGAA CGGACGCGGT CAACGTCGAC CAGTTGAACG CGGCGATTCA GGACGGCACG AGCCAGCTCG ACGCGCTCGC GGTGAAGTAC GACGACGCGA GCAAGAAACA GGTTTCGCTC GGCGGGGGCA ACGGCGCGAG CCCGGTGCGC CTGACCAACG TCGCGGAAGG CAACGTTGCG GCCGGCAGCA CGGACGCGGT GAACGGCGCG CAACTGCGCC GCGCGACCGA CGGTACGGCA GCCGCGCTGG GCGGTGGCGC AACGGCGAAT CCGGACGGCT CGATCACGGC GCCGGCCTAT AAAGTCGGCG GCGGTTCGTT CAACAACGTC GGCGACGCGC TCACGAACCT CGACGGTCGC GTCGGCAGCA ACACGACGAC GCTCGAGAAT CACGAAACGC GCATCGGCAA TGCCGAAACG AACATCGCCG GCAACACGGC CGCGATTGCC GGGCTGCAGC AGGATGCGCT TCAGTTTGAC CCGAAGGCCG GCGCCTACAA CGCGGCGCGC GGTGGTGCAC CGACGAAGCT GACGAACGTG GCCGACGGCA ACATCGCCGC GGGCAGCACG GACGCGGTGA ACGGCGGCCA GTTGTCGGGC GTGAAGTCGT CGCTCGAACA GCAGATCACG CAGGTGTCCA ACCAGGCGGG CGAGGCCGTG AAGAACGTCG TCAAGTACGA TGTCGACACG AACGGCAATC GGCTGAATTC GGTATCGCTG ATCGGCGGCG ACACGAATGC GGCGGTCGTG CTGAAGAACG TCGCCGCGGG CACCGACGAT ACGGACGCGG TGAACGTGAA GCAGCTGAAG GGCGTGCAGT CGAGTCTCAA CCAGCTCGGC GCGCTCGCGG TGCAGTATGA CGACAGCTCG AAGAGCTCGA TCACGCTCGG CGGCGCCGGC GGCACGCGCA TCACCAACGT GCAGGCAGGT GCGCTCAGCG CAACCAGCAC CGACGCGGTG AACGGCTCGC AGCTCTACGC GACGAACCAG CAGGTCGCGA AGAACACGAC CGACATCACG AACCTGCAGG GGAACGTGAC CAACATCGCG AACGGCAAGG CCGGCCTTGT GCAACAGCAG GACCCGAACG GCGCGATCAC GGTCGGGAAG GACACCGGCG GGTCCAGCGT GAACTTCTCC GGCACGGCAG GCGACCGCGT GCTGACCGGC GTTGCGGCGG GCGTGAACGA GAACGACGCG GTCAACATGG GCCAGTTCAA CAATGCGCTG AAGAACGCGG CGGCCAACGA CCAGATCCGC GCGGCGGCGA CCGATGCGAA CACCACGTGG ATCGCGCGTG CCGATGCGGG TTCGATCGGC TCGACGGCAA CCGCGACCGG CAAGAACGCG GTGGCGGTCG GCCAGGGCTC CGTTGCCGAT CGAGATAATT CGTTCTCGGT GGGTGCGAAG GGCAGCGAAC GCCAGGTCAC GAACGTCGCG GCCGGCACGG CGCCGACCGA TGCGGTGAAC GTGCAGCAGC TGAACGACAA CCTGTCGGCC GCGTCGAATC AGGCGAAGGG CTACACCGAC CAGCGCATCG GCCAGGTGTA CAACTCGTTC AACGACCTGA AGAAGGACAT GTACGGCGGC GTGGCATCGG CAATGGCCGT GGCCGGCCTG CCGCAACCGA CGGGCGCGGG CCGCTCGATG GTCTCGGCGG CGACGTCGAA CTATCACGGC CAGCAGGGTT TCGCCGCCGG TTATTCGTAC GTGACGGAAA GCAACCGCTG GGTCGTCAAG GCGTCGGTGA CGGGCAACAC GCGTTCGGAC TTCGGCGCGG TGGTGGGCGC GGGTTACCAG TTCTGA
|
Protein sequence | MKRKQISALA AAMFTGAGVL ISGAAHADNF VDRGNPDNAL SGQCIDGTNP LCVSTKSGTY AAVSSTNVTM GTNARAGTSG IAIGDQSNAA SKGGTSSGGS IAVGVGAQAL ANSATAIGTV AVAQGNTALA IGRQSAAVGD FSMAVGNVAD ARGTSSIALG HSALASGDRS VAIGGANPTT SDGVSAGASY DAATQTRAGG TQSVAIGAGA QTNDNNQVAI GSGSAGANNG GTPVFGGTAA PVGGAVSFGA IGKERQLKNV AAGAADTDAV NVQQLKNVNG TLSTSIATVD ARVTSVGNSL STTISNVDQR VTNVGNSLST SIVTATKNVV KYTDDSHAAI ALDGSNGTTI NGVAAGVADT DAVNVGQLKG TVAPLQTSIS TAASNITNLQ TSVTGINTSL STATTNISNL QAADARNVKY DGQSGFDSVT FAGTNGTTLH NVAAGVANTD AVNVGQLTGG LSSLSTSVTN NISTTLGSLS TSINNQIGNA TKNAVQYDDD AHSGVTLGGK GAQSPVALHN VADGVATGDA VNVGQLGKAT DTLNQSITNV GNSVTTLGNQ VTTNTGNIAA LQQDALLWNT NLGAYDASHG GNGPQRIGNV AAGVADTDAV NVGQLTSTVA PLQTSISTAA SNITNLQGNV TSINTSLSTA TTNISDLKAA DARNVKYDGQ SGFDSVTFAG TNGTTLHNVA AGVANTDAVN VGQLNGGLSS LSTSVTNNIN TTLGSLSTSI NNQIGNATKN AVQYDDDAHS GVTLGGKGAQ SPVALHNVAD GVATGDAVNV GQLGKATDTL NQSITNVSNS VTTLGNQVTT NTGNIAALQQ DALQWNANLG TYDASHGGNG PQRIGNVAAG KNGTDAVNVD QLNAAIQDGT SQLDALAVKY DDASKKQVSL GGGNGASPVR LTNVAEGNVA AGSTDAVNGA QLRRATDGTA AALGGGATAN PDGSITAPAY KVGGGSFNNV GDALTNLDGR VGSNTTTLEN HETRIGNAET NIAGNTAAIA GLQQDALQFD PKAGAYNAAR GGAPTKLTNV ADGNIAAGST DAVNGGQLSG VKSSLEQQIT QVSNQAGEAV KNVVKYDVDT NGNRLNSVSL IGGDTNAAVV LKNVAAGTDD TDAVNVKQLK GVQSSLNQLG ALAVQYDDSS KSSITLGGAG GTRITNVQAG ALSATSTDAV NGSQLYATNQ QVAKNTTDIT NLQGNVTNIA NGKAGLVQQQ DPNGAITVGK DTGGSSVNFS GTAGDRVLTG VAAGVNENDA VNMGQFNNAL KNAAANDQIR AAATDANTTW IARADAGSIG STATATGKNA VAVGQGSVAD RDNSFSVGAK GSERQVTNVA AGTAPTDAVN VQQLNDNLSA ASNQAKGYTD QRIGQVYNSF NDLKKDMYGG VASAMAVAGL PQPTGAGRSM VSAATSNYHG QQGFAAGYSY VTESNRWVVK ASVTGNTRSD FGAVVGAGYQ F
|
| |