Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1204 |
Symbol | |
ID | 4885859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1139926 |
End bp | 1146087 |
Gene Length | 6162 bp |
Protein Length | 2053 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640131143 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_001062201 |
Protein GI | 126444454 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.19066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACG ACGACAAGCA CAACAAGCCA GGCCATCCGA ACCCGTCGAT CCTCCGTCTG CTCGCGCGCG TCGGCGCGGC CGCGGCGCTC GGCGCGGCCG CCGCGCTGTC GCTGCACGCC GACGCGGCGC GCACCGTGAC CGTGTCGCCG CAAGGCACCG TCGCCGAAGT CCGGCAGGCC GTCGTCAAGT TCGACGAGGC GATGGTCGCG TTCGGCTCGG CGTCCGCGCC GGCCCCCGCG CGCCTCGCGT GCGCCGATCC CGCCGCCGCG CGCGGCCACG GCCGCTGGCT CGACGAGAAG ACCTGGGCCT ATGACTTCGA GAACGATCTG CCGCCCGGCG TACGCTGCAC GGTCGCGCTC AACGACACGC TGCGCTCGGC CGCCGGCCAC GCGGTGACGG GCCCGCGCCG CTTCGCGTTC CAGACGGGCG GCCCGTTCCC GGTCGCGGTA CGGCCGGGCG CGCGCGAGAT CGAAGAGCGG CAGGTATTCG TCGTCAAGCT GAACGGCCCG GCCGACGAAC GCTCGGCGCT CGCGAGCATC TGGTGCGAGG CCGCCGGCAT CGGCAACCGC ATCCCGGTGG CGCCCGCCGA CGCGCCGACG CGCGCCGCGC TCCTCGATCA CTTCCACTGG AAAAAGGACG CCGCGCGCGT GCTCACGCTC GCGTGCGCGC AGGCGCTGCC CGCGGGCGCG AAGATGCAGC TCGTCTACGG CCGCGGCGTC GCGAGCCCGA GCGGCATCGC GAACGACACC GAGCGGCGCT ACGACTTCAC GGTGCGCGCG CCGTTCGCCG CGAGCTTCTC GTGCGAGCGC GAGAACGCGA AGGCGCCGTG CACACCGCTG CGCCCGCTCA CGCTGTCGTT CAACGCGCCG GTGGCGCGCC GCGCCGCCGG CGAGATCCGG CTGCGCGGCC CGCACGGCGC GATCGAGCCG TTCTTCAAGC CCGACGATCG CGCGGAGGAA GTCACGAGCG TGCAGTTCGC CGCGCCGCTG CCCGCGCAGG CCGCGCTGAC GATCGAGCTG CCGCCGGCGC TGCGCGACGT GACGGGCCGC ACGCTGTCGA ACGCCGATCT GTTCCCGCTC GCGACGCGCA CCGCGCCGAT GCCGCCGCTC GCGAAGTTCT CGTCGGGCAC GTTCGGCATC GTCGAGCGCT TCGCCGAGCC GGATTCGCCC GCGCTCGTGC CCGTCACGCT GCGCAACGTC GAGGCCGACC TGCGCATCGC GGGGCTGAAC GCCGGCGGCG CGCAGTTCTC GAACCTGAAG GTCGAGAACG ACAGCGAGAT CCGCCGCTGG ATGCAGCTCG TCGAGCGTTT CGACGGCCGC GCGATGAGCG TCGAGTCGAT CGACAAGCTC CGCCCCGGCC TGCTCGCGCG CGGCCAGCAT CCCGTCTACG TGCCGCTCGC CGCGGGCGAG CGCGCGCCGA AGCCGCAGCA CCGGCAGATC GACATCCGCT CGCTGTCGCT GCTCGCGGGC GAGCCCGGCG TACGGACGCT GACGCTGCCG AAGGCCGACC CGAAGGCGCT GCGCCCGTTC GAGATCGTCG GCGTGCCGAT CGACAAGCCG GGCTTTCACG TGCTCGAGCT CGCGTCGCCC GCGCTCGGCC GCTCGCTGCT CGCCAAGCCC GCGAAGATGT ACGTGCGCAC CGCGGTGCTC GTCACGAACC TCGGCGTGCA TCTGAAGCTC GGCCGCGAGA ACAGCGTCGT CTGGGTGACG ACGCTCGACA AGGGCAAGCC CGTGCCGAAC GCGCAGGTGC GCGTGTCCGA CTGCAACGGC GACGAAATCG CGGCCGGCAG GACCAACGCG CAAGGGCTCG TCACGATCGA TGCGCCTCTC GAGCCCAAGC GCGCATGCGA CAGCTCGAAC GGCGACGGCG ACTATTTCGT GTCCGCCCGC GTCGACGATC CGAAGACGGG CCCCGACATG GCGTTCGTGC GCTCGAGCTG GAACCGCGGC ATCGAATCGT GGCGCTTCAA CGTGCCGACC GACATGAGCG ACACGCCGAC CGTGCGCGCG CATACGGTGT TCGACCGCAC GCTCGTGCGC GCGGGCGAGA CGGTATCGAT GAAGCACTTC GTGCGCGAGG AGACGCTGCG GGGCCTCGCG TTCCCGCCGC GCTACCCGTC GCGCGCGACG ATCCGCCATC TCGGCAGCGG CCAGACGTAC CGCGTGCCGC TCGCATGGGC CGCCGATCAC ACCGCCGACA CGCGCTTCGC GCTGCCCGCG GCGGCGAAGC TCGGCGAATA CAGCGTCGAG CTCGAGGACG GCCCCGAGGA CGCGCCGAGC GCGAGCTACT ACGGCGGCAG CTTCCGCGTC GAGGCGTTCC GGCTGCCCGT CTTCAAAGGC TCGATCGGCG TGCGCGACGC GAAGGCGAGC CCGCTCGTCG GCGCGAAGGA CGCGCCGCTC GCGGTGCAGA TCGATTACGT GTCGGGCGGC GGCGCGTCGA ACCTGCCCGT GCAGGTGTCG GCGCTCGTCA AGCGCGCCGA GCCGCCGTTC GCCGAGCGCT ATCCCGATTT CGGCTTCGAG CCGTACCGCC CGCAAACGCA GGACGCGACG GCCGACGACG AGGACACGCA GGACGGCGAG AACGCGTCGC GCGACACCGA TCCCGACGCG ACGAAGCTCA TCGCCGACAA GATCGCGCTC ACGCTCGATC GCACCGGCTC GGGCGCGCTC ACGCTGAAGG GCCTGCCGGC CGTCGACGCG CCCAAGCGCG TCGCGCTCGA GGCGACGTTC GCCGATCCGA ACGGCGAGGT GCAGACGATT CGCGGCGACG CGATGCTGTG GCCGGCCGCG GTCGTCGCCG GCATCCAGGC GGGCCACTGG GTGTCGGTCG GCCAGCGCGT GCCGGTGCAG GCGCTCGTTG TCGATCTGCA GGGCCGCCCG CGCGCGTCGG CGGCGGTCGA GATCAAGGGC GTCGCGCGCG TGACGACCTC CTCGCGCAAG CGGATGGTCG GCGGCTTCTA CGCGTACGAC AACCAGAGCG ACACGCGCGA GCTGGGCGTG CTGTGCTCGG GCAAGACCGA CGCGCAGGGC CGGCTGGCGT GCGAGGCCAC GCTCTCGCAG GCGGGCAACG TGCAACTGAT CGCGGTCGCG AAGGACGGCG ACGGCCGCGC GTCGAACGCG TCGACGTCGG TATGGGTCAC GCGCGAGGAC GATCTCTGGT TCGGCGGCGA GAACACCGAC CGGATCGACG TGATCCCCGA GAAGGCGTCG TACGAGCCGG GCGACACCGC GCGCTTCCAG GTGCGCATGC CGTTTCGCCA TGCCACGGCG CTCGTCGCCG TCGAGCGCGG CGGCGTGATG CAGACGCGCG TCGTCGAGCT GAACGGCAAG AATCCGACCG TCGATCTGAA GGTCGGCGAC ACGTGGGGGC CGAACGTCTA CGTATCGGTG CTCGCGCTGC GCGGGCGGCT GCGCGACGTG CCGTGGTACT CGTTCTTCAC GTGGGGCTGG AAGGCGCCCC TCGAATGGGC GCGCGCGTTC TGGCGCGAAG GCCGCCACTA CGAGGCGCCG AGCGCGCTCG TCGACCTGTC GAAGCCCGCG TTCCGCTACG GCCTGGGCGA GATCAAGGTC GGCACGGGCG CGCACCGGCT CGGCGTCGCG GTGACGACCG ACGCGGCCCG CTATCCGGTG CGCGGCACCG CGCACGCGCG CGTGAAGGTC ACGCTGCCGG ACGGCAAGCC CGCGCCCGCC GGCACGCAGA TCGCGCTCGC CGCGGTCGAC GAGGCGCTCC TCGAGCTGAT GCCGAACCGC AGTTGGGACC TGCTCGATGC GATGCTGCAA CGGCGCGCGT ACGGCATCGA GACGGCCACC GCGCAAATGG AGATCGTCGG CCGCCGCCAC TTCGGACGCA AGGCCGTGCC CGCGGGCGGC GGCGGCGGGA TGGCGCCGAC CCGCGAGCTG TTCGACACGC TGTTGCTGTG GAACCCGCGC GTCACGCTCG ACGCGAACGG CAGCGCGAGC GTCGACGTGC CGCTCAACGA TGCGCTCACG CGCTTTCGGA TCGTCGCGAT CGCGGCGACG GGCGCGGAGC GCTTCGGCAC CGGCAGCGCG ACGATCCGCA GCACGCAGGA TCTGCAACTG ATCTCGGGCC TGCCGCCGCT CGTGCGCGAA GGCGACGCGT TCCGCGCGCA GGTGACGGTG CGCAACACGA CCGAGCGCAA GATGGACGTC GTCGTCACGC CGCGCGTGCC GGGCATCGAC GCGGCGCCGC GGAAGATATC ACTCGCGCCC GATTCCGCGC AGGAAATCGC GTGGGACGTC ACGGTGCCCG AGACGGCGCT CGACGCCGCG GGCGCGCTGA ACTGGCGCAT CGAGGCGGCC GAGCAAGGCG GCAAGCGCGC GGCCGACGCG CTCGCGCTCG CGCAGAAGGT CGTGCCGGCG GTGCCCGTGA CGGTCCAGCA GGCGACGCTC GCGCAAGTCG ACGGCACGCT GAGCGTGCCC GTCGCGCCGC CCGCCGGCGC CATGCCCGAC GCGCGCGGCG CGCCGCGCGG CGGCATCGCC GTGTCGCTGC AATCGACGCT CGCCGACGGG CTGCCCGGCG TGCGCCGCTG GTTCGAGCGC TATCCGTACC GCTGCCTCGA ACAGCAGGCG TCGCGCGCGA TCGGCTTGCG CGACGCCGCG CAATGGCAGG CGCTCGCCGC GCGGATGCCG GTCTACCTCG ACCGCGACGG GCTCGCGAGC TACTTCCCGC CTTCGTCCGA TGATGCGCAC TCCGGCAGCC CGCCGCTGTC CGCGTACCTG CTCGTGCTCG CCGACGAGGC GAGCCGCGCC GACGCGCGCT TCGCGCTGCC CGAGGACGTG CGCACGCAGC TCGAGGCCGG GCTCGCGCGC TTCGTCGAGG GGCGCATCGA GCGCGACACC TGGGCGCCGC GCCAGGATCG CGACCTGCGC AAGCTCGCGG CGATCGAGGC GCTGTCGCGC TACGGCGCCG CGCAAGGCCG GATGCTCGGC TCGATCGAGA TCGCGCCGAA CCAGTGGCCG ACCTCGGCCG TGCTCGACTA TCACGCGATC CTCACGCGCG TGAAGGACAT CGCGCGGCGC GACGAGAAGC GCGCGCAGGC CGAGCAGATC CTGCGCGCGC GGCTCGCCTA CCAGGGCACG CAGCTCGTGT TCTCGACCGC GCGCGGCGAC GACCTGTGGT GGCTGATGAC AAGCAACGAG ACGAACGCGG CGCGCCTCGC GCTGGCGTTC GCCGGCGAGG CGGGCTGGAA GGACGAGATG CCGCGCGTCG CGGCCGGCCT GCTCGCGCTG CAGAAGAACG GCGCGTGGCA GACGACGACC GCGAACGCGC TCGGCCTGCT CGCGCTCGAG CGCTTCTCGC GCACGTACGA GCGCGCGCCG GTTGCCGGCG CGACGAAGAT CGCGTTGGGC GGCGACACGC GCTCGATCGC GTGGTCGCAG CCGGCGGGCG CGGGGGGCGC CACTGTCGCG ACGGGCGCGA CGGGCGCGGC GGCAACGGCC GGCGCGGCAT TGGCGTCCGG CGCTTCGGCG TCGGCGGCCG CGAAGCCGGC CGCCACGCAA TCGCGCACGC CGCCGCCGTC GAGCGGCACG CCGCCGCCGA GCGCCGCGAC GCGGGCGGCC GCCGCGCACA GCGTGACGCT GCCGTGGCCG CGCGGCACAC GCACGCCGGG CACGCTGTCG ATCGTGCACG AAGGCAGCGG GCGGCCGTGG GCGACGATCG AAAGCCTCGC CGCGGTGCCG GTGCGCGCGC CGTTCGCGGC CGGCTACCGG ATCGCGAAAA CCGTGACGCC GGTGTCGCCC GCGGTCAGCG GCGCGCTCAC GCGCGGCGAC GTGCTGCGCG TGCGTCTCGA CATCGACGCG CAGAGCGACA TGACGTGGGT GGTCGTCAAC GATCCGATTC CGGCCGGCGC GACGATCCTG GGCTCCGGCC TCGGCCGCGA CTCCGAGGCC GCGACGCAGG GCGAGAAGTC GCCCGACGGC GCGTGGCCCG CGTTCGTCGA GCGCGACTTC GACGGCTATC GCGCGTACTA CGACTATTTG CCGAAGGGCA AATTGACGGT CGAGTACACG GTGCGCGTGA ACAACGTCGG CACGTTCGGG CTGCCGCCGA CGCGCGTCGA GGCGCTCTAC GCGCCGTCCG TGTACGGGCT GTGGCCGAAC CCGCCGATGA CGGTCAAGCC GGCCGTCGCG GGCAAGCCGT GA
|
Protein sequence | MKHDDKHNKP GHPNPSILRL LARVGAAAAL GAAAALSLHA DAARTVTVSP QGTVAEVRQA VVKFDEAMVA FGSASAPAPA RLACADPAAA RGHGRWLDEK TWAYDFENDL PPGVRCTVAL NDTLRSAAGH AVTGPRRFAF QTGGPFPVAV RPGAREIEER QVFVVKLNGP ADERSALASI WCEAAGIGNR IPVAPADAPT RAALLDHFHW KKDAARVLTL ACAQALPAGA KMQLVYGRGV ASPSGIANDT ERRYDFTVRA PFAASFSCER ENAKAPCTPL RPLTLSFNAP VARRAAGEIR LRGPHGAIEP FFKPDDRAEE VTSVQFAAPL PAQAALTIEL PPALRDVTGR TLSNADLFPL ATRTAPMPPL AKFSSGTFGI VERFAEPDSP ALVPVTLRNV EADLRIAGLN AGGAQFSNLK VENDSEIRRW MQLVERFDGR AMSVESIDKL RPGLLARGQH PVYVPLAAGE RAPKPQHRQI DIRSLSLLAG EPGVRTLTLP KADPKALRPF EIVGVPIDKP GFHVLELASP ALGRSLLAKP AKMYVRTAVL VTNLGVHLKL GRENSVVWVT TLDKGKPVPN AQVRVSDCNG DEIAAGRTNA QGLVTIDAPL EPKRACDSSN GDGDYFVSAR VDDPKTGPDM AFVRSSWNRG IESWRFNVPT DMSDTPTVRA HTVFDRTLVR AGETVSMKHF VREETLRGLA FPPRYPSRAT IRHLGSGQTY RVPLAWAADH TADTRFALPA AAKLGEYSVE LEDGPEDAPS ASYYGGSFRV EAFRLPVFKG SIGVRDAKAS PLVGAKDAPL AVQIDYVSGG GASNLPVQVS ALVKRAEPPF AERYPDFGFE PYRPQTQDAT ADDEDTQDGE NASRDTDPDA TKLIADKIAL TLDRTGSGAL TLKGLPAVDA PKRVALEATF ADPNGEVQTI RGDAMLWPAA VVAGIQAGHW VSVGQRVPVQ ALVVDLQGRP RASAAVEIKG VARVTTSSRK RMVGGFYAYD NQSDTRELGV LCSGKTDAQG RLACEATLSQ AGNVQLIAVA KDGDGRASNA STSVWVTRED DLWFGGENTD RIDVIPEKAS YEPGDTARFQ VRMPFRHATA LVAVERGGVM QTRVVELNGK NPTVDLKVGD TWGPNVYVSV LALRGRLRDV PWYSFFTWGW KAPLEWARAF WREGRHYEAP SALVDLSKPA FRYGLGEIKV GTGAHRLGVA VTTDAARYPV RGTAHARVKV TLPDGKPAPA GTQIALAAVD EALLELMPNR SWDLLDAMLQ RRAYGIETAT AQMEIVGRRH FGRKAVPAGG GGGMAPTREL FDTLLLWNPR VTLDANGSAS VDVPLNDALT RFRIVAIAAT GAERFGTGSA TIRSTQDLQL ISGLPPLVRE GDAFRAQVTV RNTTERKMDV VVTPRVPGID AAPRKISLAP DSAQEIAWDV TVPETALDAA GALNWRIEAA EQGGKRAADA LALAQKVVPA VPVTVQQATL AQVDGTLSVP VAPPAGAMPD ARGAPRGGIA VSLQSTLADG LPGVRRWFER YPYRCLEQQA SRAIGLRDAA QWQALAARMP VYLDRDGLAS YFPPSSDDAH SGSPPLSAYL LVLADEASRA DARFALPEDV RTQLEAGLAR FVEGRIERDT WAPRQDRDLR KLAAIEALSR YGAAQGRMLG SIEIAPNQWP TSAVLDYHAI LTRVKDIARR DEKRAQAEQI LRARLAYQGT QLVFSTARGD DLWWLMTSNE TNAARLALAF AGEAGWKDEM PRVAAGLLAL QKNGAWQTTT ANALGLLALE RFSRTYERAP VAGATKIALG GDTRSIAWSQ PAGAGGATVA TGATGAAATA GAALASGASA SAAAKPAATQ SRTPPPSSGT PPPSAATRAA AAHSVTLPWP RGTRTPGTLS IVHEGSGRPW ATIESLAAVP VRAPFAAGYR IAKTVTPVSP AVSGALTRGD VLRVRLDIDA QSDMTWVVVN DPIPAGATIL GSGLGRDSEA ATQGEKSPDG AWPAFVERDF DGYRAYYDYL PKGKLTVEYT VRVNNVGTFG LPPTRVEALY APSVYGLWPN PPMTVKPAVA GKP
|
| |