Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_A2407 |
Symbol | |
ID | 3694104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 2916811 |
End bp | 2923014 |
Gene Length | 6204 bp |
Protein Length | 2067 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637732661 |
Product | alpha-2-macroglobulin family protein |
Protein accession | YP_337557 |
Protein GI | 76817415 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTATCAA ACCGAAAGCT TCCCTCCGAG GGCCATCGCG CGATGAAGCA CGACGACAAG CACAACAAGC CAGGCCATCC GAACCCGTCG ATCCTCCGTC TGCTCGCGCG CATCGGCGCG GCCGCGGCGC TCGGCGCGGC CGCCGCGCTG TCGCTGCACG CCGACGCGGC GCGCACCGTG AACGTGTCGC CGCAAGGCAC CGTCGCCGAA GTCCGGCAGG CCGTCGTCAA GTTCGACGAG GCGATGGTCG CGTTCGGCTC GGCGTCCGCG CCGGCCCCCG CGCGCCTCGC GTGCGCCGAT CCCGCCGCCG CGCGCGGCCA CGGCCGCTGG CTCGACGAGA AGACCTGGGC CTATGACTTC GAGAACGATC TGCCGCCCGG CGTACGCTGC ACGGTCGCGC TCAACGACAC GCTGCGCTCG GCCGCCGGCC ACGCGGTGAC GGGCCCGCGC CGCTTCGCGT TCCAGACGGG CGGCCCGTTC CCGGTCGCGG TACGGCCGGG CGCGCGCGAG ATCGAAGAGC GGCAGGTGTT CGTCGTCAAG CTGAACGGCC CGGCCGACGA ACGCTCGGCG CTCGCGAGCA TCTGGTGCGA GGCCGCCGGC ATCGGCAACC GCATCCCGGT GGCGCCCGCC GACGCGCCGA CGCGCGCCGC GCTCCTCGAT CACTTCCACT GGAAAAAGGA CGCCGCGCGC GTGCTCACGC TCGCGTGCGC GCAGGCGCTG CCCGCGGGCG CGAAGATGCA GCTCGTCTAC GGCCGCGGCG TCGCGAGCCC GAGCGGCATC GCGAACGACA CCGAGCGGCG CTACGACTTC ACGGTGCGCG CGCCGTTCGC CGCGAGCTTC TCGTGCGAGC GCGAGAACGC GAAGGCGCCG TGCACGCCGC TGCGCCCGCT CACGCTGTCG TTCAACGCGC CGGTGGCGCG CCGCGCCGCC GGCGAGATCC GGCTGCGCGG CCCGCACGGC GCGGTCGAGC CGTTCTTCAA GCCCGACGAT CGCGCGGAGG AAGTCACGAG CGTGCAGTTC GCCGCGCCGC TGCCCGCGCA GGCCGCGCTG ACGATCGAGC TGCCGCCGGC GCTGCGCGAC GTGACGGGCC GCACGCTGTC GAACGCCGAT CTGTTCCCGC TCGCGACGCG CACCGCGCCG ATGCCGCCGC TCGCGAAGTT CTCGTCGGGC ACGTTCGGCA TCGTCGAGCG CTTCGCCGAG CCGGATTCGC CCGCGCTCGT GCCCGTCACG CTGCGCAACG TCGAGGCCGA CCTGCGCATC GCGGGGCTGA ACGCGGGCGG CGCGCAGTTC TCGAACCTGA AGGTCGAGAA CGACAGCGAG ATCCGCCGCT GGATGCAGCT CGTCGAGCGT TTCGACGGCC GCGCGATGAG CGTCGAGTCG ATCGACAAGC TCCGCCCCGG CCTGCTCGCG CGCGGCCAGC ATCCCGTCTA CGTGCCGCTC GCCGCGGGCG AGCGCGCGCC GAAGCCGCAG CACCGGCAGA TCGACATCCG CTCGCTGTCG CTGCTCGCGG GCGAGCCCGG CGTACGGACG CTGACGCTGC CGAAGGCCGA CCCGAAGGCG CTGCGCCCGT TCGAGATCGT CGGCGTGCCG ATCGACAAGC CGGGCTTTCA CGTGCTCGAG CTCGCGTCGC CCGCGCTCGG CCGCTCGCTG CTCGCCAAGC CCGCGAAGAT GTACGTGCGC ACCGCGGTGC TCGTCACGAA CCTCGGCGTG CATCTGAAGC TCGGCCGCGA GAACAGCGTC GTCTGGGTGA CGACGCTCGA CAAGGGCAAG CCCGTGCCGA ACGCGCAGGT GCGCGTGTCC GACTGCAACG GCGACGAAAT CGCGGCCGGC AGGACCAACG CGCAAGGGCT CGTCACGATC GATGCGCCCC TCGAGCCCAA GCGCGCATGC GACAGCTCGA ACGGCGACGG CGACTATTTC GTGTCCGCCC GCGTCGACGA TCCGAAGACG GGCCCCGACA TGGCGTTCGT GCGCTCGAGC TGGAACCGCG GCATCGAATC GTGGCGCTTC AACGTGCCGA CCGACATGAG CGACACGCCG ACCGTGCGCG CGCATACGGT GTTCGACCGC ACGCTCGTGC GCGCGGGCGA GACGGTATCG ATGAAGCACT TCGTGCGCGA GGAGACGCTG CGGGGCCTCG CGTTCCCGCC GCGCTACCCG TCGCGCGCGA CGATCCGCCA TCTCGGCAGC GGCCAGACGT ACCGCGTGCC GCTCGCATGG GCCGCCGATC ACACCGCCGA CACGCGCTTC GCGCTGCCCG CGGCGGCGAA GCTCGGCGAA TACAGCGTCG AGCTCGAGGA CGGCCCCGAG GACGCGCCGA GCGCGAGCTA CTACGGCGGC AGCTTCCGCG TCGAGGCGTT CCGGCTGCCC GTCTTCAAAG GCTCGATCGG CGTGCGCGAC GCGAAGGCGA GCCCGCTCGT CGGCGCGAAG GACGCGCCGC TCGCGGTGCA GATCGATTAC GTGTCGGGCG GCGGCGCGTC GAACCTGCCC GTGCAGGTGT CGGCGCTCGT CAAGCGCGCC GAGCCGCCGT TCGCCGAGCG CTATCCCGAT TTCGGCTTCG AGCCGTACCG CCCGCAAACG CAGGACGCGA CGGCCGACGA CGAGGACACG CAGGACGGCG AGAACGCGTC GCGCGACACC GATCCCGACG CGACGAAGCT CATCGCCGAC AAGATCGCGC TCACGCTCGA TCGCACCGGC TCGGGCGCGC TCACGCTGAA GGGCCTGCCG GCCGTCGACG CGCCCAAGCG CGTCGCGCTC GAGGCGACGT TCGCCGATCC GAACGGCGAG GTACAGACGA TTCGCGGCGA CGCGATGCTG TGGCCGGCCG CGGTCGTCGC CGGCATCCAG GCGGGCCACT GGGTGTCGGT CGGCCAGCGC GTGCCGGTGC AGGCGCTCGT TGTCGATCTG CAGGGCCGCC CGCGCGCGTC GGCGGCGGTC GAGATCAAGG GCGTCGCGCG CGTGACGACC TCCTCGCGCA AGCGGATGGT CGGCGGCTTC TACGCGTACG ACAACCAGAG CGACACGCGC GAGCTGGGCG TGCTGTGCTC GGGCAAGACC GACGCGCAGG GCCGGCTGGC GTGCGACGCC ACGCTCTCGC AGGCGGGCAA CGTGCAACTG ATCGCGGTCG CGAAGGACGG CGACGGCCGC GCGTCGAACG CGTCGACGTC GGTATGGGTC ACGCGCGAGG ACGATCTCTG GTTCGGCGGC GAGAACACCG ACCGGATCGA CGTGATCCCC GAGAAGGCGT CGTACGAGCC GGGCGACACC GCGCGCTTCC AGGTGCGCAT GCCGTTTCGC CATGCCACGG CGCTCGTCGC CGTCGAGCGC GGCGGCGTGA TGCAGACGCG CGTCGTCGAG CTGAACGGCA AGAATCCGAC CGTCGATCTG AAGGTCGGCG ACACGTGGGG GCCGAACGTC TACGTATCGG TGCTCGCGCT GCGCGGGCGG CTGCGCGACG TGCCGTGGTA CTCGTTCTTC ACGTGGGGCT GGAAGGCGCC CCTCGAATGG GCGCGCGCGT TCTGGCGCGA AGGCCGCCGC TACGAGGCGC CGAGCGCGCT CGTCGACCTG TCGAAGCCCG CGTTCCGCTA CGGCCTGGGC GAGATCAAGG TCGGCACGGG CGCGCACCGG CTCGGCGTCG CAGTGACGAC CGACGCGGCC CGCTATCCGG TGCGCGGCAC CGCGCACGCG CGCGTGAAGG TCACGCTGCC GGACGGCAAG CCCGCGCCCG CCGGCACGCA GATCGCGCTC GCCGCGGTCG ACGAGGCGCT CCTCGAGCTG ATGCCGAACC GCAGTTGGGA CCTGCTCGAT GCGATGCTGC AACGGCGCGC GTACGGCGTC GAGACGGCCA CCGCGCAAAT GGAGATCGTC GGCCGCCGCC ACTTCGGACG CAAGGCCGTG CCCGCGGGCG GCGGCGGCGG GATGGCGCCG ACCCGCGAGC TGTTCGACAC GCTGTTGCTG TGGAACCCGC GCGTCACGCT CGACGCGAAC GGCAGCGCGA GCGTCGACGT GCCGCTCAAC GATGCGCTCA CGCGCTTTCG GATCGTCGCG ATCGCGGCGA CGGGCGCGGA GCGCTTCGGC ACCGGCAGCG CGACGATCCG CAGCACGCAG GATCTGCAAC TGATCTCGGG CCTGCCGCCG CTCGTGCGCG AAGGCGACGC GTTCCGCGCG CAGGTGACGG TGCGCAACAC GACCGAGCGC AAGATGGACG TCGTCGTCAC GCCGCGCGTG CCGGGCATCG ACGCGGCGCC GCGGAAGATA TCGCTCGCGC CCGATTCCGC GCAGGAAATC GCGTGGGACG TCACGGTGCC CGAGACGGCG CTCGACGCCG CGGGCGCGCT GAACTGGCGC ATCGAGGCGG CCGAGCAAGG CGGCAAGCGC GCGGCCGACG CGCTCGCGCT CGCGCAGAAG GTCGTGCCGG CGCTGCCCGT GACGGTCCAG CAGGCGACGC TCGCGCAAGT CGACGGCACG CTGAGCGTGC CCGTCGCGCC GCCCGCCGGC GCCATGCCCG ACGCGCGCGG CGCGCCGCGC GGCGGCATCG CCGTGTCGCT GCAATCGACG CTCGCCGACG GGCTGCCCGG CGTGCGCCGC TGGTTCGAGC GCTATCCGTA CCGCTGCCTC GAACAGCAGG CGTCGCGCGC GATCGGCTTG CGCGACGCCG CGCAATGGCA GGCGCTCACC GCGCGGATGC CGGTCTACCT CGACCGCGAC GGGCTCGCGA GCTACTTCCC GCCTTCGTCC GACGATGCGC ACTCCGGCAG CCCGCCGCTG TCCGCGTACC TGCTCGTGCT CGCCGACGAG GCGAGCCGCG CCGACGCGCG CTTCGCGCTG CCCGAGGACG TGCGCACGCA GCTCGAGGCC GGGCTCGCGC GCTTCGTCGA GGGGCGCATC GAGCGCGACA CCTGGGCGCC GCGCCAGGAT CGCGACCTGC GCAAGCTCGC GGCGATCGAG GCGCTGTCGC GCTACGGCGC CGCGCAAGGC CGGATGCTCG GCTCGATCGA GATCGCGCCG AACCAGTGGC CGACCTCGGC CGTGCTCGAC TATCACGCGA TCCTCACGCG CGTGAAGGAC ATCGCGCGGC GCGACGAGAA GCGCGCGCAG GCCGAGCAGA TCCTGCGCGC GCGGCTCGCC TACCAGGGCA CGCAGCTCGT GTTCTCGACC GCGCGCGGCG ACGACCTGTG GTGGCTGATG ACAAGCAACG AGACGAACGC GGCGCGCCTC GCGCTGGCAT TCGCCGGCGA GGCGGGCTGG AAGGACGAGA TGCCGCGCGT CGCGGCCGGC CTGCTCGCGC TGCAGAAGAA CGGCGCGTGG CAGACGACGA CCGCGAACGC GCTCGGCCTG CTCGCGCTCG AGCGCTTCTC GCGCACGTAC GAGCGCGCGC CGGTTGCCGG CGCGACGAAG ATCGCGTTGG GCGACGACAC GCGCTCGATC GCGTGGTCGC AGCCGGCGGG CGCGGGGGGC GCCACTGTCG CGACGGGCGC GACGGGCGCG GCGGCAACGG CCGGCGCGGC ATTGGCGTCC GGCGCTTCGG CGTCGGCGGC CGCGAAGCCG GCCGCCACGC AATCGCGCAC GCCGCCGCCG TCGAGCGGCA CACCGCCGCC GAGCGCCGCG ACGCGGGCGG CCGCCGCGCA CAGCGTGACG CTGCCGTGGC CGCGCGGCGC ACGCACGCCG GGCACGCTGT CGATCGTGCA CGAAGGCAGC GGGCGGCCGT GGGCGACGAT CGAAAGCCTC GCCGCGGTGC CGGTGCGTGC GCCGTTCGCG GCCGGCTACC GGATCGCGAA AACCGTGACG CCGGTGTCGC CCGCGGTCAG CGGCGCGCTC ACGCGCGGTG ACGTGCTGCG CGTGCGTCTC GACATCGACG CGCAGAGCGA CATGACGTGG GTGGTCGTCA ACGATCCGAT TCCGGCCGGC GCGACGATCC TGGGCTCCGG CCTCGGCCGC GACTCCGAGG CCGCGACGCA GGGCGAGAAG TCGCCCGACG GCGCGTGGCC CGCGTTCGTC GAGCGCGACT TCGACGGCTA TCGCGCGTAC TACGACTATT TACCGAAGGG CAAATTGACG GTCGAGTACA CGGTGCGCGT GAACAACGTC GGCACGTTCG GGCTGCCGCC GACGCGCGTC GAGGCGCTCT ACGCGCCGTC CGTGTACGGG CTGTGGCCGA ACCCGCCGAT GACGGTCAAG CCGGCCGTCG CGAGCAAGCC GTGA
|
Protein sequence | MLSNRKLPSE GHRAMKHDDK HNKPGHPNPS ILRLLARIGA AAALGAAAAL SLHADAARTV NVSPQGTVAE VRQAVVKFDE AMVAFGSASA PAPARLACAD PAAARGHGRW LDEKTWAYDF ENDLPPGVRC TVALNDTLRS AAGHAVTGPR RFAFQTGGPF PVAVRPGARE IEERQVFVVK LNGPADERSA LASIWCEAAG IGNRIPVAPA DAPTRAALLD HFHWKKDAAR VLTLACAQAL PAGAKMQLVY GRGVASPSGI ANDTERRYDF TVRAPFAASF SCERENAKAP CTPLRPLTLS FNAPVARRAA GEIRLRGPHG AVEPFFKPDD RAEEVTSVQF AAPLPAQAAL TIELPPALRD VTGRTLSNAD LFPLATRTAP MPPLAKFSSG TFGIVERFAE PDSPALVPVT LRNVEADLRI AGLNAGGAQF SNLKVENDSE IRRWMQLVER FDGRAMSVES IDKLRPGLLA RGQHPVYVPL AAGERAPKPQ HRQIDIRSLS LLAGEPGVRT LTLPKADPKA LRPFEIVGVP IDKPGFHVLE LASPALGRSL LAKPAKMYVR TAVLVTNLGV HLKLGRENSV VWVTTLDKGK PVPNAQVRVS DCNGDEIAAG RTNAQGLVTI DAPLEPKRAC DSSNGDGDYF VSARVDDPKT GPDMAFVRSS WNRGIESWRF NVPTDMSDTP TVRAHTVFDR TLVRAGETVS MKHFVREETL RGLAFPPRYP SRATIRHLGS GQTYRVPLAW AADHTADTRF ALPAAAKLGE YSVELEDGPE DAPSASYYGG SFRVEAFRLP VFKGSIGVRD AKASPLVGAK DAPLAVQIDY VSGGGASNLP VQVSALVKRA EPPFAERYPD FGFEPYRPQT QDATADDEDT QDGENASRDT DPDATKLIAD KIALTLDRTG SGALTLKGLP AVDAPKRVAL EATFADPNGE VQTIRGDAML WPAAVVAGIQ AGHWVSVGQR VPVQALVVDL QGRPRASAAV EIKGVARVTT SSRKRMVGGF YAYDNQSDTR ELGVLCSGKT DAQGRLACDA TLSQAGNVQL IAVAKDGDGR ASNASTSVWV TREDDLWFGG ENTDRIDVIP EKASYEPGDT ARFQVRMPFR HATALVAVER GGVMQTRVVE LNGKNPTVDL KVGDTWGPNV YVSVLALRGR LRDVPWYSFF TWGWKAPLEW ARAFWREGRR YEAPSALVDL SKPAFRYGLG EIKVGTGAHR LGVAVTTDAA RYPVRGTAHA RVKVTLPDGK PAPAGTQIAL AAVDEALLEL MPNRSWDLLD AMLQRRAYGV ETATAQMEIV GRRHFGRKAV PAGGGGGMAP TRELFDTLLL WNPRVTLDAN GSASVDVPLN DALTRFRIVA IAATGAERFG TGSATIRSTQ DLQLISGLPP LVREGDAFRA QVTVRNTTER KMDVVVTPRV PGIDAAPRKI SLAPDSAQEI AWDVTVPETA LDAAGALNWR IEAAEQGGKR AADALALAQK VVPALPVTVQ QATLAQVDGT LSVPVAPPAG AMPDARGAPR GGIAVSLQST LADGLPGVRR WFERYPYRCL EQQASRAIGL RDAAQWQALT ARMPVYLDRD GLASYFPPSS DDAHSGSPPL SAYLLVLADE ASRADARFAL PEDVRTQLEA GLARFVEGRI ERDTWAPRQD RDLRKLAAIE ALSRYGAAQG RMLGSIEIAP NQWPTSAVLD YHAILTRVKD IARRDEKRAQ AEQILRARLA YQGTQLVFST ARGDDLWWLM TSNETNAARL ALAFAGEAGW KDEMPRVAAG LLALQKNGAW QTTTANALGL LALERFSRTY ERAPVAGATK IALGDDTRSI AWSQPAGAGG ATVATGATGA AATAGAALAS GASASAAAKP AATQSRTPPP SSGTPPPSAA TRAAAAHSVT LPWPRGARTP GTLSIVHEGS GRPWATIESL AAVPVRAPFA AGYRIAKTVT PVSPAVSGAL TRGDVLRVRL DIDAQSDMTW VVVNDPIPAG ATILGSGLGR DSEAATQGEK SPDGAWPAFV ERDFDGYRAY YDYLPKGKLT VEYTVRVNNV GTFGLPPTRV EALYAPSVYG LWPNPPMTVK PAVASKP
|
| |