Gene Information |
Locus tag | BURPS1710b_A0002 |
Symbol | |
ID | 3692145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 15359 |
End bp | 20032 |
Gene Length | 4674 bp |
Protein Length | 1557 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637730256 |
Product | putative polyketide synthase |
Protein accession | YP_335161 |
Protein GI | 76819558 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
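The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes one terminal stop codon (the listed gene sequence ends in TGA). Both are assumptions that fit the numbers in this record rather than statements taken from it, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table.
length_bp = gene_length(15359, 20032)
print(length_bp)                  # 4674
print(protein_length(length_bp))  # 1557
```

Under these assumptions the computed values match the Gene Length (4674 bp) and Protein Length (1557 aa) fields.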
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCAA CGTTTAACTC CGACGAGGTC GCGCTTGCCG GCGACGCCCG GCGCACGGAC GGCGATGCCG ATCGCGCGGC CGCGTCGCCG CACGGGTTTC GCTTCGTCGA GAAGGCGCGT ATCGCCGACG AGCCCGGCCT GTCGGCGCGG CTCGCCGCGC GCAAGGGCGA CGCGCATCAG GACGGCGTGT TTCTCGACCT GTGGCCGTCG ATCTTCCTGT CGGACGAACA CGGCGACCAC ATCCACGTGC TGCGGGATCG CGGCCTGCTG TTCGTCACGC GCTTCGTCGG CGCGCCGCAG CGCGAGGCCG CGCTGCTGAG CGCGCTGCTC GATCGCGCGC GCGGCGAGGG CCGCGCGCTC TGCTACCTGG ACATGTCGAA CCGGCGCAAG CCCGACATCG AGCGCGAGTG CGGGCTGCTG TCGACGCCGC TCGGCGTCGT GCAGACGATC GACGACATTC GCCGCTTCAC GCTCGACGGG CCGCGGATGC GCAAGCTGCG TTACCTCGTG TCGAAGTTCG GGCGCGACCC GTCGTGCCGC GTCGTCGAAT ACACGGAGCC GGACCCCGCC GTCGACGGCG AGATCCGCGC GGTGATGGCG GCCTGGAGCC GCGAGAAAGG CGTCGTCAAC AAGGTCGATG CGATTCTCGC GGACATGCGC GTCGGCAATC TGCTCAAGCG CTACCGCGTC TACCTGACCT ATCTCGGCGA CCGGCTGCAG AACGTCGTCG TGCTGAGCCA CATCGGCGAA GGCTATATCA CCGATCAAGA GTATTTCGTG CCGGACATGC CGCTCGGCGG GACCGAGTAT GCATACGCGA CGATCATCGA ACGGCTCGCC GCAGAGGGGC ACCGCAAATT CAGTCTCGGG TTGACGTGGG GCCTGTTCGA GCCCGAGGCG GGCTTCAGCG ACGCCGAGGG CTGGACGCTC GTCAACCGCA CGGAAGGGCA GCTCGCGCAG ATCTTCCGGC GCGGCGTGCA GAACCATCAG TACAAGAACA AGTACTGCCC GGCGGAATAT CCGCTGTACC TGTACCGAAG CGCCGACAGC CGGCCGCAGA TCATCAAGCA GTGCATGGGG CAGTTCTTTC GCAACGGGGT GCCGTACGAC GAAATCGCGC GCCAGATCGA GGCGGACGAC GCGCGGGCCT TGGCGGCGGC CGGCGCCGCG CCGCCGCGCG CGACGCACGG CGGCGACGAG GCCGAAGCGC GTTCGGCCCT CGGCGGCGCG CCCGACGATG CGCGCGCCGG CGCGCCCGAC GATGCATCCT TCGACGCACC CGACGAAGCG CCCATCGAGG TATCCGACCA CGCACCCGGC AGCACGCCGC CCCCGCGCGC CGCGCGCGAC GAGCGCGGCG GCGAGCGCGA CGACGGCAAG GGCGGCCAAG CCGGCCAAGC CGACAGAAGC GGCAAGCACG ACCCAGCCGA CGCCCGCGGC CGTGCCGCGT CCGGCACGCC GGACGCGGCG CGAGCGCCGG CGCTCGCCGA CATTCCCGAC GCGTTCTTCG ATGCGACGCA GGCCGATCCG AACGCGATCC GGCTCGACCT CGTCAGCGAT TCGTGGGCCC ATCTCGGCTA CCCGTTCATT CGCGAGCGGG CGCGCCGGCT GCTTGCCGGC CTCGCGTCGC CGCACGCCGA TCCGGCCGCG CCGTGCGGTC TTTTCGGCGT CGATCACTGC GTGCTGACGA CTTCCGGGCG CAATGCGGAG CGGGTGTTCT TCAATCTGTT TCCGGCCAAG CGCAAGACGA TCCTGCAAAA CATCCCGTTC TTCTCGACCC GGCACAACGC GGCGAAAGCG 
GGATTCGCCT CGGTCGAGAT TCCCGATCCG CGGATCTTCG ATCCGGATTG CCGCGAAATA TTCCGCGGCG GCATCGATTT CGCGCGCCTG CGCGAGCAAC TGGAGGCGCG GCCGGACGGC GTCGCAATGG TGCTGATCGA GCTGTGCAAC AACGCGAGCG GCGGCTATCC GGTGCCGCTC GCGCAGATCG CCGACGTATC CGCGCTGTGC CGCGCGCGCG GCGTGCCGTT CGTGATGGAC GTCACGCGCA TCGTCAGGAA CGCGGAGCTG ATCCGGCGTC ACGAGCCCGG CTGCGCGAAC GTCGGGCTCT GGGACACCGT CGCCCGGATC GTCGCCCATG CGGACGTCGT GTTCGGCAGC CTCTGCAAGG ATTTCGGCGT GAGCGCGGGC GGCATCGTCG CGGCGAACGA CGGGCGGCTG ATCGGCAAGG CGCGGCGCTA CGCGGAAATC GAGGGCGCGC TGCTCGACCA CGTGCAGACG CAGGTGGTGT GCGCGTCGCT CGGCGAGCGC GACGCGCTCG AGCGGGGCGT CGCGGCGCAG CTCGATGTCG CGCGGCGCGT GAGCGACGCG CTCGACGCGC GGCGGATTCC GGCGCTGCTG CCCGTGGTCG GGCATTGCGC GCTGGTGCGC GCGGCCGACA TGCCGGGCTA TGCCGGCCGC CGGTATCCGC GCGAATCGCT GCTGCGCGCG CTGCTCGAGC GGCACGGCGT GCGCGCCGGC ATCCATCTCG CGGGCAGCGG CGCGGAGCGC GTCATCGACC GGTGCATCCG CATCGCGCTG CCGATCGGCC TGGACGACGC GCGGCTCGCA TCCGGGCTCG CCGACGCGCT GGCCGGAACC GCGCCGGGCG CAACGGATGC GCCCGCCGCG CTGCCCGACC TGCTGCATGC GCGCGCCCCC GGTGCCGCGG ACACGGCCGA CACCGTCGAC ACGGTCGATA CCGTCGATAC GGCTGATACG GCCGATACGG CTGATACGGC CGCGCAAGCC GGCGCCCGTC GCGGCGAGGC GCACGCGTCA CGCGCGCCGA TGCGGGCGAG CGACGACGAT GCGATCGCGA TCGTCGGCAT GGCGGGCCGC TACCCCGGCG CCGACGATCT GTCCGCGTTC TGGCGCAACC TCGTCGACGG CGTGAACGCG ATCACGGAAA TCCCGGCCGA GCGCTGGGAC TGGCGCGCGC ATTACCACCC CGATCCCGAG CAGGCGGCGC GGCTGCGCAA GTCGTACGGC AAGTGGGGCG GCTTTCTCGG CGAGTTCGAC TGTTTCGATC CGCTGTTCTT CTGGATGGCG CCGCGCCGCA TCGCGATGAT CGATCCGCAG GAGCGGCTGT TCCTCGAGGA GTGCTGGAAG GCGCTCGAGG ATGCGGGCTA CCCGCCGTCC CGCCTCGGCG ACGCGCTGCG CGAGCGCACG GGCGTGTTCG GCGGGCTGTC GAAGCACGGC TTCAGCCTGT ATGCGTCGCA GTATGCGGGC ACCCAGCCGC ATACGTCGCC CGCGTCGATG GTCGGCCGCG TGTCGCACTT CTTCGATCTG AAGGGCCCGA GCGTGGCGAT CGACAACCAT TGCGCGTCGT CGCTCGTCGC CGTTCACGAG GCCTGCGAAT ACCTGCGGCG GGGCGACGGC GATCTCGCGA TCGCGGGCGG CGTCAGCCTG TGCCTGCACC CGTCGAGCTA TGTGCAGCTC TCGCTCGTGC GGATGCTCTC GCGCGACGCG CACTGCGCGG CGTTCGACGA GGGCGGCGCG GGCTACGTGC CGGGCGAGGG GGTGGGCGTC GTCGTGCTCA AGCGGCTCGC GCAGGCGCGC GCGCACGGCG 
ATCCGATCCA CGCGGTGATC CGCTCCGGCG CGGTCAATCA CAACGGCCGC ATGCGCTACT ACGGCCAGCC CGATCAGGCG GGCCAGCAGG CCGCCATCCG GGCCGCGCTC GCGCGCGCGC GGATCGATCC GCGCTCGATC AGCTACATCG AGGCGGCCGC GAGCGGCGTC GAGACGACGG ACGCGGTCGA GATGGCCGCG CTGACCGAGG TGTTCGGCGA TCGGGCGGGC GCCGCGGGCG CCTACACGAT CGGTACGGTC AAGCCGGCGA TCGGGCACGG CGAGGCCGCG TCGGGCATGT CGCAACTGAT GCGCGTCGCG CTGTCGCTCA AGCACGCGAC GCTCACGCCG ACCCGGCTGC CACGGCGGCC GAGCCCGCTG ATCGATTTCG ATCGGCTGCC GTTCCGGCTC GCGGCCGAGG CGGCGCCGTG GGCGCCGGTG AGCGTCGACG GCCGGCCGGT GCCGCGGCGC GCCGGGGTCA CCGCGATCGG CAACGGCGTC AACGCGCATC TGGTGCTCGA GGAATGGCCG GGCGCGCCCG CCGACGATTC CGCCGCCGCG CCGCGCGAGC CGCAGGTGTT CGTGCTGTCC GCGCAGGACG GCGAGCGGCT CGCGGCATAC GTCGAGCGAT GGATCGCGTT CCTCGCGAGC GGCGCGACGC CCGATTTCGG GCGGATGCTG CGCACGCTGC AGATCGCGCG CGAGCCGATG CCCGCGCGGC TCGCGCTCGT CGCCTCCGAT CGCGACGACT TGCTGCGCGC GTTGCGCGCG TGGCGCGACG GCGGCGCGTC GTCGCGCGTG CATCGCGGCG ACGCCCGCCG GCGCGCCGGG CAGGCCGCGC TGGCGGAGCA GGCGTGCGAT CCGCGCGCGT GCGCGCCCGA CGAGGCGGCC GCGGCCTGGG TGCAAGGGCG CACGGTGCGC TGGGAGGCGC TTCACCGAGG CGGGCCGTGG CGGCGCGTCG GCGGTCTGCC GGCCTATCCG TTCGCGCGCG AGCGGTACTG GATCGCGGAC GCGGCATCCG GCGCGCCGGC AGGCAGGGAG GAAGCATCGG CGCGGCCCGA TTGA
|
Protein sequence | MQATFNSDEV ALAGDARRTD GDADRAAASP HGFRFVEKAR IADEPGLSAR LAARKGDAHQ DGVFLDLWPS IFLSDEHGDH IHVLRDRGLL FVTRFVGAPQ REAALLSALL DRARGEGRAL CYLDMSNRRK PDIERECGLL STPLGVVQTI DDIRRFTLDG PRMRKLRYLV SKFGRDPSCR VVEYTEPDPA VDGEIRAVMA AWSREKGVVN KVDAILADMR VGNLLKRYRV YLTYLGDRLQ NVVVLSHIGE GYITDQEYFV PDMPLGGTEY AYATIIERLA AEGHRKFSLG LTWGLFEPEA GFSDAEGWTL VNRTEGQLAQ IFRRGVQNHQ YKNKYCPAEY PLYLYRSADS RPQIIKQCMG QFFRNGVPYD EIARQIEADD ARALAAAGAA PPRATHGGDE AEARSALGGA PDDARAGAPD DASFDAPDEA PIEVSDHAPG STPPPRAARD ERGGERDDGK GGQAGQADRS GKHDPADARG RAASGTPDAA RAPALADIPD AFFDATQADP NAIRLDLVSD SWAHLGYPFI RERARRLLAG LASPHADPAA PCGLFGVDHC VLTTSGRNAE RVFFNLFPAK RKTILQNIPF FSTRHNAAKA GFASVEIPDP RIFDPDCREI FRGGIDFARL REQLEARPDG VAMVLIELCN NASGGYPVPL AQIADVSALC RARGVPFVMD VTRIVRNAEL IRRHEPGCAN VGLWDTVARI VAHADVVFGS LCKDFGVSAG GIVAANDGRL IGKARRYAEI EGALLDHVQT QVVCASLGER DALERGVAAQ LDVARRVSDA LDARRIPALL PVVGHCALVR AADMPGYAGR RYPRESLLRA LLERHGVRAG IHLAGSGAER VIDRCIRIAL PIGLDDARLA SGLADALAGT APGATDAPAA LPDLLHARAP GAADTADTVD TVDTVDTADT ADTADTAAQA GARRGEAHAS RAPMRASDDD AIAIVGMAGR YPGADDLSAF WRNLVDGVNA ITEIPAERWD WRAHYHPDPE QAARLRKSYG KWGGFLGEFD CFDPLFFWMA PRRIAMIDPQ ERLFLEECWK ALEDAGYPPS RLGDALRERT GVFGGLSKHG FSLYASQYAG TQPHTSPASM VGRVSHFFDL KGPSVAIDNH CASSLVAVHE ACEYLRRGDG DLAIAGGVSL CLHPSSYVQL SLVRMLSRDA HCAAFDEGGA GYVPGEGVGV VVLKRLAQAR AHGDPIHAVI RSGAVNHNGR MRYYGQPDQA GQQAAIRAAL ARARIDPRSI SYIEAAASGV ETTDAVEMAA LTEVFGDRAG AAGAYTIGTV KPAIGHGEAA SGMSQLMRVA LSLKHATLTP TRLPRRPSPL IDFDRLPFRL AAEAAPWAPV SVDGRPVPRR AGVTAIGNGV NAHLVLEEWP GAPADDSAAA PREPQVFVLS AQDGERLAAY VERWIAFLAS GATPDFGRML RTLQIAREPM PARLALVASD RDDLLRALRA WRDGGASSRV HRGDARRRAG QAALAEQACD PRACAPDEAA AAWVQGRTVR WEALHRGGPW RRVGGLPAYP FARERYWIAD AASGAPAGRE EASARPD
|
| |
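The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be normalized, its GC content computed, and its leading codons translated. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used as input, so the GC value printed here is not the whole-gene figure of 72%.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCAAGCAA CGTTTAACTC CGACGAGGTC")
print(len(fragment))                   # 30
print(round(gc_percent(fragment), 1))  # 50.0
print(translate(fragment))             # MQATFNSDEV
```

The translated fragment agrees with the first ten residues of the listed protein sequence. Passing the full gene sequence through the same functions would give the whole-gene GC content and the complete translation.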