Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1481 |
Symbol | |
ID | 4887197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1421835 |
End bp | 1426511 |
Gene Length | 4677 bp |
Protein Length | 1558 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640131420 |
Product | polyketide synthase |
Protein accession | YP_001062477 |
Protein GI | 126445080 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.457199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGCAA CGTTTAACCG CGGCGAGGTC GCGCTTGCCG GCGACGCCCG GCGCACGGAC GGCGATGCCG ATCGCGCGGC CGCGTCGCCG CACGGGTTTC GCTTCGTCGA GAAGGCGCGT ATCGCCGACG AGCCCGGCCT GTCGGCGCGG CTCGCCGCGC GCAAGGGCGA CGCGCATCAG GACGGCGTGT TTCTCGACCT GTGGCCGTCG ATCTTCCTGT CGGACGAACA CGGCGACCAC ATCCACGTGC TGCGGGATCG CGGCCTGCTG TTCGTCACGC GCTTCGTCGG CGCGCCGCAG CGCGAGGCCG CGCTGCTGAG CGCGCTGCTC GATCGCGCGC GCGGCGAGGG CCGCGCGCTC TGCTACCTGG ACATGTCGAA CCGGCGCAAG CCCGACATCG AGCGCGAGTG CGGGCTGCTG TCGACGCCGC TCGGCGTCGT GCAGACGATC GACGACATTC GCCGCTTCAC GCTCGACGGG CCGCGGATGC GCAAGCTGCG TTACCTCGTG TCGAAGTTCG GGCGCGACCC GTCGTGCCGC GTCGTCGAAT ACACGGAGCC GGACCCCGCC GTCGACGGCG AGATCCGCGC GGTGATGGCG GCCTGGAGCC GCGAGAAAGG CGTCGTCAAC AAGGTCGATG CGATTCTCGC GGACATGCGC GTCGGCAATC TGCTCAAGCG CTACCGCGTC TATCTGACCT ATCTCGGCGA CCGGCTGCAG AACGTCGTCG TGCTGAGCCA CATCGGCGAC GGCTATATCA CCGATCAAGA GTATTTCGTG CCGGACATGC CGCTCGGCGG GACCGAGTAT GCATACGCGA CGATCATCGA ACGGCTCGCC GCCGAGGGGC ACCGCAAATT CAGTCTCGGG TTGACGTGGG GCCTGTTCGA GCCCGAGGCG GGCTTCAGCG ACGCCGAGGG CTGGGCGCTC GTCAACCGCA CGGAAGGGCA GCTCGCGCAG ATCTTCCGGC GCGGCGTGCA GAACCATCAG TACAAGAACA AGTACTGCCC GGCGGAATAT CCGCTGTACC TGTACCGAAG CGCCGACAGC CGGCCGCAGA TCATCAAGCA GTGCATGGGG CAGTTCTTTC GCAACGGGGT GCCGTACGAC GAAATCGCGC GCCAGATCGA GGCGGACGAC GCGCGGGCCT TGGCGGCGGC CGGCGCCGTG CCGCCGCGCG CGACGCACGG CGGCGACGAG GCCGAAGCGC GTTCGGCCCC CGGCGGCGCG CCCGACGATG CGCGCGCCGG CGCGCCCGAC GATGCATCCT TCGACGCACC CGACGAAGCG CCCATCGAGG TATCCGACCA CGCACCCGGC AGCACGCCGC CCCCGCGCGC CGCGCGCGAC GAGCGCGGCG GCGAGCGCGA CGACGGCAAG GGCGGCCAAG CCGGCCAAGC CGACAGAAGC GGCAAGCACG ACCCAGCCGA CGCCCACGGC CGTGCCGCGT CCGGCACGCC GGACGCGGCG CGAGCGCCGG CGCCCGCCGA CATTCCCGAC GCGTTCTTCG ATGCGACGCA GGCCGATCCG AACGCGATCC GGCTCGACCT CGTCAGCGAT TCGTGGGCCC ATCTCGGCTA CCCGTTCATT CGCGAACGGG CGCGCCGGCT GCTTGCCGGC CTCGCGTCGC CGCACGCCGA TCCGGCCGCG CCGTGCGGCC TTTTCGGCGT CGATCACTGC GTGCTGACGA CTTCCGGGCG CAATGCGGAG CGGGTGTTCT TCAATCTGTT TCCGGCCAAG CGCAAGACGA TCCTGCAAAA CATCCCGTTC TTCTCGACCC GGCACAACAC GGCGAAAGCG GGATTCGCCT CGGTCGAGAT TCCCGATCCG CGGATCTTCG ATCCGGATTG CCGCGAAATA TTCCGCGGCG GCATCGATTT CGCGCGCCTG CGCGAGCAAC TGGAGGCGCG GCCGGACGGC GTCGCGATGG TGCTGATCGA GCTGTGCAAC AACGCGAGCG GCGGCTATCC GGTGCCGCTC GCGCAGATCG CCGACGTATC CGCGCTGTGC CGCGCGCGCG GCGTGCCGTT CGTGATGGAC GTCACGCGCA TCGTCAGGAA CGCGGAGCTG ATCCGGCGTC ACGAGCCCGG CTGCGCGAAC GTCGGGCTCT GGGACACCGT CGCCCGGATC GTCGCCCATG CGGACGTCGT ATTCGGCAGC CTCTGCAAGG ATTTCGGCGT GAGCGCGGGC GGCATCGTCG CGGCGAACGA CGGGCGGCTG ATCGGCAAGG CGCGGCGCTA CGCGGAAATC GAGGGCGCGC TGCTCGACCA CGTGCAGACG CAGGTGGTGT GCGCGTCGCT CGGCGAGCGC GACGCGCTCG AGCGGGGCGT CGCGGCGCAG CTCGATGTCG CGCGGCGCGT GAGCGACGCG CTCGATGCGC GGCGGATTCC GGCGCTGCTG CCCGTGGTCG GGCATTGCGC GCTGGTGCGC GCGGCCGACA TGCCGGGCTA TGCCGGCCGC CGGTATCCGC GCGAATCGCT GCTGCGCGCG CTGCTCGAGC GGCACGGCGT GCGCGCCGGC ATCCATCTCG CGGGCAGCGG CGTGGAGCGC GTCATCGACC GGTGCATCCG CATCGCGCTG CCGATCGGCC TGGACGACGC GCGGCTCGCA TCCGGGCTCG CCGACGCGCT GGCCGGAACC GCGCCGGGCG CAACGGATGC GCCCGCCGCG CTGCCCGACC TGCTGCATGC GCGCGCCCCC GGTGCCGCAG ACACGGCCGA CACCGTCGAC ACGGTCGATA CCGTCGATAC GGCTGATACG GCTGATACGG CTGATACGGC CGCGCAGGCC GGCGTCCGTC GCGGCGAGGC GCACGCGTCA CGCGCGCCGA TGCGGGCGAG CGACGACGAT GCGATCGCGA TCGTCGGCAT GGCGGGCCGC TACCCCGGCG CCGACGATCT GTCCGCGTTC TGGCGCAACC TCGTCGACGG CGTGAACGCG ATCACGGAAA TCCCGGCCGA GCGCTGGGAC TGGCGCGCGC ATTACCACCC CGATCCCGAG CAGGCGGCGC GGCTGCGCAA GTCGTACGGC AAGTGGGGCG GCTTTCTCGG CGAGTTCGAC TGTTTCGATC CGCTGTTCTT CTGGATGGCG CCGCGCCGCA TCGCGATGAT CGATCCGCAG GAGCGGCTGT TCCTCGAGGA GTGCTGGAAG GCGCTCGAGG ATGCGGGCTA CCCGCCGTCC CGCCTCGGCG ACGCGCTGCG CGAGCGCACG GGCGTGTTCG GCGGGCTGTC GAAGCACGGC TTCAGCCTGT ATGCGTCGCA GTATGCGGGC ACCCAGCCGC ATACGTCGCC CGCGTCGATG GTCGGCCGCG TGTCGCACTT CTTCGATCTG AAGGGCCCGA GCGTGGCGAT CGACAACCAT TGCGCGTCGT CGCTCGTCGC CGTTCACGAG GCCTGCGAAT ACCTGCGGCG GGGCGACGGC GATCTCGCGA TCGCGGGCGG CGTCAGCCTG TGCCTGCACC CGTCGAGCTA TGTGCAGCTC TCGCTCGTGC GGATGCTCTC GCGCGACGCG CACTGCGCGG CGTTCGACGA GGGCGGCGCG GGCTACGTGC CGGGCGAGGG GGTGGGCGTC GTCGTGCTCA AGCGGCTCGC GCAGGCGCGC GCGCACGGCG ATCCGATCCA CGCGGTGATC CGCTCCGGCG CGGTCAATCA CAACGGCCGC ATGCGCTACT ACGGCCAGCC CGATCAGGCG GGCCAGCAGG CCGCCATCCG GGCCGCGCTC GCGCGCGCGC GGATCGATCC GCGCTCGATC AGCTACATCG AGGCGGCCGC GAGCGGCGTC GAGACGACGG ACGCGGTCGA GATGGCCGCG CTGACCGAGG TGTTCGGCGA TCGGGCGGGC GCCGCGGGCG CCTACACGAT CGGCACGGTC AAGCCGGCGA TCGGGCACGG CGAGGCCGCG TCGGGCATGT CGCAACTGAC GCGCGTCGCG CTGTCGCTCA AGCACGCGAC GCTCACGCCG ACCCGGCTGC CACGGCGGCC GAGCCCGCTG ATCGATTTCG ATCGGCTGCC GTTCCGGCTC GCGGCCGAGG CGGCGCCGTG GGCGCCGGTG AGCGTCGACG GCCGGCCGGT GCCGCGGCGC GCCGGGGTCA CCGCGATCGG CAACGGCGTC AACGCGCATC TGGTGCTCGA GGAATGGCCG GGCGCGCCCG CCGACGATTC CGCCGCCGCG CCGCGCGAGC CGCAGGTGTT CGTGCTGTCC GCGCAGGACG GCGAGCGGCT CGCGGCGTAC GTCGAGCGAT GGATCGCGTT CCTCGCGAGC GGCGCGACGC CCGATTTCGG GCGGATGCTG CGCACGCTGC AGATCGCACG CGAGCCGATG CCCGCGCGGC TCGCGCTCGT CGCCTCCGAT CGCGACGACT TGCTGCGCGC GTTGCGCGCG TGGCGCGACG GCGGCGGCGC GTCGTCGCGC GTGCATCGCG GCGACGCCCG CCGGCGCGCC GGGCAGGCCG CGCTGGCGGA GCAGGCGTGC GATCCGCGCG CGTGCGCGCC CGACGAGGCG GCCGCGGCCT GGGTGCAAGG GCGCACGGTG CGCTGGGAGG CGCTTCACCG AGGCGGGCCG TGGCGGCGCG TCGGCGGTCT GCCGGCCTAT CCGTTCGCGC GCGAGCGGTA CTGGATCGCG GACGCGGCAT CCGGCGCGCC GGCAGGCAGG GAGGAAGCAT CGGCGCGGCC CGATTGA
|
Protein sequence | MQATFNRGEV ALAGDARRTD GDADRAAASP HGFRFVEKAR IADEPGLSAR LAARKGDAHQ DGVFLDLWPS IFLSDEHGDH IHVLRDRGLL FVTRFVGAPQ REAALLSALL DRARGEGRAL CYLDMSNRRK PDIERECGLL STPLGVVQTI DDIRRFTLDG PRMRKLRYLV SKFGRDPSCR VVEYTEPDPA VDGEIRAVMA AWSREKGVVN KVDAILADMR VGNLLKRYRV YLTYLGDRLQ NVVVLSHIGD GYITDQEYFV PDMPLGGTEY AYATIIERLA AEGHRKFSLG LTWGLFEPEA GFSDAEGWAL VNRTEGQLAQ IFRRGVQNHQ YKNKYCPAEY PLYLYRSADS RPQIIKQCMG QFFRNGVPYD EIARQIEADD ARALAAAGAV PPRATHGGDE AEARSAPGGA PDDARAGAPD DASFDAPDEA PIEVSDHAPG STPPPRAARD ERGGERDDGK GGQAGQADRS GKHDPADAHG RAASGTPDAA RAPAPADIPD AFFDATQADP NAIRLDLVSD SWAHLGYPFI RERARRLLAG LASPHADPAA PCGLFGVDHC VLTTSGRNAE RVFFNLFPAK RKTILQNIPF FSTRHNTAKA GFASVEIPDP RIFDPDCREI FRGGIDFARL REQLEARPDG VAMVLIELCN NASGGYPVPL AQIADVSALC RARGVPFVMD VTRIVRNAEL IRRHEPGCAN VGLWDTVARI VAHADVVFGS LCKDFGVSAG GIVAANDGRL IGKARRYAEI EGALLDHVQT QVVCASLGER DALERGVAAQ LDVARRVSDA LDARRIPALL PVVGHCALVR AADMPGYAGR RYPRESLLRA LLERHGVRAG IHLAGSGVER VIDRCIRIAL PIGLDDARLA SGLADALAGT APGATDAPAA LPDLLHARAP GAADTADTVD TVDTVDTADT ADTADTAAQA GVRRGEAHAS RAPMRASDDD AIAIVGMAGR YPGADDLSAF WRNLVDGVNA ITEIPAERWD WRAHYHPDPE QAARLRKSYG KWGGFLGEFD CFDPLFFWMA PRRIAMIDPQ ERLFLEECWK ALEDAGYPPS RLGDALRERT GVFGGLSKHG FSLYASQYAG TQPHTSPASM VGRVSHFFDL KGPSVAIDNH CASSLVAVHE ACEYLRRGDG DLAIAGGVSL CLHPSSYVQL SLVRMLSRDA HCAAFDEGGA GYVPGEGVGV VVLKRLAQAR AHGDPIHAVI RSGAVNHNGR MRYYGQPDQA GQQAAIRAAL ARARIDPRSI SYIEAAASGV ETTDAVEMAA LTEVFGDRAG AAGAYTIGTV KPAIGHGEAA SGMSQLTRVA LSLKHATLTP TRLPRRPSPL IDFDRLPFRL AAEAAPWAPV SVDGRPVPRR AGVTAIGNGV NAHLVLEEWP GAPADDSAAA PREPQVFVLS AQDGERLAAY VERWIAFLAS GATPDFGRML RTLQIAREPM PARLALVASD RDDLLRALRA WRDGGGASSR VHRGDARRRA GQAALAEQAC DPRACAPDEA AAAWVQGRTV RWEALHRGGP WRRVGGLPAY PFARERYWIA DAASGAPAGR EEASARPD
|
| |