Gene BURPS668_A1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1481 
Symbol 
ID4887197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1421835 
End bp1426511 
Gene Length4677 bp 
Protein Length1558 aa 
Translation table11 
GC content72% 
IMG OID640131420 
Productpolyketide synthase 
Protein accessionYP_001062477 
Protein GI126445080 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3321] Polyketide synthase modules and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.457199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGCAA CGTTTAACCG CGGCGAGGTC GCGCTTGCCG GCGACGCCCG GCGCACGGAC 
GGCGATGCCG ATCGCGCGGC CGCGTCGCCG CACGGGTTTC GCTTCGTCGA GAAGGCGCGT
ATCGCCGACG AGCCCGGCCT GTCGGCGCGG CTCGCCGCGC GCAAGGGCGA CGCGCATCAG
GACGGCGTGT TTCTCGACCT GTGGCCGTCG ATCTTCCTGT CGGACGAACA CGGCGACCAC
ATCCACGTGC TGCGGGATCG CGGCCTGCTG TTCGTCACGC GCTTCGTCGG CGCGCCGCAG
CGCGAGGCCG CGCTGCTGAG CGCGCTGCTC GATCGCGCGC GCGGCGAGGG CCGCGCGCTC
TGCTACCTGG ACATGTCGAA CCGGCGCAAG CCCGACATCG AGCGCGAGTG CGGGCTGCTG
TCGACGCCGC TCGGCGTCGT GCAGACGATC GACGACATTC GCCGCTTCAC GCTCGACGGG
CCGCGGATGC GCAAGCTGCG TTACCTCGTG TCGAAGTTCG GGCGCGACCC GTCGTGCCGC
GTCGTCGAAT ACACGGAGCC GGACCCCGCC GTCGACGGCG AGATCCGCGC GGTGATGGCG
GCCTGGAGCC GCGAGAAAGG CGTCGTCAAC AAGGTCGATG CGATTCTCGC GGACATGCGC
GTCGGCAATC TGCTCAAGCG CTACCGCGTC TATCTGACCT ATCTCGGCGA CCGGCTGCAG
AACGTCGTCG TGCTGAGCCA CATCGGCGAC GGCTATATCA CCGATCAAGA GTATTTCGTG
CCGGACATGC CGCTCGGCGG GACCGAGTAT GCATACGCGA CGATCATCGA ACGGCTCGCC
GCCGAGGGGC ACCGCAAATT CAGTCTCGGG TTGACGTGGG GCCTGTTCGA GCCCGAGGCG
GGCTTCAGCG ACGCCGAGGG CTGGGCGCTC GTCAACCGCA CGGAAGGGCA GCTCGCGCAG
ATCTTCCGGC GCGGCGTGCA GAACCATCAG TACAAGAACA AGTACTGCCC GGCGGAATAT
CCGCTGTACC TGTACCGAAG CGCCGACAGC CGGCCGCAGA TCATCAAGCA GTGCATGGGG
CAGTTCTTTC GCAACGGGGT GCCGTACGAC GAAATCGCGC GCCAGATCGA GGCGGACGAC
GCGCGGGCCT TGGCGGCGGC CGGCGCCGTG CCGCCGCGCG CGACGCACGG CGGCGACGAG
GCCGAAGCGC GTTCGGCCCC CGGCGGCGCG CCCGACGATG CGCGCGCCGG CGCGCCCGAC
GATGCATCCT TCGACGCACC CGACGAAGCG CCCATCGAGG TATCCGACCA CGCACCCGGC
AGCACGCCGC CCCCGCGCGC CGCGCGCGAC GAGCGCGGCG GCGAGCGCGA CGACGGCAAG
GGCGGCCAAG CCGGCCAAGC CGACAGAAGC GGCAAGCACG ACCCAGCCGA CGCCCACGGC
CGTGCCGCGT CCGGCACGCC GGACGCGGCG CGAGCGCCGG CGCCCGCCGA CATTCCCGAC
GCGTTCTTCG ATGCGACGCA GGCCGATCCG AACGCGATCC GGCTCGACCT CGTCAGCGAT
TCGTGGGCCC ATCTCGGCTA CCCGTTCATT CGCGAACGGG CGCGCCGGCT GCTTGCCGGC
CTCGCGTCGC CGCACGCCGA TCCGGCCGCG CCGTGCGGCC TTTTCGGCGT CGATCACTGC
GTGCTGACGA CTTCCGGGCG CAATGCGGAG CGGGTGTTCT TCAATCTGTT TCCGGCCAAG
CGCAAGACGA TCCTGCAAAA CATCCCGTTC TTCTCGACCC GGCACAACAC GGCGAAAGCG
GGATTCGCCT CGGTCGAGAT TCCCGATCCG CGGATCTTCG ATCCGGATTG CCGCGAAATA
TTCCGCGGCG GCATCGATTT CGCGCGCCTG CGCGAGCAAC TGGAGGCGCG GCCGGACGGC
GTCGCGATGG TGCTGATCGA GCTGTGCAAC AACGCGAGCG GCGGCTATCC GGTGCCGCTC
GCGCAGATCG CCGACGTATC CGCGCTGTGC CGCGCGCGCG GCGTGCCGTT CGTGATGGAC
GTCACGCGCA TCGTCAGGAA CGCGGAGCTG ATCCGGCGTC ACGAGCCCGG CTGCGCGAAC
GTCGGGCTCT GGGACACCGT CGCCCGGATC GTCGCCCATG CGGACGTCGT ATTCGGCAGC
CTCTGCAAGG ATTTCGGCGT GAGCGCGGGC GGCATCGTCG CGGCGAACGA CGGGCGGCTG
ATCGGCAAGG CGCGGCGCTA CGCGGAAATC GAGGGCGCGC TGCTCGACCA CGTGCAGACG
CAGGTGGTGT GCGCGTCGCT CGGCGAGCGC GACGCGCTCG AGCGGGGCGT CGCGGCGCAG
CTCGATGTCG CGCGGCGCGT GAGCGACGCG CTCGATGCGC GGCGGATTCC GGCGCTGCTG
CCCGTGGTCG GGCATTGCGC GCTGGTGCGC GCGGCCGACA TGCCGGGCTA TGCCGGCCGC
CGGTATCCGC GCGAATCGCT GCTGCGCGCG CTGCTCGAGC GGCACGGCGT GCGCGCCGGC
ATCCATCTCG CGGGCAGCGG CGTGGAGCGC GTCATCGACC GGTGCATCCG CATCGCGCTG
CCGATCGGCC TGGACGACGC GCGGCTCGCA TCCGGGCTCG CCGACGCGCT GGCCGGAACC
GCGCCGGGCG CAACGGATGC GCCCGCCGCG CTGCCCGACC TGCTGCATGC GCGCGCCCCC
GGTGCCGCAG ACACGGCCGA CACCGTCGAC ACGGTCGATA CCGTCGATAC GGCTGATACG
GCTGATACGG CTGATACGGC CGCGCAGGCC GGCGTCCGTC GCGGCGAGGC GCACGCGTCA
CGCGCGCCGA TGCGGGCGAG CGACGACGAT GCGATCGCGA TCGTCGGCAT GGCGGGCCGC
TACCCCGGCG CCGACGATCT GTCCGCGTTC TGGCGCAACC TCGTCGACGG CGTGAACGCG
ATCACGGAAA TCCCGGCCGA GCGCTGGGAC TGGCGCGCGC ATTACCACCC CGATCCCGAG
CAGGCGGCGC GGCTGCGCAA GTCGTACGGC AAGTGGGGCG GCTTTCTCGG CGAGTTCGAC
TGTTTCGATC CGCTGTTCTT CTGGATGGCG CCGCGCCGCA TCGCGATGAT CGATCCGCAG
GAGCGGCTGT TCCTCGAGGA GTGCTGGAAG GCGCTCGAGG ATGCGGGCTA CCCGCCGTCC
CGCCTCGGCG ACGCGCTGCG CGAGCGCACG GGCGTGTTCG GCGGGCTGTC GAAGCACGGC
TTCAGCCTGT ATGCGTCGCA GTATGCGGGC ACCCAGCCGC ATACGTCGCC CGCGTCGATG
GTCGGCCGCG TGTCGCACTT CTTCGATCTG AAGGGCCCGA GCGTGGCGAT CGACAACCAT
TGCGCGTCGT CGCTCGTCGC CGTTCACGAG GCCTGCGAAT ACCTGCGGCG GGGCGACGGC
GATCTCGCGA TCGCGGGCGG CGTCAGCCTG TGCCTGCACC CGTCGAGCTA TGTGCAGCTC
TCGCTCGTGC GGATGCTCTC GCGCGACGCG CACTGCGCGG CGTTCGACGA GGGCGGCGCG
GGCTACGTGC CGGGCGAGGG GGTGGGCGTC GTCGTGCTCA AGCGGCTCGC GCAGGCGCGC
GCGCACGGCG ATCCGATCCA CGCGGTGATC CGCTCCGGCG CGGTCAATCA CAACGGCCGC
ATGCGCTACT ACGGCCAGCC CGATCAGGCG GGCCAGCAGG CCGCCATCCG GGCCGCGCTC
GCGCGCGCGC GGATCGATCC GCGCTCGATC AGCTACATCG AGGCGGCCGC GAGCGGCGTC
GAGACGACGG ACGCGGTCGA GATGGCCGCG CTGACCGAGG TGTTCGGCGA TCGGGCGGGC
GCCGCGGGCG CCTACACGAT CGGCACGGTC AAGCCGGCGA TCGGGCACGG CGAGGCCGCG
TCGGGCATGT CGCAACTGAC GCGCGTCGCG CTGTCGCTCA AGCACGCGAC GCTCACGCCG
ACCCGGCTGC CACGGCGGCC GAGCCCGCTG ATCGATTTCG ATCGGCTGCC GTTCCGGCTC
GCGGCCGAGG CGGCGCCGTG GGCGCCGGTG AGCGTCGACG GCCGGCCGGT GCCGCGGCGC
GCCGGGGTCA CCGCGATCGG CAACGGCGTC AACGCGCATC TGGTGCTCGA GGAATGGCCG
GGCGCGCCCG CCGACGATTC CGCCGCCGCG CCGCGCGAGC CGCAGGTGTT CGTGCTGTCC
GCGCAGGACG GCGAGCGGCT CGCGGCGTAC GTCGAGCGAT GGATCGCGTT CCTCGCGAGC
GGCGCGACGC CCGATTTCGG GCGGATGCTG CGCACGCTGC AGATCGCACG CGAGCCGATG
CCCGCGCGGC TCGCGCTCGT CGCCTCCGAT CGCGACGACT TGCTGCGCGC GTTGCGCGCG
TGGCGCGACG GCGGCGGCGC GTCGTCGCGC GTGCATCGCG GCGACGCCCG CCGGCGCGCC
GGGCAGGCCG CGCTGGCGGA GCAGGCGTGC GATCCGCGCG CGTGCGCGCC CGACGAGGCG
GCCGCGGCCT GGGTGCAAGG GCGCACGGTG CGCTGGGAGG CGCTTCACCG AGGCGGGCCG
TGGCGGCGCG TCGGCGGTCT GCCGGCCTAT CCGTTCGCGC GCGAGCGGTA CTGGATCGCG
GACGCGGCAT CCGGCGCGCC GGCAGGCAGG GAGGAAGCAT CGGCGCGGCC CGATTGA
 
Protein sequence
MQATFNRGEV ALAGDARRTD GDADRAAASP HGFRFVEKAR IADEPGLSAR LAARKGDAHQ 
DGVFLDLWPS IFLSDEHGDH IHVLRDRGLL FVTRFVGAPQ REAALLSALL DRARGEGRAL
CYLDMSNRRK PDIERECGLL STPLGVVQTI DDIRRFTLDG PRMRKLRYLV SKFGRDPSCR
VVEYTEPDPA VDGEIRAVMA AWSREKGVVN KVDAILADMR VGNLLKRYRV YLTYLGDRLQ
NVVVLSHIGD GYITDQEYFV PDMPLGGTEY AYATIIERLA AEGHRKFSLG LTWGLFEPEA
GFSDAEGWAL VNRTEGQLAQ IFRRGVQNHQ YKNKYCPAEY PLYLYRSADS RPQIIKQCMG
QFFRNGVPYD EIARQIEADD ARALAAAGAV PPRATHGGDE AEARSAPGGA PDDARAGAPD
DASFDAPDEA PIEVSDHAPG STPPPRAARD ERGGERDDGK GGQAGQADRS GKHDPADAHG
RAASGTPDAA RAPAPADIPD AFFDATQADP NAIRLDLVSD SWAHLGYPFI RERARRLLAG
LASPHADPAA PCGLFGVDHC VLTTSGRNAE RVFFNLFPAK RKTILQNIPF FSTRHNTAKA
GFASVEIPDP RIFDPDCREI FRGGIDFARL REQLEARPDG VAMVLIELCN NASGGYPVPL
AQIADVSALC RARGVPFVMD VTRIVRNAEL IRRHEPGCAN VGLWDTVARI VAHADVVFGS
LCKDFGVSAG GIVAANDGRL IGKARRYAEI EGALLDHVQT QVVCASLGER DALERGVAAQ
LDVARRVSDA LDARRIPALL PVVGHCALVR AADMPGYAGR RYPRESLLRA LLERHGVRAG
IHLAGSGVER VIDRCIRIAL PIGLDDARLA SGLADALAGT APGATDAPAA LPDLLHARAP
GAADTADTVD TVDTVDTADT ADTADTAAQA GVRRGEAHAS RAPMRASDDD AIAIVGMAGR
YPGADDLSAF WRNLVDGVNA ITEIPAERWD WRAHYHPDPE QAARLRKSYG KWGGFLGEFD
CFDPLFFWMA PRRIAMIDPQ ERLFLEECWK ALEDAGYPPS RLGDALRERT GVFGGLSKHG
FSLYASQYAG TQPHTSPASM VGRVSHFFDL KGPSVAIDNH CASSLVAVHE ACEYLRRGDG
DLAIAGGVSL CLHPSSYVQL SLVRMLSRDA HCAAFDEGGA GYVPGEGVGV VVLKRLAQAR
AHGDPIHAVI RSGAVNHNGR MRYYGQPDQA GQQAAIRAAL ARARIDPRSI SYIEAAASGV
ETTDAVEMAA LTEVFGDRAG AAGAYTIGTV KPAIGHGEAA SGMSQLTRVA LSLKHATLTP
TRLPRRPSPL IDFDRLPFRL AAEAAPWAPV SVDGRPVPRR AGVTAIGNGV NAHLVLEEWP
GAPADDSAAA PREPQVFVLS AQDGERLAAY VERWIAFLAS GATPDFGRML RTLQIAREPM
PARLALVASD RDDLLRALRA WRDGGGASSR VHRGDARRRA GQAALAEQAC DPRACAPDEA
AAAWVQGRTV RWEALHRGGP WRRVGGLPAY PFARERYWIA DAASGAPAGR EEASARPD