Gene BURPS1710b_A1644 details

Gene Information

Locus tag: BURPS1710b_A1644
Symbol:
ID: 3692719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2006482
End bp: 2009523
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 68%
IMG OID: 637731898
Product: putative peptide synthase protein
Protein accession: YP_336801
Protein GI: 76818767
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
        [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases
TIGRFAM ID:
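
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon (consistent with "Is gene spliced: No"). The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 2006482
END_BP = 2009523
GENE_LENGTH_BP = 3042
PROTEIN_LENGTH_AA = 1013


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))           # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP)) # compare with Protein Length
```

Under these assumptions the span from start to end gives 3042 bp, and 3042 / 3 - 1 gives 1013 residues, matching the reported gene and protein lengths.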


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGAGC GCAGCGGCGC GCGTGATCGG ATTCGCTCGA ACAATCCCGC TTTTTACCGG 
GCAGGCCGCC GGTTCGTTTC ATGCAGTCGA GTGTCAATCG GCCGTCGACG CTCGCGAGAG
CGCAACGGCG GCCAGTCAGG TAGGGATTCA CCCGCGATCG TCAACCGACG CGTTCGGGCG
CGCGTGCCCG GACGTTCGTT CCCACTTGCA CCATCAGCAT GGAGTACCTG CACCATGACG
GCATCCACCC TCGATTTACC GCGCGATTGC GAACACGCAT TGCGCGCCGC TTCGCCGCCA
AACATCGTCG ACCTGCTGTT GCGGGCCGCA CGGCTGCATC CGCATACGGG CGTGCGCTTC
ATCGCCGCCG AATCCGAACA CAAGGGCGCC TTCGTCACAT ATCCCGAGCT GCTCGACGAG
GCGCGCCGCA TCCTGGGCGG CCTGCGCGCG CGCGGCTATC GGTCCGGCAT GAAGGTCGCG
CTGCTGCTCG AGCACGCGAG CGATTTCATT CCGGCGTTCT GGGCCTGCGC GCTCGGCGGC
TTCGTGCCGT GCCCGCTCGT GCCGATCCGC AACGATCCCG AGCGCTGGGC GAAGCACCTC
GCGCACGTCG ATACGCTGCT CGACCATCCG CTGCTCGTCA CCACCGAAGC GCTGAACAAC
GATCTGCCGG GCGGCGCGTC GGCCGTCAAC CTGAACGCGC TGCGCGCGAG CCTGCCCGAT
GCGTCGACGC ACGTCGCGCA ACCGTCGGAC CCGGCGGTTT TCGTGCTCAC GTCGGGCTCC
ACCGGCAATT CGAAGGCGGT CGTGCTCACG CACGGCAACC TGCTCGCGTC GATGGCGGGC
AAGAACGATC GGCAGCAGCT CGCGGGCGCG GACGTCACGC TCAACTGGAT CTCGTTCGAC
CACGTCGCCG CGCTGCTCGA AGCGCACCTG CTGCCGCTGT ACGTCGGCGC CGTGCAGCTT
CACGTCGAAG CCGCGGCGGT TCTCACCGAT CCGCTGCGCT TCTTGCGGCT CGTCAGCCGC
TATCGCGTGA CGATGACGTT CTCGCCGAAC TTCCTGTTCG GGCAACTGAA CGCCGCGCTC
GAAGCGATGG GCGACGAGGC GCTCGCCGCG TGGCGCGGCG CGGTGGATCT GTCGTCGCTG
CGGCACGTCG TGTCGGGCGG CGAGGCGATC GTTGTCGCGA CCGGGCAGCG CTTTCTCGAT
CTGCTCGCGC CGTGCGGCCT CGCGCGCGAT GCGCTGTGGC CCGCGTTCGG GATGACGGAG
ACGTGCGCCG GCTCCGTGTA TTCGCGCGAG TTCCCGGAAG GCGACGCGGG CCGCGAGTTC
GCATCGCTCG GCCTGCCGGT GGCCGGGCTG CAGATGCGCA TCGCGGACGA CCGCAACAAC
GTGCTGCCGG AAGGCGAGGC GGGCGAGTTC CAGGTGCGCG GCCCGATGAT CTTCCAGCGC
TATCACAACA ATGCCGAGGC GACGCGCGCG GCGTTCACGA GCGACGGCTG GTTCCGCACG
GGCGACCTCG GGCGCATCGA GCGCGGCCGG CTGTGGCTCG TCGGCCGCAG CAAGGACAGC
ATCATCGTCA ACGGCGTCAA TTACTTCAGC CACGAGCTGG AGACGACCCT CGAGGCGCTC
GACGGCGTCA AGCCCTCGTT CGTCGCGGCG TTTCCGACGC GCGGGGCCGG CGACGAATCC
GAGCAACTCG TCGTCACGTT CACGCCGTCG TTTCCGCTCG ACGACGAGGA CGCGCTGTAT
CGCCTCGTCA TCGCGATCCG CAACAGCACG ATCCTGCTGT GGGGCTTCCG GCCCGCGCTG
ATCCTGCCGC TGCCGGAGGA CGAATTCCCG AAGACGAGCC TCGGCAAGAC CCAGCGCGCG
ATCATGCGCA AGCGCCTCGA AGCGGGCAGC TACGACGGCT ACAAGGCGCG CGTCGCCGAT
CTCGCGAACC GGCAGATGGG CGGCTATGTC GCGCCCGACG GGCAGGCCGA GGCCGCGGTG
GCCGCGATCT TCGCGCGGAT GTTCCAGCTC GCGCCCGAGG CGATCAGCGC GACCGCGAGC
TTCTTCGATC TCGGCGGCAC GTCGCTCGAC ATCCTGAAGC TCAAGCGCCA GGTCGAACAG
CGGCTCGGCG TGATCGACCT GCCGATCGTG ACGATCCTCC AGAACCCGAG CGTGCGCGCG
CTGGCCGCGC GTCTCGCCCC GGGCGAGCGC GTGACGGCGG GCGAATACGA TCCGGTCGTG
CCGTTGCAGC TCACCGGCGG CAAGACGCCG CTGTTCTGCG TGCACCCCGG CGTCGGCGAG
GTGCTCGTGT TCGTCAACCT CGCGAAGTAC TTCGTCAACG AGCGCCCGTT CTACGCATTG
CGCGCGCGCG GCTTCAACGA AGGGGAGACG TATTTCTCCA GCTTCGACGA AATGGTGAAC
ACGTATGTCG ACGCGATCCG CAAGCGGCAG CCGCACGGGC CGTACGCGGT GGCCGGCTAT
TCGTACGGCG GCGCGGTCGC GTTCGAGATC GCGAAGGTGC TCGAAGCGCA GGGCGAGCGG
GTGGATTTCG TCGGCAGCTT CAATCTGCCG CCGCACATCA AGTACCGGAT GGACGAGCTC
GACGAGGTGG AGGGCGCGGT CAACCTCGCG TTCTTCCTGT CGCTGATCGA CAAGCAACAG
TCGCTCACGC TGCCGCCGCA ACTGCGCGCG GCGATGCCGG AGCAAGACCC GCTCGCGTAC
CTGATCGACC ACGCGCCGCC CGGGCGGCTC GTCGAGCTCG ACCTCGATCT GGCGAAATTC
CGCGCGTGGG CGGGGCTCGC GCAATCGCTG CTCACGCTCG GGCGTTCGTA CGCGCCGTCG
GGCAGCGTGC GGGCGATGTC GATCTTCTAT GCGATTCCGC TGCGCGGCAC GAAGGACGAC
TGGCTGAACA AGGAACTGCG CCGCTGGGAC GAGTTCACGC GCGCGCCGAA CCGCTATATC
GACGTGGCGG GCGAACACTA CACGCTGATG GGGCCCGCGC ACGTCGCGAC GTTCCAGGCG
GTGCTGCGGG CCGAGCTCGA TCGCGCGCTC GGCGGCAAAT GA
 
Protein sequence
MPERSGARDR IRSNNPAFYR AGRRFVSCSR VSIGRRRSRE RNGGQSGRDS PAIVNRRVRA 
RVPGRSFPLA PSAWSTCTMT ASTLDLPRDC EHALRAASPP NIVDLLLRAA RLHPHTGVRF
IAAESEHKGA FVTYPELLDE ARRILGGLRA RGYRSGMKVA LLLEHASDFI PAFWACALGG
FVPCPLVPIR NDPERWAKHL AHVDTLLDHP LLVTTEALNN DLPGGASAVN LNALRASLPD
ASTHVAQPSD PAVFVLTSGS TGNSKAVVLT HGNLLASMAG KNDRQQLAGA DVTLNWISFD
HVAALLEAHL LPLYVGAVQL HVEAAAVLTD PLRFLRLVSR YRVTMTFSPN FLFGQLNAAL
EAMGDEALAA WRGAVDLSSL RHVVSGGEAI VVATGQRFLD LLAPCGLARD ALWPAFGMTE
TCAGSVYSRE FPEGDAGREF ASLGLPVAGL QMRIADDRNN VLPEGEAGEF QVRGPMIFQR
YHNNAEATRA AFTSDGWFRT GDLGRIERGR LWLVGRSKDS IIVNGVNYFS HELETTLEAL
DGVKPSFVAA FPTRGAGDES EQLVVTFTPS FPLDDEDALY RLVIAIRNST ILLWGFRPAL
ILPLPEDEFP KTSLGKTQRA IMRKRLEAGS YDGYKARVAD LANRQMGGYV APDGQAEAAV
AAIFARMFQL APEAISATAS FFDLGGTSLD ILKLKRQVEQ RLGVIDLPIV TILQNPSVRA
LAARLAPGER VTAGEYDPVV PLQLTGGKTP LFCVHPGVGE VLVFVNLAKY FVNERPFYAL
RARGFNEGET YFSSFDEMVN TYVDAIRKRQ PHGPYAVAGY SYGGAVAFEI AKVLEAQGER
VDFVGSFNLP PHIKYRMDEL DEVEGAVNLA FFLSLIDKQQ SLTLPPQLRA AMPEQDPLAY
LIDHAPPGRL VELDLDLAKF RAWAGLAQSL LTLGRSYAPS GSVRAMSIFY AIPLRGTKDD
WLNKELRRWD EFTRAPNRYI DVAGEHYTLM GPAHVATFQA VLRAELDRAL GGK
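
The sequences in this record are printed in blocks of 10 characters, 60 per line, so they need to be cleaned before any length or composition check. The following is a minimal sketch, assuming the sequence has been pasted in as plain text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and how the database itself computes the reported GC content is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demonstration.
    demo = clean_sequence("ATGCCGGAGC GCAGCGGCGC")
    print(len(demo), gc_fraction(demo))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the reported gene length, and `gc_fraction` with the reported GC content.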