Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1675 |
Symbol | |
ID | 4887188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1617640 |
End bp | 1621701 |
Gene Length | 4062 bp |
Protein Length | 1353 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640131613 |
Product | CtaG |
Protein accession | YP_001062670 |
Protein GI | 126442826 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGA CGAGCGAGAC CGCGCTGCAC GGCTTCCGGC TGTCGCCCGT GCAGCAGCGT GTGTGGACCT TGACGGACGC GGGGCGGCTG CCGGCCGCGC AACGGGTGGT CGCACGCTTT GCGTGGACGG CCGCGCCGCC GTCGCGTGCG GCGCTCGAGG CCGCGCTGAA CGCGCTGATC GGCCGCCACG AGATCCTGCG CGCGCGCTTC GCGCGATTGC CGGAGATGCG CCTGCCGCTG CAAACCGCGC CGGACGGCGC GCGCCTGCGG CTCGACGGCG CATGCGCGCC CGACGTGCCC GGCACGCCGG ACACGCCCGA CATGCCCGAC ATGCCCGACG CGGGTGCGCC GGCGCTGCAC GCGGCGATGG ACGCAACGGG CCTCGTGCTG TCGCTGCCGG CGCTGTGCGC CGACGGACCG ACGATGCTGC ACCTCGGCGC CGCGCTCGCC GCGCGGCTCG CCGGCGACGG CGCGCGTGCC GATGCCGACG GCCAGGCGCC GCTGCAATTC TTCGATCTCG CGCAGTGGCA GCACGATCTG CTCGCCGAGG AGGGCGCGTG GCGCTGGCCC GACGGCGCGC TGCAGTCCGC GCACGGGCCG TGGCCCGCGC TGCCGGCGCT GCGCTCCGCG CGCGCGGCCG CGTACCGCCC GGCGCGGCAC GCGTGCGCGC TGCCCGCGGA TACCGGCGCG CGCGTCGGCG CGCTTTGCGC GCGCCTCGGC GTGACGCCGG CCGATCTACT GTGCGCCGTC TGGGCGGTGC TGCTGCACCG ACTGTCGGGC GGCGACACGC TGCCGGTGGC GATCGTTGCC GACGGTCGGC CGTTCGACGA GCTCGCCACG GCGCTCGGCC CGCTCGCCGC GCCCGTGCCG CTTTCGCTGT CGGCCGCGCT GGGCGCGCCG GACGGCACGT TCGCCGGCCT CGCGAAAGCG GCGGCCGACG CGCGCGCGAC GGCGGCCGAT CTGCGTCATC ATCTCGTCCC GCCCGACGAG GCGCGCTTCG CGCCGTTCCA GATCGAAGTG CTCGACGCGG GGCGGGCGGA CGACGCGCTG CTGCGCGTCG TCGATGTGGC GACGGCGGGC GAGCCGGCGT TGCTGAAGCT CTCGGTGCTC GTCGGGCCCG GCGGCGCGCT GCGCGCGGAT ATCGACCACG ATGCCGGCGC GGTGCGCGAC GGCGGTGCGA TCGCGCGGCT GGCGGAGCAG TTCGCGACGC TGCTCGACGC GGTGCTCGCG AACCCGGACG CGACGTTCGA CGAGCCGGAC CTGCTGGGCG CGGCGGAGCG GCGGCTCGTC ACGGAGGATT TTCAGGGGCC GCAGGTGAAT CACGGCCGTT GGACGCCCGT GCATCTGGCG GTGGCCGCGG CCGCCGCCGC GGCGCCCGAT CGGCTCGCCG TGCAGGACGG CGCCGCGCGG CTGACGGCGG CGGCTCTCGA GCGCGCCGTT CGTGCGCTCG CGGCGCGGCT GACCGCGGCG GGCGTCGCGG CGCAGACGCT TGTCGCGCTG CATCTGCCGC GCGGCGCGGC GCTCGTGACG GCGATGCTCG CCGTGATGCG GGCGGGCGGC GTTTTTCTGC CGATGCCGCC GGAGCTGCCC GCCGCGCGCC GGCGTTACAT GCTCGAGGAC AGCGGCGCGC GGATCGTGCT GACGCTGCCG GACGCCGCGG ACGATCTGCC GCGGGATCTC GCGCTCGTGT GCGTGTACCC GGGGGAGGCC GATGTCGATG TCGATGTCGA CGTCGAGGCC GAAGCCGATG TCGACATCGA CGCGCGCACG GCGGCATCCG GACCGACCGC GCGCGTCACG GCCGACGCGC CTGCCGACGC GGGCGCGGAT GCGACGACCG GCGCTTGCCC CGACGCGTGG CCTGCCCCCG ACCCGACGCA GGCCGCCTAC GTGCTGTACA CGTCCGGCTC GACGGGGCGG CCGAAAGGCG TCGTGGTCAC GCACGGCTCG CTCGCGAACC ACATGGCGTG GATGACGCGC GCGTTCCCGC TCGACGCGCA CGACGCGGTG CTGCAAAAGA CCTCGGCCGC GTTCGACGCG TCGATCTGGG AATTCTTCCT GCCGCTGCTG GCGGGCGCGC GGCTCGTGAT GGCGCCGCCC GGCCTCGAGC GCGACGTGCC GGCGCTCGTC GCGACGCTCG CGCGCGAGCG CATCACCGTG CTGCAGCTCG TGCCGAGCCT GCTGCGCGTG CTCGTCGACG CGCCGGGCTT CGGCGCGTGC GACGCGCTGC GATGCGTGTT CTGCGGCGGC GAAGCGCTGA CGGCCGATCT CGCCCGGCGC TTCGCCGCCG CGCACCGCGC GGCGCTCGTC AACCTGTACG GCCCGACCGA GACGACGATC CAGGTTTGCG CGGAGCGGGT CGACGCGGCC GACGATCCCG TGCCGGTCGG CCGCCCGATC GACAACGTCC GGCTGTACGT CGTCGATTCG CGCAACCGGC TCTCGCCCGT GGGCGTGCGC GGCGAGATCC TGATCGGCGG CGCGGCGCCC GCGCGCGGCT ATCTCGATCG GCCGGCGCTC ACCGCCGCGC GCTTCGTCGC CGATCCGATC GATCCGCGCG CGCCGCGCGT CTATCGCAGC GGCGACGTGG GCGCGTGGCG CGCCGACGGC CGGCTCGACT TTTTCGGCCG CGCCGACGAT CAGGTGAAAC TGCGCGGCTA CCGGATCGAG CTCGGCGAAG TGGAGGCGAC GATCGCCCGG CATCCGGACG TCGCCAACGC GGCGGCGAGC GTCGATCTGG ACGCGAACGG CATCGCGCGG CTCGTCTGCG CGTACGACTG CCGCGCCGGC CGCGGCGTCG AGCCCGCGCC GCTGCGCGCG TGGCTCGCGA CGCAGTTGCC GGACTACATG GTTCCCGGCC GGTGCCGCCG GCTCGACGCG CTGCCGCGCA ACGCGAGCGG CAAGATCGAC CGCGCGGCGC TGGCGCGCGG CGTCGACGCG CCGCGTGACG GCGCCGCGCC GCGCGATCCG GTCGAGCTGC GGCTCGAGCG CGTGTGGGAG GCGGTGCTCG ACGTTCAGCC GGTCGGCGTC GATCGGACGT TCTTCGATCT CGGCGGCCAT TCACTGCTGG CCGTGCGGCT GATGGCGGAA GTCAAGCGCG AGTTCGGCTG CGATCTGCCG CTTGCGTCGC TGTTCGAGGC GCCCACCGTC GCCGCGCAGG CGGCGCTCAT CCGGCAGCGC GAGCGGGCGC ACCCGGTGGT GGTGCGCGTC AATCGCGGCA TCGACGGCGA GCGCCCGGTG TTCCTCGTGC ACCCGACGGG CGGCAACGTG CTTTGCTACC GCGATCTCGC GCGGCGGCTC GGGCCGGCCC GGCCGATCTA CGCGCTGCAG GACCCGGGCC TCGAGGGCGA CGCCGGCTAC GACAGCGTCG AGGAGCTGGC GGCGCGCCAT ATCGCGCACA TCCGCCCGCT CGCGGGCGAC GGCCCGTATT ACCTCGCGGG CTGGTCGTCG GGCGGCGTCG TCGCGTTCGA GATCGCGCGC CAGTTGCTCG CGCAGCGGTG CGAGGTCGGG CTGCTCGCGT TGATCGACAG CGTCGCGTCC GACGGCGCGC AGCCGGCCCC GCGCACCGAC GCCGAGCTGA TCGGCTCGAT CGGCCGGCTG CTCGCGTTCG CGGCGGGCGT CGACGCGCCG GATCTGGCGG CGCTCGAACC GTCGGCCGCG ATGGCGCGGC TGCGCGAGCT GGCCGTCGCG GCCGGCTCGC TGCCGCCCGA TGCGCCGCCC GAGCGGATGC GCCGGCTTTT CGACGTCTTC CGCCGCAACG CCGCCGCCGT GCGCCGCTAC CGGCCCGGGC CGTATCCGCG CCGCGTGCTG CTGCTGCGCG CAACGCAGCC GCTGCCCGAA CCGGTGCGCG ACGCCGCCGC GCGCCAGCGC GGCGATTCGC CGGAGCTGGG CTGGGAGCGC GTCGCGGTGG TGAGCCGCTG CGACATCCCC GCGCATCATC TGTCGATCGT CCACGAGCCG GCCGCGGCGC TCGTCGGCGC GCGGATTCGC GACGCGCTGC ACGCGGCGGA CCGCATCGAG GCGATCGGCG AGCGGGTTTT CTTCACGTTG CTCGGACACT GA
|
Protein sequence | MSATSETALH GFRLSPVQQR VWTLTDAGRL PAAQRVVARF AWTAAPPSRA ALEAALNALI GRHEILRARF ARLPEMRLPL QTAPDGARLR LDGACAPDVP GTPDTPDMPD MPDAGAPALH AAMDATGLVL SLPALCADGP TMLHLGAALA ARLAGDGARA DADGQAPLQF FDLAQWQHDL LAEEGAWRWP DGALQSAHGP WPALPALRSA RAAAYRPARH ACALPADTGA RVGALCARLG VTPADLLCAV WAVLLHRLSG GDTLPVAIVA DGRPFDELAT ALGPLAAPVP LSLSAALGAP DGTFAGLAKA AADARATAAD LRHHLVPPDE ARFAPFQIEV LDAGRADDAL LRVVDVATAG EPALLKLSVL VGPGGALRAD IDHDAGAVRD GGAIARLAEQ FATLLDAVLA NPDATFDEPD LLGAAERRLV TEDFQGPQVN HGRWTPVHLA VAAAAAAAPD RLAVQDGAAR LTAAALERAV RALAARLTAA GVAAQTLVAL HLPRGAALVT AMLAVMRAGG VFLPMPPELP AARRRYMLED SGARIVLTLP DAADDLPRDL ALVCVYPGEA DVDVDVDVEA EADVDIDART AASGPTARVT ADAPADAGAD ATTGACPDAW PAPDPTQAAY VLYTSGSTGR PKGVVVTHGS LANHMAWMTR AFPLDAHDAV LQKTSAAFDA SIWEFFLPLL AGARLVMAPP GLERDVPALV ATLARERITV LQLVPSLLRV LVDAPGFGAC DALRCVFCGG EALTADLARR FAAAHRAALV NLYGPTETTI QVCAERVDAA DDPVPVGRPI DNVRLYVVDS RNRLSPVGVR GEILIGGAAP ARGYLDRPAL TAARFVADPI DPRAPRVYRS GDVGAWRADG RLDFFGRADD QVKLRGYRIE LGEVEATIAR HPDVANAAAS VDLDANGIAR LVCAYDCRAG RGVEPAPLRA WLATQLPDYM VPGRCRRLDA LPRNASGKID RAALARGVDA PRDGAAPRDP VELRLERVWE AVLDVQPVGV DRTFFDLGGH SLLAVRLMAE VKREFGCDLP LASLFEAPTV AAQAALIRQR ERAHPVVVRV NRGIDGERPV FLVHPTGGNV LCYRDLARRL GPARPIYALQ DPGLEGDAGY DSVEELAARH IAHIRPLAGD GPYYLAGWSS GGVVAFEIAR QLLAQRCEVG LLALIDSVAS DGAQPAPRTD AELIGSIGRL LAFAAGVDAP DLAALEPSAA MARLRELAVA AGSLPPDAPP ERMRRLFDVF RRNAAAVRRY RPGPYPRRVL LLRATQPLPE PVRDAAARQR GDSPELGWER VAVVSRCDIP AHHLSIVHEP AAALVGARIR DALHAADRIE AIGERVFFTL LGH
|
| |