Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0204 |
Symbol | |
ID | 3845561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 239300 |
End bp | 242251 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637837510 |
Product | peptide synthetase, putative |
Protein accession | YP_438406 |
Protein GI | 83716490 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCATTCG GCCGCCGATT CGCGTTCGCA GGGATCGGCG GCCAGTCAGG TAGGGATTCA CCCGCAATCG TCAATCGATG CGTCCGCGCG GACACGCCCG GACGTTTGTT CCCACTTGCA CCATCAGCAT GGAGTGCCTG CACCATGACG CCATCCACTC TCGATTCGCC GCGCGTTTCC GAACACGAGT CGCGCGCCTC TTCGCCGCAA AACATCGTCG ACCTGCTGTT GCGGGCCGCA CGGCTGCATC CGCATACGGG CGTGCGCTTC ATCGCCGCGC AAGCCGAGGA AAAGGGCGCC TTCGTCACGT ATCCGGAGCT GCTCGACGAG GCGCGCCGCA TCCTGGGCGG CATGCGGGCC CGCGGTTACC GGTCCGGCAT GAAGGTCGCG CTGCTGCTCG AGCACGCGAG CGATTTCATT CCGGCGTTCT GGGCGTGCGC GCTCGGCGGA TTCGTGCCGT GTCCGCTCGT GCCGATTCGC AACGATTCCG AGCGTTGGGC GAAGCATCTC GCACACGTGG ACGCGCTGCT CGACCGCCCG CTGCTCATCA CCACCGAAGC GCTGAAGAGC GATCTGCCGG GCGGCGCGCT CGCCGTCAAC CTGAACGCGC TGCGCGCGAG CTTGCCCGAC GAGTCGGTGC ACGCCGCGCA GCCGTCCGAG CCGGCCGTCT TCGTGCTCAC GTCGGGCTCC ACCGGCAATT CGAAGGCGGT CGTGCTCACG CATGGCAACC TGCTCGCGTC GATGGCCGGC AAGAACGAGC GGCAGCAGCT CGCCGGCGCG GACGTCACGC TCAACTGGAT CTCGTTCGAT CACGTCGCCG CGCTGCTCGA AGCGCACCTG CTGCCGCTCT ACGTCGGCGC CGTGCAACTG CATGTCGAAT CAGCGGCGAT CCTGACCGAT CCGCTGCGCC TGCTGCGGCT CGTGAGCCGC TACCGCGTCA CGATGACCTT CTCGCCGAAC TTCCTGTTCG GCCAGCTGAA CGCCGCGCTC GAAGCGATGG GCGACGAGGC GCTCGCGGCA TGGCGCCGCT CGGTGGATCT CTCGTCGCTG CGGCACGTCG TGTCGGGCGG CGAGGCGATC GTCGTCGCGA CCGGGCAGCG CTTTCTCGAT CTGCTCGCGC CGTGCGGCCT CGCGCCCGAC GCGCTGTGGC CCGCGTTCGG GATGACCGAG ACGTGCGCCG GCTCCGTGTA TTCGCGCGAA TTTCCGGCGG GCGACGCGGG CCGCGAGTTC GCGTCGCTCG GCCTGCCGGT GGCCGGGCTG CAGATGCGTA TCGCGGACGA CCGCAACGAC GTGCTGCCGG ACGGCGAGGC GGGCGAGTTT CAGGTGCGCG GCCCGATGAT CTTCCAGCAC TATCACAACA ACGCCGAGGC GACGCGCGCG GCGTTCACGA GCGACGGCTG GTTCCGCACG GGCGATCTCG GGCGCATCGA GCGCGGCCGG CTGTGGCTCG TGGGGCGCAG CAAGGACAGC ATCATCGTCA ACGGCGTCAA CTATTTCAGC CATGAACTGG AGACGACGCT CGAGGCGCTC GACGGCATCA AGCGCTCGTT CGTCGCGGCG TTTCCGACGC GCGGCGCCGG CGACGAATCC GAGCAGCTCG TCGTGACGTT CACGCCGTCG TTTCCGCTCG ACGACGAGGA CGCGCTGTAT CGCGTCATCA TCGCGATCCG CAACAGCACG ATCCTGCTGT GGGGTTTCCG GCCCGCGCTG ATCCTGCCGC TGCCGGAGGA CGAGTTTCCG AAGACGAGCC TCGGCAAGAC GCAGCGAGCG ATCATGCGCA AGCGTCTGGA GGCGGGCGGC TATGACGGCT GCAGGGCGTT CGTCGCCGAT CTCGCGAACC GGCAGATGGG CGGCTACGTC GCGCCCGACG GCGAGACCGA AGCCGCGGTC GCCGCGATTT TCGCGGAGAT GTTCCGGGTC GCGCCCGACG CGATCAGCGC GACCGCGAGC TTCTTCGATC TCGGCGGCAC GTCGCTCGAC ATCCTGAAGC TCAAGCGCCA CGTCGAGCAG CGGCTCGGCG TGGTCGACCT GCCGATCGTG ACGATTCTCC AGAATCCGAC CGTGCGCGCG CTGGCCGCGC GCCTCGCGTC GGGCGAGCGC GTGACGGCGG GCGAGTACGA CCCGGCCGTG CCGCTTCAGC TCACGGGCGG CAAGACGCCG CTCTTCTGCG TGCATCCCGG CGTCGGCGAG GTGCTCGTGT TCGTCAATCT CGCGAAGTAT TTCGTCAACG AGCGTCCGTT CTACGCGCTG CGCGCGCGCG GCTTCAACGA AGGGGAGACG TATTTCTCCA GCTTCGACGA GATGGTGAGC ACGTATGTCG ACGCGATCCG CAAGCGGCAG CCGCACGGGC CGTACGCGGT GGCCGGCTAT TCGTACGGCG GCGCGGTCGC GTTCGAGATC GCGAAGGTGC TCGAATCGCA GGGCGAGCGC GTGGATTTCG TCGGCAGCTT CAATCTGCCG CCGCACATCA AGTACCGGAT GGACGAGCTC GACGAGGTGG AAGGCGCGGT CAACCTCGCG TTCTTCCTGT CGCTGATCGA CAAGCAGCAG TCGCTGACGC TGCCGCCGCA ACTGCGCGCC GCGATGCCGG AGCAAGACCC GCTCGCATAC CTGATCGACC ACGCGCCGCC CGCGCGGCTC GCCGAGCTCG ACCTCGATCT GCCGAAATTC CGCGCGTGGG CGGGACTCGC GCAATCGCTG CTGACGCTTG GTCGTTCGTA CGCGCCGTCG GGCAGCGTGC GGGCGATGTC GATCTTCTAC GCGATCCCGC TGCGCGGCAC GAAGGACGAC TGGCTGAACA ACGAACTACG CCGGTGGGAC GAGTTCACGC GCGAGCCGAA TCGCTACATC GACGTCGCGG GCGAGCACTA CACGCTGATG GGGCCCGCGC ACGTCGCGAC GTTCCAGGCG GTGCTGCGGG CCGAGCTCGA TCGCGCGCTC GGCGGCAAAT GA
|
Protein sequence | MSFGRRFAFA GIGGQSGRDS PAIVNRCVRA DTPGRLFPLA PSAWSACTMT PSTLDSPRVS EHESRASSPQ NIVDLLLRAA RLHPHTGVRF IAAQAEEKGA FVTYPELLDE ARRILGGMRA RGYRSGMKVA LLLEHASDFI PAFWACALGG FVPCPLVPIR NDSERWAKHL AHVDALLDRP LLITTEALKS DLPGGALAVN LNALRASLPD ESVHAAQPSE PAVFVLTSGS TGNSKAVVLT HGNLLASMAG KNERQQLAGA DVTLNWISFD HVAALLEAHL LPLYVGAVQL HVESAAILTD PLRLLRLVSR YRVTMTFSPN FLFGQLNAAL EAMGDEALAA WRRSVDLSSL RHVVSGGEAI VVATGQRFLD LLAPCGLAPD ALWPAFGMTE TCAGSVYSRE FPAGDAGREF ASLGLPVAGL QMRIADDRND VLPDGEAGEF QVRGPMIFQH YHNNAEATRA AFTSDGWFRT GDLGRIERGR LWLVGRSKDS IIVNGVNYFS HELETTLEAL DGIKRSFVAA FPTRGAGDES EQLVVTFTPS FPLDDEDALY RVIIAIRNST ILLWGFRPAL ILPLPEDEFP KTSLGKTQRA IMRKRLEAGG YDGCRAFVAD LANRQMGGYV APDGETEAAV AAIFAEMFRV APDAISATAS FFDLGGTSLD ILKLKRHVEQ RLGVVDLPIV TILQNPTVRA LAARLASGER VTAGEYDPAV PLQLTGGKTP LFCVHPGVGE VLVFVNLAKY FVNERPFYAL RARGFNEGET YFSSFDEMVS TYVDAIRKRQ PHGPYAVAGY SYGGAVAFEI AKVLESQGER VDFVGSFNLP PHIKYRMDEL DEVEGAVNLA FFLSLIDKQQ SLTLPPQLRA AMPEQDPLAY LIDHAPPARL AELDLDLPKF RAWAGLAQSL LTLGRSYAPS GSVRAMSIFY AIPLRGTKDD WLNNELRRWD EFTREPNRYI DVAGEHYTLM GPAHVATFQA VLRAELDRAL GGK
|
| |