Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1642 |
Symbol | |
ID | 4886297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1571153 |
End bp | 1572064 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640131581 |
Product | thioesterase domain-containing protein |
Protein accession | YP_001062638 |
Protein GI | 126444062 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3208] Predicted thioesterase involved in non-ribosomal peptide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGCA TCCCCGTTAG ACTCGATCGG TTCGCGCCGG CGGCCGACCC GGCCGACCCG GCCGCTCCGG CCGATTTGGC CGATTTGGCC GATTTGGCCG ATTTGGCCGC CGCCACGGAC GGCGCCCCGG CGGCGGCGTT TCCCGCCACG GCCGCGAAGG CCGCGATCCG CACGCTGGTT GCGCCGTCGC CCGGCGGCGC GCACGTGCTG TGCATCCCGT GGGCGGGCGC GAGCATCGAG CGCTTCTTCG CGTGGAAGCG GCACCTGCCG GCGGGCGTCG GCTTGTCGGG CGTCCAGTTG CCGGGGCGCG GCGCGCGCGC GCACGAGCCG TCGCCGACGG ATCTCGTCGC GCTGCTCGAC GAGATCGCGG CCGAGTACCT GGCGCTGCGC GACCCGCCGC GCATCCTGTT CGGGCACAGC TTCGGCGCGC TGATCGCGTT CGAGATCGCG ACGCGCGTGA GCCGGGCGGC CGGCCAGGCG GGCGGCGATC CGTTCGTGCG GCTCGTCGTG TCGGGGCTGT CCGCGCCGAG CGTGATCGCC GAGGAAGAGC GCATCGCCCA TCTCGACGAC GACGCGTTCA GCGCGCAAGT GCACGCGCTC AACGGCATGC CGCCCGAAAT CGCGCGCAGC CCCGAATCGC TGCGCTATTT CATGACCGTC CTGCGCTGCG ATTTCCGGCT GTACGAATCC TATGCGTTCG ACGCGAATCG CCCGCCGCTG CGCTGCCCGA TCGCCGTGTG CTCCGGCCGC GACGATCCGA GCGTGTCGGA GCACGGGCTG AACGCGTGGG CGGCGCTCAC GACGGGCGAG TGCCGGCGTC ACGATTTCGA CGGCGATCAT TTCTTCATCG CGGACCACGC GGCCGCGATG CTGGGCCTCG CGCTGGACGC CGAAAGCACG CTCGCCGGCT GA
|
Protein sequence | MTSIPVRLDR FAPAADPADP AAPADLADLA DLADLAAATD GAPAAAFPAT AAKAAIRTLV APSPGGAHVL CIPWAGASIE RFFAWKRHLP AGVGLSGVQL PGRGARAHEP SPTDLVALLD EIAAEYLALR DPPRILFGHS FGALIAFEIA TRVSRAAGQA GGDPFVRLVV SGLSAPSVIA EEERIAHLDD DAFSAQVHAL NGMPPEIARS PESLRYFMTV LRCDFRLYES YAFDANRPPL RCPIAVCSGR DDPSVSEHGL NAWAALTTGE CRRHDFDGDH FFIADHAAAM LGLALDAEST LAG
|
| |