Gene Information |
Locus tag | BURPS668_A1791 |
Symbol | |
ID | 4887068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1736998 |
End bp | 1740246 |
Gene Length | 3249 bp |
Protein Length | 1082 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640131729 |
Product | non-ribosomal peptide synthase |
Protein accession | YP_001062786 |
Protein GI | 126445099 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
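The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention explicitly, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1736998
END_BP = 1740246
GENE_LENGTH_BP = 3249
PROTEIN_LENGTH_AA = 1082


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.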
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCATA TCGACACAGG ACTTCGCCTC GCCATGAACA TGCGCGATAC ATCGCTTCGC CCGCTGCTGC CCGCCCAGCG GGAGATCTGG CTCGCCGAGC AGTTGCGCCC GGGCACGGGT GTCTACAACA CGGCGGGCTA CGCCGACATC GTCGGCGCCG TCGATCACGG CGCGCTGCGG GACGCGTTCG GCCACGCGAT GCGCGAGGCC GACGCCGTGC GCGCGACGTT CTCCGCGCAC GGTGACGACG CGTGGCAGAC GCCGCGCGAT GCGCCGCACG CCGACGTGCC GCTCGTCGAC GTGAGCGGCC AGGCCGATCC CGCCGGCGTC GCGCTCGGGT GGATGATGCG CGACATGCGC ACGCCGCCCG ATTTCGCGGC GGGCCCGCTG GTACGCAGCG CGCTCTTGCG GGTGGGGGCG GCGCGCTTTT TCTGGTACGT CTGCGCGCAT CACATCGTCA CCGACGCGTA CGGCACGGCG CTCGTCGTGC AACGCACGGC GCAGCTCTAC GGCGCGCGGG TGCGCGGCGC GGCGCCGCCG CCCGCGTGGT TCGGCACGCT GGACGCGCTC ATCGACGAGG ACCGCGCGTA TCGCGAATCG GCCGCGTTCG CCGACGATCG CGATTACTGG CGCGCCCGCC TGGCGGCGGC GCCGGAGCCG CGCAGCCTGT CGCCCGTCGC GCTGCCGGGG CAGTTCGCGC CGGCCGGCGA CTTTCACCGG CAATGGGGCG AGCTCGACGC CGCGACGAGC GACGGGCTGC GCGCGATGGC GCGCGCCGGC GCACAAGGCA TGCCGCCGCT CGTCGCGGCG ATGATGGCCG CCTATGTGCA TCGGTTCACG CACGAGCGCG ACGTCGTGCT CGGCTTGCCC GTCATGGCGC GGCTAAGCCG CGCCACGCGC AGGACGCCGG GCATGGTCGC GAACGTGCTG CCGCTGCGCT TCGCGTTCGA TCGCGGGACG ACGTTCGCGG CGCTCCACGC GCAATGCGCG GCCGAGATGC GCGCGTCGCT GCGGCACCAG CGCTATCGCG GCGAGGCGAT GCTGCGCGAC GCGCAGCAGA GCCGGCCGGC CGAGCGCCTT CACGCGCAGA ACGTGAACGT GATGGCGTTC GAGCGGGCGC TCGCGTTCGG CGATTGCCCG GCGCGCTCGC ACAGCCTGTC GAACGGCCCC GTCGACGATT TTTCGATCAC CGTCTACGAC GACGGCGCGA GGCAACCGAT CCGCATCGCG TTCGACGCGA ACGCGGCGCG CTATGCAAGC GGCGATCTGG CCGCGCATCG CGAGCGCTTT CTGCGCCTTG GCGTCGCGCT CGCGAACGCG GCGCAGCGGC CGATCGCCGA CGCGCAGTGG CTCACGCCCG CCGAGCGCCG GACGCTGGTC GGCGAACGCG GCACGCCCGC GCCGGATGCG GGCGAGCCCT TCGACACCAT CGCCGCGCGC TTCGCGGCGC GGGCCGCCGA GCGCCCGGAC GCGATCGCGC TCGTCGATCG CGGGCAGCGG ATCACATACG GCGAGCTGAA CGCGCGCGCG AATCGGCTCG CGCACGTGCT GATCGAGGCG GGCGTCGGGC CGGAGGCGCT GGTCGGCCTG CACATGCCGC GCTCGGCCGA GCTCGTCGTC GGCATGCTCG CGATATTGAA GGCGGGCGGC GCGTACGTGC CGCTCGACCC CGCCTATCCG GCCTCGCGCA TCGAATTCAT GGTGGCCGAC GCGCGGCCGA TGCTGTCGAT CACGACGGGC GAGCACGCGG CGCAACTGCC GGCCCGCACG CCGACGATCG TGCTCGACGC CGCCGACGCG 
CAGGCGGCGT TGCGGCGCGC GCCCGCGCAC GACCCGGTGC GGCCGGCGCC GCTCGATCGC GAGCACGCGG CGTACGTGAT CTATACGTCC GGCTCCACCG GCAAGCCGAA GGGCGCGGTG GTGTCGCATC GCAACGTGAT CCGCCTGCTG GACGGCACGC GCGGCTGGTT CGATTTCGGC GCGGCGCAGA CGTGGACGCT GTTCCATTCG TTCGCATTCG ATTTCTCGGT GTGGGAGTGC TGGGGCGCGC TGCTCACGGG CGGGCGGCTC GTCGTCGTGC CTTACGACGT GAGCCGCTCG CCCGCCGAGT TCCTGAAGCT GCTGGTCGAC GAGCGCGTGA CGGTGCTCAA TCAGACGCCG TCGGCGTTTC GCCAGCTGAT GCAGGCCGAC GAGGCGCACG CGGACTTGAG CGCGCGGCTC GCGCTGCGCT ACGTGGTGTT CGGCGGCGAG GCGCTCGACG CGCGCAGCCT CGCGCGCTGG TATGAGCGGC ACGCGGACAC CGCGCCCCGG CTCGTCAACA TGTACGGCAT CACCGAGACG ACGGTCCACG TCAGCTATCT GGCGCTCAGC CGCGCGATCG CCGGGATGCC GGCGAACAGC CTGATCGGCC GGCCGCTTCC CGACTTGCGC GTGTATGTGC TCGACGCCGC GCTGCGCCCC GTGCCGGCGG GCGTGCCGGG CGAGATGTAC GTCGCCGGCG CGGGGCTCGC GCGCGGCTAT CTGCGCCGGC CGTCGCTCAC CGCGCAGCGC TTCATCGCCG ATCCGTTCGG GCCGCCGGGC ACGCGCATGT ACCGCACGGG CGACGTCGCG CGATGGCGGG CCGACGGCGG GCTCGATTTC ATCGGGCGGG CCGACGAGCA GGTGAAAGTG CGCGGCTTTC GCGTGGAGCT CGGCGAAATC GCCGCGCGGC TCGCGTGCGA TCCGTCGGTC GCGCAGGCGC AGGCCGTCGT GCGGCAGGAC GGGCCTGCGC ACGAGCGGCT CGTCGCGTAC GTCGTGCCGC GCGCCGGCGC GACGATCGAC GTTTGCGCGC TGCGCGCGTC GCTCGCGGCC GAGATGCCGG AATACATGGT GCCCGCGGCC ATCGTCGCGC TCGATGCGAT GCCGCTCACG CCGAACGGCA AACTCGACCG CGCGGCGCTG CCGGCGCCGA TCGTGACGGG CACGAGCCGG CGCGCGCCGG AAAACCGCAT CGAGCAGCAG GTGTGCGCGA TGTTCGCGGA ACTGCTCGAT GCGCAGACGC TCGGCGCCGA GGACAATTTT TTCGAGCTCG GCGGCGATTC GCTGCTGGCG ATGCGCGCGA TCAACAAGCT GCAGCAGACG TTCGATGTCG AGCTGACGAT CCGCGATCTG TTTTCCGCGC CGACTGTCGC GGCGCTCAGC ATGCGGCTCG ATGCGCAATT GGCGGCGCGC CGCGCTCACG CGGACGGCGC TGAGCTGCCC GCCGGGTAG
|
Protein sequence | MQHIDTGLRL AMNMRDTSLR PLLPAQREIW LAEQLRPGTG VYNTAGYADI VGAVDHGALR DAFGHAMREA DAVRATFSAH GDDAWQTPRD APHADVPLVD VSGQADPAGV ALGWMMRDMR TPPDFAAGPL VRSALLRVGA ARFFWYVCAH HIVTDAYGTA LVVQRTAQLY GARVRGAAPP PAWFGTLDAL IDEDRAYRES AAFADDRDYW RARLAAAPEP RSLSPVALPG QFAPAGDFHR QWGELDAATS DGLRAMARAG AQGMPPLVAA MMAAYVHRFT HERDVVLGLP VMARLSRATR RTPGMVANVL PLRFAFDRGT TFAALHAQCA AEMRASLRHQ RYRGEAMLRD AQQSRPAERL HAQNVNVMAF ERALAFGDCP ARSHSLSNGP VDDFSITVYD DGARQPIRIA FDANAARYAS GDLAAHRERF LRLGVALANA AQRPIADAQW LTPAERRTLV GERGTPAPDA GEPFDTIAAR FAARAAERPD AIALVDRGQR ITYGELNARA NRLAHVLIEA GVGPEALVGL HMPRSAELVV GMLAILKAGG AYVPLDPAYP ASRIEFMVAD ARPMLSITTG EHAAQLPART PTIVLDAADA QAALRRAPAH DPVRPAPLDR EHAAYVIYTS GSTGKPKGAV VSHRNVIRLL DGTRGWFDFG AAQTWTLFHS FAFDFSVWEC WGALLTGGRL VVVPYDVSRS PAEFLKLLVD ERVTVLNQTP SAFRQLMQAD EAHADLSARL ALRYVVFGGE ALDARSLARW YERHADTAPR LVNMYGITET TVHVSYLALS RAIAGMPANS LIGRPLPDLR VYVLDAALRP VPAGVPGEMY VAGAGLARGY LRRPSLTAQR FIADPFGPPG TRMYRTGDVA RWRADGGLDF IGRADEQVKV RGFRVELGEI AARLACDPSV AQAQAVVRQD GPAHERLVAY VVPRAGATID VCALRASLAA EMPEYMVPAA IVALDAMPLT PNGKLDRAAL PAPIVTGTSR RAPENRIEQQ VCAMFAELLD AQTLGAEDNF FELGGDSLLA MRAINKLQQT FDVELTIRDL FSAPTVAALS MRLDAQLAAR RAHADGAELP AG
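The relationship between the two sequence fields can be illustrated by translating a short stretch of the gene sequence. The sketch below strips the 10-base spacing used in this record and translates codon by codon. It relies on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in permitted start codons); the helper names `clean` and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used.

```python
# Codon table built in TCAG order; sense-codon assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    first_30 = clean("ATGCAGCATA TCGACACAGG ACTTCGCCTC")  # start of Gene sequence
    last_9 = clean("GCCGGGTAG")                           # end of Gene sequence
    print(translate(first_30))
    print(translate(last_9))
```

The first call yields the opening residues of the protein sequence field, and the second yields its final two residues followed by the stop marker.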
|
| |