Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0178 |
Symbol | |
ID | 4904510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 167806 |
End bp | 170964 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640143285 |
Product | putative non-ribosomal peptide synthase |
Protein accession | YP_001074221 |
Protein GI | 126457274 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGTTCG TCGGCATGGA TAAAACAATC GAATCATCCG GCATTTTTTG TCGATGGATT CAGTGGATCG TTTCGGAATC ACGTTCGCCT TCGTGCGTGA TTCCGGAGAA ATCGGGCATG CCGGAGCGCA GCGGCGCGCG TGATCGGATT CGCTCGAACA ATCCCGCTTT TTACCGGGCA GGCCGCCGGT TCGTTTCATG CAGTCGAGTG TCAATCGGCC GTCGACGCTC GCGAGAGCGC AACGGCGGCC AGTCAGGTAG GGATTCACCC GCGATCGTCA ACCGACGCGT TCGGGCGCGC GTGCCCGGAC GTTCGTTCCC ACTTGCACCA TCAGCATGGA GTACCTGCAC CATGACGGCA TCCACCCTCG ATTTACCGCG CGATTGCGAA CACGCATTGC GCGCCGCTTC GCCGCCAAAC ATCGTCGACC TGCTGTTGCG GGCCGCACGG CTGCATCCGC ATACGGGCGT GCGCTTCATC GCCGCCGAAT CCGAACACAA GGGCGCCTTC GTCACATATC CCGAGCTGCT CGACGAGGCG CGCCGCATCC TGGGCGGCCT GCGCGCGCGC GGCTATCGGT CCGGCATGAA GGTCGCGCTG CTGCTCGAGC ACGCGAGCGA TTTCATTCCG GCGTTCTGGG CCTGCGCGCT CGGCGGCTTC GTGCCGTGCC CGCTCGTGCC GATCCGCAAC GATCCCGAGC GCTGGGCGAA GCACCTCGCG CACGTCGATA CGCTGCTCGA CCATCCGCTG CTCGTCACCA CCGAAGCGCT GAACAACGAT CTGCCGGGCG GCGCGTCGGC CGTCAACCTG AACGCGCTGC GCGCGAGCCT GCCCGATGCG TCGACGCACG TCGCGCAACC GTCGGACCCG GCGGTTTTCG TGCTCACGTC GGGCTCCACC GGCAATTCGA AGGCGGTCGT GCTCACGCAC GGCAACCTGC TCGCGTCGAT GGCGGGCAAG AACGACCGGC AGCAGCTCGC GGGCGCGGAC GTCACGCTCA ACTGGATCTC GTTCGACCAC GTCGCCGCGC TGCTCGAAGC GCACCTGCTG CCGCTGTACG TCGGCGCCGT GCAGCTTCAC GTCGAAGCCG CGGCGGTTCT CACCGATCCG CTGCGCTTCT TGCGGCTCGT CAGCCGCTAT CGCGTGACGA TGACGTTCTC GCCGAACTTC CTGTTCGGGC AACTGAACGC CGCGCTCGAA GCGATGGGCG ACGAGGCGCT CGCCGCGTGG CGCGGCGCGG TGGATCTGTC GTCGCTGCGG CACGTCGTGT CGGGCGGCGA GGCGATCGTC GTCGCGACCG GGCAGCGTTT TCTCGATCTG CTCGCGCCGT GCGGCCTCGC GCGCGATGCG CTGTGGCCCG CGTTCGGGAT GACGGAGACG TGCGCCGGCT CCGTGTATTC GCGCGAGTTC CCGGAAGGCG ACGCGGGCCG CGAGTTCGCA TCGCTCGGCC TGCCGGTGGC CGGGCTGCAG ATGCGCATCG CGGACGACCG CAACAACGTG CTGCCGGAAG GCGAGGCGGG CGAGTTCCAG GTGCGCGGCC CGATGATCTT CCAGCGCTAT CACAACAATG CCGAGGCGAC GCGCGCGGCG TTCACGAGCG ACGGCTGGTT CCGCACGGGC GACCTCGGGC GCATCGAGCG CGGCCGGCTG TGGCTCGTCG GCCGCAGCAA GGACAGCATC ATCGTCAACG GCGTCAATTA CTTCAGCCAC GAGCTGGAGA CGACTCTCGA GGCGCTCGAC GGCGTCAAGC CCTCGTTCGT CGCGGCGTTT CCGACGCGCG GGGCCGGCGA CGAATCCGAG CAACTCGTCG TCACGTTCAC GCCGTCGTTT CCGCTCGACG ACGAGGACGC GCTGTATCGC CTCGTCATCG CGATCCGCAA CAGCACGATC CTGCTGTGGG GCTTCCGGCC CGCGCTGATC CTGCCGCTGC CGGAGGACGA ATTTCCGAAG ACGAGCCTCG GCAAGACCCA GCGCGCGATC ATGCGCAAGC GCCTCGAAGC GGGCAGCTAC GACGGCTACA AGGCGCGCGT CGCCGATCTC GCGAACCGGC AGATGGGCGG CTATGTCGCG CCCGACGGGC AGACCGAGGC CGCGGTGGCC GCGATCTTCG CGCGGATGTT CCAGGTCGCG CCCGAGGCGA TCAGCGCGAC CGCGAGCTTC TTCGATCTCG GCGGCACGTC GCTCGACATC CTGAAGCTCA AGCGCCACGT CGAACAGCGG CTCGGCGTGA TCGACCTGCC GATCGTGACG ATCCTCCAGA ACCCGAGCGT GCGCGCGCTG GCCGCGCGTC TCGCCCCGGG CGAGCGCGTG ACGGCGGGCG AATACGATCC GGTCGTGCCG TTGCAGCTCA CCGGCGGCAA GACGCCGCTG TTCTGCGTGC ACCCCGGCGT CGGCGAGGTG CTCGTGTTCG TCAACCTCGC GAAGTACTTC GTCAACGAGC GCCCGTTCTA TGCATTGCGC GCGCGCGGCT TCAACGAAGG GGAGACGTAT TTCTCCAGCT TCGACGAAAT GGTGAACACG TATGTCGACG CGATCCGCAA GCGGCAGCCG CACGGGCCGT ACGCGGTGGC CGGCTATTCG TACGGCGGCG CGGTCGCGTT CGAGATCGCG AAGGTGCTCG AAGCGCAGGG CGAGCGGGTG GATTTCGTCG GCAGCTTCAA TCTGCCGCCG CACATCAAGT ACCGGATGGA CGAGCTCGAC GAGGTGGAGG GCGCGGTCAA CCTCGCGTTC TTCCTGTCGC TGATCGACAA GCAGCAGTCG CTCACGCTGC CGCCGCAACT GCGCGCGGCG ATGCCGGAGC AAGACCCGCT CGCGTACCTG ATCGACCACG CGCCGCCCGG GCGGCTCGTC GAGCTCGACC TCGATCTGCC GAAATTCCGC GCGTGGGCGG GGCTCGCGCA ATCGCTGCTC ACGCTCGGGC GTTCGTACGC GCCGTCGGGC AGCGTGCGGG CGATGTCGAT CTTCTATGCG ATTCCGCTGC GCGGCACGAA GGACGACTGG CTGAACAAGG AACTGCGCCG CTGGGACGAG TTCACGCGCG CGCCGAACCG CTATATCGAC GTGGCGGGCG AACACTACAC GCTGATGGGG CCCGCGCACG TCGCGACGTT CCAGGCGGTG CTGCGGGCCG AGCTCGATCG CGCGCTCGGC GGCAAATGA
|
Protein sequence | MQFVGMDKTI ESSGIFCRWI QWIVSESRSP SCVIPEKSGM PERSGARDRI RSNNPAFYRA GRRFVSCSRV SIGRRRSRER NGGQSGRDSP AIVNRRVRAR VPGRSFPLAP SAWSTCTMTA STLDLPRDCE HALRAASPPN IVDLLLRAAR LHPHTGVRFI AAESEHKGAF VTYPELLDEA RRILGGLRAR GYRSGMKVAL LLEHASDFIP AFWACALGGF VPCPLVPIRN DPERWAKHLA HVDTLLDHPL LVTTEALNND LPGGASAVNL NALRASLPDA STHVAQPSDP AVFVLTSGST GNSKAVVLTH GNLLASMAGK NDRQQLAGAD VTLNWISFDH VAALLEAHLL PLYVGAVQLH VEAAAVLTDP LRFLRLVSRY RVTMTFSPNF LFGQLNAALE AMGDEALAAW RGAVDLSSLR HVVSGGEAIV VATGQRFLDL LAPCGLARDA LWPAFGMTET CAGSVYSREF PEGDAGREFA SLGLPVAGLQ MRIADDRNNV LPEGEAGEFQ VRGPMIFQRY HNNAEATRAA FTSDGWFRTG DLGRIERGRL WLVGRSKDSI IVNGVNYFSH ELETTLEALD GVKPSFVAAF PTRGAGDESE QLVVTFTPSF PLDDEDALYR LVIAIRNSTI LLWGFRPALI LPLPEDEFPK TSLGKTQRAI MRKRLEAGSY DGYKARVADL ANRQMGGYVA PDGQTEAAVA AIFARMFQVA PEAISATASF FDLGGTSLDI LKLKRHVEQR LGVIDLPIVT ILQNPSVRAL AARLAPGERV TAGEYDPVVP LQLTGGKTPL FCVHPGVGEV LVFVNLAKYF VNERPFYALR ARGFNEGETY FSSFDEMVNT YVDAIRKRQP HGPYAVAGYS YGGAVAFEIA KVLEAQGERV DFVGSFNLPP HIKYRMDELD EVEGAVNLAF FLSLIDKQQS LTLPPQLRAA MPEQDPLAYL IDHAPPGRLV ELDLDLPKFR AWAGLAQSLL TLGRSYAPSG SVRAMSIFYA IPLRGTKDDW LNKELRRWDE FTRAPNRYID VAGEHYTLMG PAHVATFQAV LRAELDRALG GK
|
| |