Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2086 |
Symbol | |
ID | 4885416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2071678 |
End bp | 2074686 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128014 |
Product | linear gramicidin synthetase subunit D |
Protein accession | YP_001059121 |
Protein GI | 126439535 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain [TIGR01746] thioester reductase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT CGAACACCGT GCTCGAACGG GTGCGCGCGT GGTGCGCCGC CACGCCGCGG GCCGTTGCCG TCGCGACCGC GGATGCGACG ATGACGTATG GCGAGCTCGA CCGCGCGAGC GACGCCGTCG CCGCGTTTCT CGAGGCCGAG CGCATCGGTG CGGGCAGCAT CGTGCCGATC GAGGCGATGC GCACGGACGA TTTCGTCGCG GGCATGCTCG GCATCGTGAA GGCGGGCGCC GCGTACTGCC CGATCGATCA CGCGTATCCC GAAGCGCGCA AGACGCACAT CGTCGAGCGA ACCGGCTCGC CGCTGCTGCT CACGGCCGTG TCGCCGCGCA CGCCGCTCGC GTGCGCGCGG GCGCCGCGCA CCGCGAGCAT CGCGGCGCTG CGGCGCGCGG GGATGCCGCG TTCGGCGTCG CCGCGCACGC CGCGGCCGAA CGACGCGATC TACGTGATCT TCACCTCGGG TACGACGGGT GTGCCCAAGG GCGTCGTCGT CGAGCATCGC TCGGTCGACG GGCTGATCGC GTGGCACAAC GCGCAGTTCG GCGTCGACCG CACGAGTCGC TCGACGCAGA TCGCCGCGCT CGGCTTCGAC GCCGCGCATT GGGAGATCTG GTCGCCGCTT TGCGCGGGCG CGCGGCTGCG CTTCGTCGAC GACGACGCGC GGCGCGACGC GAACGCGCTC GTCGCGTTGC TCGAGCGCGA GCGGATCACG CATGCGTTCG TGCCGACGGT GATGGCGCGC GACGTCGTGG CCGCGAGCGA ACCGGGCCCG TCGGCGCTGC GCTATCTGTT CACCGGCGGC GAGAAGCTGA ATCCGGTCGA CACGGACCGC ATCCGTTATC GGCTGATCGA CTACTACGGG CCGACCGAGG CGACGATGTG GGCGAGCTTT CATCCGGTGC AAAGCGCGAG CCTCGGCTTG CCGCCGTCGA TCGGCACGCC GGTCGGCGGC GCGCGAATCG CGATATTCGA CGAGCGACTG CGCGAGGCGC AGAGCGGTGC CGTCGGCGAG ATTGTCATCT CGGGCCCGTG TCTCGCGCGC GGCTATCTCG ACGATCCGAG GCAGACCGCG GAGAAGTTCC TTGCGCATCC GTCGCGCCCC GGCGAGCGCG TCTATCGGAC GGGCGACCTC GGGCGCCGGC TGCCCGACGG CGCGATCCAG TTCGTCGGTC GCCTCGACGA TCAGGTGAAG ATCCGCGGCT ATCTCGTCGA GCCGGGTGAG GTCGAGATCG CGATCGCGCG GCAGTCGGGC GTGCGCCGGG TCGCCGTCGT CGCGACGTCG CCCGCCGACG GCGCGCCGAG AGAACTCGTC GCGTTCGTCG TGCCGGCCGA TCCGGCCGCG CCGCGCCGGC CGCTCGTCGG CCGCCTGCGC GCGGGCGTGG CCGCATCGCT GCCGCCTTTC ATGGTGCCCG GGCATTTCGC GATCGTCGAC GCGCTGCCGC TGTCCGCGAA CGGCAAGACC GACAAGGCGG CGCTCGTCGC GATGCACGGG CGGCGCGCCG CGCGCGCGGA TTTCGCGGAG GTGGCCGACG CCGTCGAGCG CACGGTGTGC GAATCGTTCG CCGACGCGCT CGGCCATGCG GATTTCGGCG TCGATGACAG CTTCTTCGAC GTGGGCGGCC ATTCGCTCGT CGCGGCGGCC GCCGTCGCGT CGCTGTCCGC GCGGGTCGGC GTCGCGCTGC GTCTGTCCGA TCTGTACAGG CGGCCGTCCG CCGCGGCGCT CGCGGTCGAC ATCAGGCGAA GGCCGTCGGC CGGCGATCCG GGCGCCCTCG ACCTGACGCC CGCCGACGTG CTGCGCCGCG ATGCGATCCT GCCGGAGGAC ATCGCGTTCG ACGGCGCGTT CGATCCGCAG CGGCTCGCGC GCCCGGCGCA CGTGCTGCTG ACGGGCGCGA CGGGCTTCGT CGGCGTGCAT CTGCTCGCGC AGTTGCTGGC CACCACGGAG GCGGTGATCC ATTGCGTCGT GCGGGCGCGG GACGCGCACG ACGCCGAGCG GCGGGTCGCC GACAAATTGC GCACCTACCG GCTCGGCGTG TCCGAGCGCG ATCGCGCGCG CATCCGGTGC CACGCCGGGG ACATCGCGCA CGACAGGCTC GGCATGGCGA GCGCGGATTA CGACGCGCTC AGCCGGTGCG TCGACGTCGT CCATCATTCG GCGAGCGCGG TCAACTTCAT CAAGCCGTAT GCGGCGATGA AGCGCGACAA CGTCGACGGG CTCGTCAACG TGATCCGGTT CGCCGCCGCC GCGCGCGTGA AGGCGCTGTC GCTGCTGTCG ACGATCTCGG TCTATAGCTG GGGGCACCGG ATCACGGGCA AGACCGTGAT GCGAGAGGAC GACGACCTCG ACCAGAATCT CGACGCCGTG TGCGCCGACA TCGGCTACGT GAAGAGCAAA TGGGTGATGG AGAAGCTCGC CGACGCGGCG CGCGCGCGCG GGCTGCCGCT TATCACGTTT CGCGTCGGCT ACGCGACGTA TCACGCGCAG ACCGGCCTGA GCGCCGACTA CCAATGGTGG GGGCGGCTCG TGAAGACGTG CATCGCGCTG CGCGCGGTTC CCGAGCTGCG CGAGCTTCGC GAGGGCTTGA GCACCGTCGA CTACATGACG GCGGCGATCG CGCACATCGC GCGCAATCCG GCCGCGCCTG GCAAGAAATT CAACCTGACG CATTCGGGCG AGCGCAACCT GTCGCTCGAG GATTTTTTCG ACCGGCTCGA GCGCGCGTTC GGCTTTTCGT TCGCGCGGGT GCCGTTTCGC GACTGGTTCG ACCGCTGGAA GGACGACGCC GCGACGCCGC TCTATCCGGT GCTGAACCTG TTTCGCGACC CGATGCACGG CGGCATGTGC ATGGTCGAGC TGTATCAGCA CACCTACCGG TGGGAGCACG CGAACACGTC GGCGTTCCTC GCGGGCAGCG GCGTGCGGCC GCCCGAATTC GACGAGCCGG AGCTGCGCCG CTATCTCGTG CAATCGATCG GCATCGCGCC GGCGTGCGCC GCGCGCTGA
|
Protein sequence | MSDSNTVLER VRAWCAATPR AVAVATADAT MTYGELDRAS DAVAAFLEAE RIGAGSIVPI EAMRTDDFVA GMLGIVKAGA AYCPIDHAYP EARKTHIVER TGSPLLLTAV SPRTPLACAR APRTASIAAL RRAGMPRSAS PRTPRPNDAI YVIFTSGTTG VPKGVVVEHR SVDGLIAWHN AQFGVDRTSR STQIAALGFD AAHWEIWSPL CAGARLRFVD DDARRDANAL VALLERERIT HAFVPTVMAR DVVAASEPGP SALRYLFTGG EKLNPVDTDR IRYRLIDYYG PTEATMWASF HPVQSASLGL PPSIGTPVGG ARIAIFDERL REAQSGAVGE IVISGPCLAR GYLDDPRQTA EKFLAHPSRP GERVYRTGDL GRRLPDGAIQ FVGRLDDQVK IRGYLVEPGE VEIAIARQSG VRRVAVVATS PADGAPRELV AFVVPADPAA PRRPLVGRLR AGVAASLPPF MVPGHFAIVD ALPLSANGKT DKAALVAMHG RRAARADFAE VADAVERTVC ESFADALGHA DFGVDDSFFD VGGHSLVAAA AVASLSARVG VALRLSDLYR RPSAAALAVD IRRRPSAGDP GALDLTPADV LRRDAILPED IAFDGAFDPQ RLARPAHVLL TGATGFVGVH LLAQLLATTE AVIHCVVRAR DAHDAERRVA DKLRTYRLGV SERDRARIRC HAGDIAHDRL GMASADYDAL SRCVDVVHHS ASAVNFIKPY AAMKRDNVDG LVNVIRFAAA ARVKALSLLS TISVYSWGHR ITGKTVMRED DDLDQNLDAV CADIGYVKSK WVMEKLADAA RARGLPLITF RVGYATYHAQ TGLSADYQWW GRLVKTCIAL RAVPELRELR EGLSTVDYMT AAIAHIARNP AAPGKKFNLT HSGERNLSLE DFFDRLERAF GFSFARVPFR DWFDRWKDDA ATPLYPVLNL FRDPMHGGMC MVELYQHTYR WEHANTSAFL AGSGVRPPEF DEPELRRYLV QSIGIAPACA AR
|
| |