Gene BURPS668_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2086 
Symbol 
ID4885416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2071678 
End bp2074686 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content71% 
IMG OID640128014 
Productlinear gramicidin synthetase subunit D 
Protein accessionYP_001059121 
Protein GI126439535 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins
[COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATT CGAACACCGT GCTCGAACGG GTGCGCGCGT GGTGCGCCGC CACGCCGCGG 
GCCGTTGCCG TCGCGACCGC GGATGCGACG ATGACGTATG GCGAGCTCGA CCGCGCGAGC
GACGCCGTCG CCGCGTTTCT CGAGGCCGAG CGCATCGGTG CGGGCAGCAT CGTGCCGATC
GAGGCGATGC GCACGGACGA TTTCGTCGCG GGCATGCTCG GCATCGTGAA GGCGGGCGCC
GCGTACTGCC CGATCGATCA CGCGTATCCC GAAGCGCGCA AGACGCACAT CGTCGAGCGA
ACCGGCTCGC CGCTGCTGCT CACGGCCGTG TCGCCGCGCA CGCCGCTCGC GTGCGCGCGG
GCGCCGCGCA CCGCGAGCAT CGCGGCGCTG CGGCGCGCGG GGATGCCGCG TTCGGCGTCG
CCGCGCACGC CGCGGCCGAA CGACGCGATC TACGTGATCT TCACCTCGGG TACGACGGGT
GTGCCCAAGG GCGTCGTCGT CGAGCATCGC TCGGTCGACG GGCTGATCGC GTGGCACAAC
GCGCAGTTCG GCGTCGACCG CACGAGTCGC TCGACGCAGA TCGCCGCGCT CGGCTTCGAC
GCCGCGCATT GGGAGATCTG GTCGCCGCTT TGCGCGGGCG CGCGGCTGCG CTTCGTCGAC
GACGACGCGC GGCGCGACGC GAACGCGCTC GTCGCGTTGC TCGAGCGCGA GCGGATCACG
CATGCGTTCG TGCCGACGGT GATGGCGCGC GACGTCGTGG CCGCGAGCGA ACCGGGCCCG
TCGGCGCTGC GCTATCTGTT CACCGGCGGC GAGAAGCTGA ATCCGGTCGA CACGGACCGC
ATCCGTTATC GGCTGATCGA CTACTACGGG CCGACCGAGG CGACGATGTG GGCGAGCTTT
CATCCGGTGC AAAGCGCGAG CCTCGGCTTG CCGCCGTCGA TCGGCACGCC GGTCGGCGGC
GCGCGAATCG CGATATTCGA CGAGCGACTG CGCGAGGCGC AGAGCGGTGC CGTCGGCGAG
ATTGTCATCT CGGGCCCGTG TCTCGCGCGC GGCTATCTCG ACGATCCGAG GCAGACCGCG
GAGAAGTTCC TTGCGCATCC GTCGCGCCCC GGCGAGCGCG TCTATCGGAC GGGCGACCTC
GGGCGCCGGC TGCCCGACGG CGCGATCCAG TTCGTCGGTC GCCTCGACGA TCAGGTGAAG
ATCCGCGGCT ATCTCGTCGA GCCGGGTGAG GTCGAGATCG CGATCGCGCG GCAGTCGGGC
GTGCGCCGGG TCGCCGTCGT CGCGACGTCG CCCGCCGACG GCGCGCCGAG AGAACTCGTC
GCGTTCGTCG TGCCGGCCGA TCCGGCCGCG CCGCGCCGGC CGCTCGTCGG CCGCCTGCGC
GCGGGCGTGG CCGCATCGCT GCCGCCTTTC ATGGTGCCCG GGCATTTCGC GATCGTCGAC
GCGCTGCCGC TGTCCGCGAA CGGCAAGACC GACAAGGCGG CGCTCGTCGC GATGCACGGG
CGGCGCGCCG CGCGCGCGGA TTTCGCGGAG GTGGCCGACG CCGTCGAGCG CACGGTGTGC
GAATCGTTCG CCGACGCGCT CGGCCATGCG GATTTCGGCG TCGATGACAG CTTCTTCGAC
GTGGGCGGCC ATTCGCTCGT CGCGGCGGCC GCCGTCGCGT CGCTGTCCGC GCGGGTCGGC
GTCGCGCTGC GTCTGTCCGA TCTGTACAGG CGGCCGTCCG CCGCGGCGCT CGCGGTCGAC
ATCAGGCGAA GGCCGTCGGC CGGCGATCCG GGCGCCCTCG ACCTGACGCC CGCCGACGTG
CTGCGCCGCG ATGCGATCCT GCCGGAGGAC ATCGCGTTCG ACGGCGCGTT CGATCCGCAG
CGGCTCGCGC GCCCGGCGCA CGTGCTGCTG ACGGGCGCGA CGGGCTTCGT CGGCGTGCAT
CTGCTCGCGC AGTTGCTGGC CACCACGGAG GCGGTGATCC ATTGCGTCGT GCGGGCGCGG
GACGCGCACG ACGCCGAGCG GCGGGTCGCC GACAAATTGC GCACCTACCG GCTCGGCGTG
TCCGAGCGCG ATCGCGCGCG CATCCGGTGC CACGCCGGGG ACATCGCGCA CGACAGGCTC
GGCATGGCGA GCGCGGATTA CGACGCGCTC AGCCGGTGCG TCGACGTCGT CCATCATTCG
GCGAGCGCGG TCAACTTCAT CAAGCCGTAT GCGGCGATGA AGCGCGACAA CGTCGACGGG
CTCGTCAACG TGATCCGGTT CGCCGCCGCC GCGCGCGTGA AGGCGCTGTC GCTGCTGTCG
ACGATCTCGG TCTATAGCTG GGGGCACCGG ATCACGGGCA AGACCGTGAT GCGAGAGGAC
GACGACCTCG ACCAGAATCT CGACGCCGTG TGCGCCGACA TCGGCTACGT GAAGAGCAAA
TGGGTGATGG AGAAGCTCGC CGACGCGGCG CGCGCGCGCG GGCTGCCGCT TATCACGTTT
CGCGTCGGCT ACGCGACGTA TCACGCGCAG ACCGGCCTGA GCGCCGACTA CCAATGGTGG
GGGCGGCTCG TGAAGACGTG CATCGCGCTG CGCGCGGTTC CCGAGCTGCG CGAGCTTCGC
GAGGGCTTGA GCACCGTCGA CTACATGACG GCGGCGATCG CGCACATCGC GCGCAATCCG
GCCGCGCCTG GCAAGAAATT CAACCTGACG CATTCGGGCG AGCGCAACCT GTCGCTCGAG
GATTTTTTCG ACCGGCTCGA GCGCGCGTTC GGCTTTTCGT TCGCGCGGGT GCCGTTTCGC
GACTGGTTCG ACCGCTGGAA GGACGACGCC GCGACGCCGC TCTATCCGGT GCTGAACCTG
TTTCGCGACC CGATGCACGG CGGCATGTGC ATGGTCGAGC TGTATCAGCA CACCTACCGG
TGGGAGCACG CGAACACGTC GGCGTTCCTC GCGGGCAGCG GCGTGCGGCC GCCCGAATTC
GACGAGCCGG AGCTGCGCCG CTATCTCGTG CAATCGATCG GCATCGCGCC GGCGTGCGCC
GCGCGCTGA
 
Protein sequence
MSDSNTVLER VRAWCAATPR AVAVATADAT MTYGELDRAS DAVAAFLEAE RIGAGSIVPI 
EAMRTDDFVA GMLGIVKAGA AYCPIDHAYP EARKTHIVER TGSPLLLTAV SPRTPLACAR
APRTASIAAL RRAGMPRSAS PRTPRPNDAI YVIFTSGTTG VPKGVVVEHR SVDGLIAWHN
AQFGVDRTSR STQIAALGFD AAHWEIWSPL CAGARLRFVD DDARRDANAL VALLERERIT
HAFVPTVMAR DVVAASEPGP SALRYLFTGG EKLNPVDTDR IRYRLIDYYG PTEATMWASF
HPVQSASLGL PPSIGTPVGG ARIAIFDERL REAQSGAVGE IVISGPCLAR GYLDDPRQTA
EKFLAHPSRP GERVYRTGDL GRRLPDGAIQ FVGRLDDQVK IRGYLVEPGE VEIAIARQSG
VRRVAVVATS PADGAPRELV AFVVPADPAA PRRPLVGRLR AGVAASLPPF MVPGHFAIVD
ALPLSANGKT DKAALVAMHG RRAARADFAE VADAVERTVC ESFADALGHA DFGVDDSFFD
VGGHSLVAAA AVASLSARVG VALRLSDLYR RPSAAALAVD IRRRPSAGDP GALDLTPADV
LRRDAILPED IAFDGAFDPQ RLARPAHVLL TGATGFVGVH LLAQLLATTE AVIHCVVRAR
DAHDAERRVA DKLRTYRLGV SERDRARIRC HAGDIAHDRL GMASADYDAL SRCVDVVHHS
ASAVNFIKPY AAMKRDNVDG LVNVIRFAAA ARVKALSLLS TISVYSWGHR ITGKTVMRED
DDLDQNLDAV CADIGYVKSK WVMEKLADAA RARGLPLITF RVGYATYHAQ TGLSADYQWW
GRLVKTCIAL RAVPELRELR EGLSTVDYMT AAIAHIARNP AAPGKKFNLT HSGERNLSLE
DFFDRLERAF GFSFARVPFR DWFDRWKDDA ATPLYPVLNL FRDPMHGGMC MVELYQHTYR
WEHANTSAFL AGSGVRPPEF DEPELRRYLV QSIGIAPACA AR