Gene BURPS668_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1990 
Symbol 
ID4882412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1966441 
End bp1968345 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content70% 
IMG OID640127918 
Productsyringopeptin synthetase C 
Protein accessionYP_001059025 
Protein GI284159947 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGTTT CGACCGATAC GTCCGCCGAA GCCCATTCAC CGGCGCCGAA TGCTCGCGAC 
ACCGACTCAC GCGCAGACTC ACTCGATATG TCAGACAGTG ACGCACTGCG CCGGATTGCC
GAAGCCGTCG ACGGCGGTGC GGCGAACATC GAGCGAATCG TTCCGCTCGC CCGTGCGCGC
GAGCGTATGC CGACGCGGCC TCGGCTCGAG CGGTGCGGCG GCGGGCGAGT GACGGCGGCG
CACCTCACGC TCGACTCGCG TGCGCGTCTC GATGCGTTGC TGCACGCATT GCAACGCGCG
ATCGACCAGA ACGCGGACCT GCGAACGTGC ATTTTGGGGG CGTGCCTGCG GCGGCCGATG
CAAGTCACGC TTCGCGAGGT TCGCCTGCGA GTGCACGCCG CGACGCTCGA CCCCGACCTC
GATCCCGCCG CGCAGTTGGC CGCGCTGAGC ACCGGGCCCG GCATGCGCAT CGACATGCAA
CGCCCGCCGT GGGTGCTCGC GTGCATCGCG CGCATTCCGG GCAGCGGGCA ATGGCTGCTG
CGGCTCGTGG CAGCCCCGAT CGCGGCCGGA TTCGACGCGC TCGACGCGCT GCTTCGCGAG
GCGGTGATTC ACGGCGACCG GGAGCCCGGG CCGACGCCGT TTCACTGGAC TGTGGAAACG
GCTGTTGAAT CGTGCGGAGG CGAACCTGCG TCGTTGCCGA CCGCGGGCGC GGTTTGGCCG
TCGAACGACG TATCACGCGC TTGCGATCCG GATGCCGCGT CGTGCGTCGA GGCGCGCATC
GCCGCGATCG CGTCCGATCT GCCGGGCGTC GTGCATGGCG GACCACGAGA CGATTTGCGC
GCGCTCGGAC GAACGCCGTT GCAGGCGCTT CGACTCGCGC GCCGTATCCG CGACGCACTG
GGCGTGACCG TACCGGTCGA GTCGATCCTC GCGAGTCCGA CCATCGTCGA GCTTGCCGGG
TACGTCGAGC AATTGCGCTC GCGGCACGTC CGCGACGGCG CTGCGCCCGT GTCGATCGGC
GAAAAACCGG CGGACGCGGA TGCTCGGGCG CAGGCGCAGG CGGATACGGA TACGGCGCAC
ACCGATTGCC TGATCGTCAT TCAAGCAGGC GGCGCCGAAC AAGCGCCGGT GTTCTGCATC
CCGGGCGCGG GGGGCAGCGT CGCGTCGTTC GTTGCGCTTG CGAGCATGCT GCGCGCCGAC
ATACCGGTAT ACGGCTTGCA GCCTCGCGGG CTGGACGGCC TGGGGCCGCC GGACCGATCC
GTCGAAGCGG CTGCGCGCCG GTACGCGCGA GCCATTCTGG ATGCCGCCCC GCCCGGGCCG
CCGCGCATCG TCGGCCACTC GTTCGGCGGC TGGATCGCGC TCGAGACAGC GCGGCTGCTG
GACGGCATGG GAGCGCGCTG CGCCCCGCTC GTCCTGCTCG ATTCGAATCC ACCGCCCGCG
TCGCAGGCCT GGCGCGCGCC TTCCGAGGCA GACATGCTGC GCACGCTCGT CGGCCTGCTC
GAGCAGGCCG CGGGCGGCGC ACCATCCGGG ATCGGCGACG AAGAAATCGC CCGTTGCGCG
GCAGCGGGCG AGGATGCGCG GGATGCGCTC GTGCACGCCT GCATGGTGAG GACCGCCCTG
CTGCCGCCGC GCGCGCCGGT CGAAACGGTG CGGCACCTGC GGCGGGTATT CGAAGCCCAT
TCGAGCACCC GCTACGCGCC GGGCGGCCGA TACGCGGGCG ACGCAACGGT GATCGTCGCC
AACGGCGATC GCGACGCGGG CGAGATGGTG CCGGCGTTCG GATGGGCCGC GCTGATCGAG
CGAGTCGAGG TGGCCGTGAC GCCGGGCAAT CACATGAGCA TGCTCGCGGC GCCGTATGTT
CGTCACGTCG CGCTGACGAT GAAGGCGGTA TGGCGCATGA TCTGA
 
Protein sequence
MHVSTDTSAE AHSPAPNARD TDSRADSLDM SDSDALRRIA EAVDGGAANI ERIVPLARAR 
ERMPTRPRLE RCGGGRVTAA HLTLDSRARL DALLHALQRA IDQNADLRTC ILGACLRRPM
QVTLREVRLR VHAATLDPDL DPAAQLAALS TGPGMRIDMQ RPPWVLACIA RIPGSGQWLL
RLVAAPIAAG FDALDALLRE AVIHGDREPG PTPFHWTVET AVESCGGEPA SLPTAGAVWP
SNDVSRACDP DAASCVEARI AAIASDLPGV VHGGPRDDLR ALGRTPLQAL RLARRIRDAL
GVTVPVESIL ASPTIVELAG YVEQLRSRHV RDGAAPVSIG EKPADADARA QAQADTDTAH
TDCLIVIQAG GAEQAPVFCI PGAGGSVASF VALASMLRAD IPVYGLQPRG LDGLGPPDRS
VEAAARRYAR AILDAAPPGP PRIVGHSFGG WIALETARLL DGMGARCAPL VLLDSNPPPA
SQAWRAPSEA DMLRTLVGLL EQAAGGAPSG IGDEEIARCA AAGEDARDAL VHACMVRTAL
LPPRAPVETV RHLRRVFEAH SSTRYAPGGR YAGDATVIVA NGDRDAGEMV PAFGWAALIE
RVEVAVTPGN HMSMLAAPYV RHVALTMKAV WRMI