Gene BURPS668_A1659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1659 
Symbol 
ID4886894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1601242 
End bp1602759 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content71% 
IMG OID640131598 
Productlinear gramicidin synthetase subunit C 
Protein accessionYP_001062655 
Protein GI126442769 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0625323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCTCAA CCAGCGTCAA TCGTCTGTTC ACTTCGCAAG CCCGCCTTGC GCCCGAAGCG 
CTCGCGCTCT CGAGCGGCGA CACGCGCCTC ACGTACGGCG AGCTCGAACG ATGCGCGAAC
CACCTGGCCC GACGCCTCGT CGACAGCGGC GTGCGGCCGC GCGACCGGGT CCTGCTCTGC
CTGCCGCGCT CGGTCGACGC GGTGATCGCG ATGCTCGCGA TCATGAAGAC CGGCGCGGCG
TTCGTGCCGG TCGATCCCGC GTATTCCGAC GCGATCAAGC GCGGCTATGC GAGCGACAGC
GGCGCGCGGC ACGCGCTCGC GCGCGCGGCC GACGCCGCGG CGTTTCGCGG CGGCGCGCTG
GGCGTGATCG ACGCCGACGA TCTGTCGGCC GCACGCGATG ACGAGGGGCC CGAAGTCGAT
GCGGGGCACG ACGGGGAAAC GCCGGTGTAC GTGATGTTCA CCTCCGGCAG CACCGGCCGG
CCCAAGGGCG TGATCGTCGC GCACCGCGGC GTCGCGCGGC TCGTCAGGGA AACGAACTAT
ATCCGGATCA CGCGCGAGGA CACGCTGCTG CTGCTCTCGC CGATCACGTT CGACGCATCG
ACGTTCGAGA TCTGGGGGGC GCTGCTCAAC GGCGCGCGGC TTGCGATCTA CGAGGACGCC
ACGTTCGATC CGAACGCCGT CAGCCGGCTC ATCGCGCGCG AGCAAGTAAG CGTGATGTGG
CTCACCGCGG GGCTGTTCCA TCTGGTCGCG CGGCGCTTCA TCGGCATGCT GGCGGGGCTG
CGCGTCGTGC TCGCGGGCGG CGACGTGCTG AGCGCCGCCG CGATCGGCGC GGTGTTCGAC
GCGTTCCCGT CGATCACCGT CATCAACGGC TACGGCCCGA CCGAGAACAC GACGTTCACG
TGCTGCCACG TGATGACGGC CGACCGGCGG CCGACCGGTA CGGTGCCGAT CGGCCGGCCG
ATCGCGGGCA CCGACGTTCG CATTCTCGAC GCCGCGCTGC GCGAGGTGCC TGACGGCGAG
GAAGGCGAGC TGTGCGCAAG CGGCCTCGGC GTCGCGCTCG GCTACCTGAA CGCGCCCGAC
GCGACGCGCG CCGCGTTCGT CGACTGCCCG GCGACGGGCA GCCGGCTCTA TCGCACCGGC
GACCGCGCAC GGCGCCGGGC GGACGGCGTG ATCGAGTTCC TCGGCCGCAG CGACCGGCTC
GTGAAGATAC GCGGCTACCG CGTGTCGCTC GACGAGCTGC AATCCGTCCT CGCCGGCATT
CCCGGCGTCG AGGAGGCGCT CGTCAAGGTA TCCGAAGAAG CGACCGGCGA GAAGCGCCTC
AGCGCGATCG TCCAATCCGG CCGCGCCGAA CCGGACATGA AGGCCTACGT GCGCCGCGAA
CTGGCCAAGC GCGTGCCGCC GTTCCAGATT CCCGACGACA TCCGGATTTT CCCGCACATC
CCGCTCAACG CGAACGGCAA GCTCGACCGC CACCGGCTGC CGGCCAGCGA GACCTCGACC
CTCGGAGAGA AGCCATGA
 
Protein sequence
MTSTSVNRLF TSQARLAPEA LALSSGDTRL TYGELERCAN HLARRLVDSG VRPRDRVLLC 
LPRSVDAVIA MLAIMKTGAA FVPVDPAYSD AIKRGYASDS GARHALARAA DAAAFRGGAL
GVIDADDLSA ARDDEGPEVD AGHDGETPVY VMFTSGSTGR PKGVIVAHRG VARLVRETNY
IRITREDTLL LLSPITFDAS TFEIWGALLN GARLAIYEDA TFDPNAVSRL IAREQVSVMW
LTAGLFHLVA RRFIGMLAGL RVVLAGGDVL SAAAIGAVFD AFPSITVING YGPTENTTFT
CCHVMTADRR PTGTVPIGRP IAGTDVRILD AALREVPDGE EGELCASGLG VALGYLNAPD
ATRAAFVDCP ATGSRLYRTG DRARRRADGV IEFLGRSDRL VKIRGYRVSL DELQSVLAGI
PGVEEALVKV SEEATGEKRL SAIVQSGRAE PDMKAYVRRE LAKRVPPFQI PDDIRIFPHI
PLNANGKLDR HRLPASETST LGEKP