Gene BURPS1106A_A1578 details

Gene Information

Locus tag: BURPS1106A_A1578
Symbol:
ID: 4905875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1536285
End bp: 1537802
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 71%
IMG OID: 640144684
Product: linear gramicidin synthetase subunit C
Protein accession: YP_001075612
Protein GI: 126455823
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
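
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1536285
end_bp = 1537802

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # matches the Gene Length field
print(protein_length, "aa")   # matches the Protein Length field
```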


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCTCAA CCAGCGTCAA TCGTCTGTTC ACTTCGCAAG CCCGCCTTGC GCCCGAAGCG 
CTCGCGCTCT CGAGCGGCGA CACGCGCCTC ACGTACGGCG AGCTCGAACG ATGCGCGAAC
CACCTGGCCC GACGCCTCGT CGACAGCGGC GTGCGGCCGC GCGACCGGGT CCTGCTCTGC
CTGCCGCGCT CGGTCGACGC GGTGATCGCG ATGCTCGCGA TCATGAAGAC CGGCGCGGCG
TTCGTGCCGG TCGATCCCGC GTATTCCGAC GCGATCAAGC GCGGCTATGC GAGCGACAGC
GGCGCGCGGC ACGCGCTCGC GCGCGCGGCC GACGCCGCGG CGTTTCGCGA CGGCGCGCTG
GGCGTGATCG ACGCCGACGA TCTGTCGGCC GCGCGCGATG ACGAGGGGCC CGAAGTCGAT
GCGGGGCACG ACGGGGAAAC GCCGGTGTAC GTGATGTTCA CCTCCGGCAG CACCGGCCGG
CCCAAGGGCG TGATCGTCGC GCACCGCGGC GTCGCGCGGC TCGTCAGGGA AACGAACTAT
ATCCGGATCA CGCGCGAGGA CACGCTGCTG CTGCTCTCGC CGATCACGTT CGACGCATCG
ACGTTCGAGA TCTGGGGGGC GCTGCTCAAC GGCGCGCGGC TTGCGATCTA CGAGGACGCC
ACGTTCGATC CGAACGCCGT CAGCCGGCTC ATCGCGCGCG AGCAAGTAAG CGTGATGTGG
CTCACCGCGG GGCTGTTCCA TCTGGTCGCG CGGCGCTTCG TCGGCATGCT GGCGGGGCTG
CGCGTCGTGC TCGCGGGCGG CGACGTGCTG AGCGCCGCCG CGATCGGCGC GGTGTTCGAC
GCGTTCCCGT CGATCACCGT CATCAACGGC TACGGCCCGA CCGAGAACAC GACGTTCACG
TGCTGCCACG TGATGACGGC CGACCGGCGG CCGACCGGTA CGGTGCCGAT CGGCCGGCCG
ATCGCGGGCA CCGACGTTCG CATTCTCGAC GCCGCGCTGC GCGAGGTGCC TGACGGCGAG
GAAGGCGAGC TGTGCGCAAG CGGCCTCGGC GTCGCGCTCG GCTACCTGAA CGCGCCCGAC
GCGACGCGCG CCGCGTTCGT CGACTGCCCG GCGACGGGCA GCCGGCTCTA TCGCACCGGC
GACCGCGCAC GGCGCCGGGC GGACGGCGTG ATCGAGTTCC TCGGCCGCAG CGACCGGCTC
GTGAAGATAC GCGGCTACCG CGTGTCGCTC GACGAGCTGC AATCCGTCCT CGCCGGCATT
CCCGGCGTCG AGGAGGCGCT CGTCAAGGTA TCCGAAGAAG CGACCGGCGA GAAGCGCCTC
AGCGCGATCG TCCAATCCGG CCGCGCCGAA CCGGACATGA AGGCCTACGT GCGCCGCGAA
CTGGCCAAGC GCGTGCCGCC GTTCCAGATT CCCGACGACA TCCGGATTTT CCCGCACATC
CCGCTCAACG CGAACGGCAA GCTCGACCGC CACCGGCTGC CGGTCAGCGA GACCTCGACC
CTCGGAGAGA AGCCATGA
 
Protein sequence
MTSTSVNRLF TSQARLAPEA LALSSGDTRL TYGELERCAN HLARRLVDSG VRPRDRVLLC 
LPRSVDAVIA MLAIMKTGAA FVPVDPAYSD AIKRGYASDS GARHALARAA DAAAFRDGAL
GVIDADDLSA ARDDEGPEVD AGHDGETPVY VMFTSGSTGR PKGVIVAHRG VARLVRETNY
IRITREDTLL LLSPITFDAS TFEIWGALLN GARLAIYEDA TFDPNAVSRL IAREQVSVMW
LTAGLFHLVA RRFVGMLAGL RVVLAGGDVL SAAAIGAVFD AFPSITVING YGPTENTTFT
CCHVMTADRR PTGTVPIGRP IAGTDVRILD AALREVPDGE EGELCASGLG VALGYLNAPD
ATRAAFVDCP ATGSRLYRTG DRARRRADGV IEFLGRSDRL VKIRGYRVSL DELQSVLAGI
PGVEEALVKV SEEATGEKRL SAIVQSGRAE PDMKAYVRRE LAKRVPPFQI PDDIRIFPHI
PLNANGKLDR HRLPVSETST LGEKP
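
The protein sequence can be reproduced from the gene sequence with translation table 11, as the following minimal sketch shows. The `translate` function and its `start_codons` parameter are illustrative names, not part of the record. Treating only ATG, GTG and TTG as initiators is a simplifying assumption (table 11 lists further alternative start codons); it is sufficient here because the gene starts with GTG, which is read as methionine in the first position.

```python
from itertools import product

# Amino acids for all 64 codons in the conventional T, C, A, G ordering.
# Translation table 11 uses the same codon assignments as the standard code;
# it differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, start_codons=frozenset({"ATG", "GTG", "TTG"})):
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts in frame at the initiation codon.
    Whitespace is removed so the 10-base blocks can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position encodes Met.
        if i == 0 and codon in start_codons:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("GTGACCTCAA CCAGCGTCAA TCGTCTGTTC"))  # MTSTSVNRLF
```

Passing the complete gene sequence to `translate` in the same way should give the full protein sequence listed above, without the block spacing.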