Gene BURPS1710b_A0915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0915 
Symbolwza 
ID3692021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1175143 
End bp1176333 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content66% 
IMG OID637731169 
Productcapsular polysaccharide biosynthesis/export periplasmic protein 
Protein accessionYP_336073 
Protein GI76818460 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.138032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAA AAGATATGCT GAATCGTTCG CTTAGACCCC TGGCGCTCGC CGTCGCCGCC 
GCCACGCTGC TGCAGGCGTG CGCGACGGCG CCCGGCAACT ACCTCGACAC GTCGCGTCTC
GACGACAAGG ACAGCCAGTC CGCCGAGCAT TACAACGTGC AGCTCATTAC CGCGCAGCTC
GTCGTTTCGC AGGCCGACGC GCAGCGCAAG GCTGGGCCGT TGCCGCCGGC GCGCTTCGTC
GATCCGATGC AGTACGTCTA CCGGATCGCG CCGCAGGACA TTCTCGGCGT GACCGTCTGG
GATCATCCGG AGCTCACGAC GCCGCAAGGC CAATCGTTCT CGAGCGGCGG CAACACGACG
CAGACGGTCG CGGGCGCGCT GCAGCAGCCG TATGCGAATG CGTTGCCCGG CCAGGCCGAT
CCGTACGGCC AGACGGTGAT GTCCGACGGC ACGATCTACT TTCCGTTCGT CGGCCGCCTG
CACGCGGCGG GCAAGACGGT CGGCCAGGTG CGCGACGAAC TCGCCGCGCG GCTGGCGCGT
TACGTGAAGA ATCCGCAGGT CGACGTGCGC GTGCTGTCGT ATCGCAGCCA GAAGGTGCAG
GTGACCGGCG AAGTGAAGAC GCCCGGCCCG CTTGCGATCA CCGATGTGCC GCTCACGCTC
GTGGACGCGA TCACGCGCTC GGGCGGCTCG ACGAACGAGG CCGACCTGCA GCGCGTGCGC
CTCACGCGCG ACGGCAAGTT CTACCAACTC GACGCGAACG GCATGCTCGA TCGCGGCGAC
GTCACGCAGA ACGTGATGCT GCAGCCGGGC GACATCGTCA ACGTGCCGGA CCGCGGCGAC
AGCCGCGTGT TCGTGATGGG CGAGGTGAAG ACGCCCGCGA CGGTGCCGAT GCTCAAGGGG
CGCTTGACGA TCGCGGACGC GCTCACGGCG GGAGGCGGCA TTCTCGATAC CGATGCGAAT
CCGCGTCAGG TGTACGTGTT GCGCGATCTG CAGGACAAGC CGAACACACC GGACATCTTC
CGCCTCGACA TGACGCAGCC CGACGCGCTG ATGCTGTCGA GCCGCTTCCA GTTGAAGCCG
CTCGACGTCG TGTACGTCGG CACGGCGGGA TCGGTGCGCT TCAACCGCCT GCTGCAGCAG
ATCTTCCCGA CGATCCAGTC GATTTACTAC ATGAAGCAGA TCACGCGCTG A
 
Protein sequence
MAAKDMLNRS LRPLALAVAA ATLLQACATA PGNYLDTSRL DDKDSQSAEH YNVQLITAQL 
VVSQADAQRK AGPLPPARFV DPMQYVYRIA PQDILGVTVW DHPELTTPQG QSFSSGGNTT
QTVAGALQQP YANALPGQAD PYGQTVMSDG TIYFPFVGRL HAAGKTVGQV RDELAARLAR
YVKNPQVDVR VLSYRSQKVQ VTGEVKTPGP LAITDVPLTL VDAITRSGGS TNEADLQRVR
LTRDGKFYQL DANGMLDRGD VTQNVMLQPG DIVNVPDRGD SRVFVMGEVK TPATVPMLKG
RLTIADALTA GGGILDTDAN PRQVYVLRDL QDKPNTPDIF RLDMTQPDAL MLSSRFQLKP
LDVVYVGTAG SVRFNRLLQQ IFPTIQSIYY MKQITR