Gene BMASAVP1_A0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A0521 
Symbol 
ID4680969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp523839 
End bp524960 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID639844798 
Productcapsular polysaccharide biosynthesis/export periplasmic protein 
Protein accessionYP_991870 
Protein GI121598506 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.21613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGCGG TCACGCTCGC CGGTTGCTCA AGTATCCCTA CGTCGGGGGC CAGTGGCGCG 
CAAATCGCGC GGGCTGCGCA GAGTCCATCC GGAATTCAGA TCGTCGATGT GACCGAGGAT
GTCGCGCGCC AGCTGTTTGC TGATCGAAAC ACGGCGGACT TCGTGACGGC GCTGGGCGGC
GGTGCGTCGT TCCGGCAACA GTTGGGCGTC GGCGATACGA TTCAGGTGTC CATCTGGGAG
GCGCCACCCG CCACGCTTTT TGGCGCGGCT CAGTCGGAAG GGAGTTCGGG GCCGGCGAAC
GCGCGCGTGA CGGTGCTGCC CGATCAAGCC ATCGATGGCG ACGGCAATGT CAATATTCCG
TTTGCGGGCC AGGTCAAGGC GGCCGGCCGC TCGCCCACGC AGTTGGCGCG TGAGATTGCC
GCGCGGCTGA AGAGCATGGC GCACGATCCG CAAGTGCTCG TGAAGCTTTC ACGCAACGAG
ACGTCATATG TGACGGTCGT GGGCGATGTG GCGGAAAACG CTCGCATGGC TCTGACCGCT
CGGGGCGAGC GCCTGCTTGA TGCATTGGCG AGCGCAGGCG GGGCGAAGCA CCCGGTTGAC
AAGGTTACGA TCCAGATAAC GCGCGGCAAG ACGGTGGCCT CGTTGCCGCT CGACATGGTT
ATTCGTGATC CGCGGCAGAA TGTCCCGCTG CATGCGGGCG ACGTGGTCAC TGTCCTGTTT
CAGCCGTATA GCTTTACGGT GCTCGGCGCG ACGGGCAAGA ATGACGAAAT CAATTTTGAA
GCGAAGGGCA TCACGCTTGC GCAGGCCCTG GCGCGTGCTG GCGGCTTGCA GGATTCGCGC
GCCGATGCAA AGGGCGTATT CATCTTCCGA CTTGAAGACG CCAACGCGCT GAAATGGCCG
ACGGCTCCCG TGCGTACGAC TGCGGACGGA AAGGTGCCTG TCGTGTATCG CGTGAATCTT
CGCGATCCGA ATTCGTTTTT CGTGGCTCAG AGCTTCAGGG TCGACAACAA CGATCTGTTG
TACGTTTCGA ATGCGCCGAT TGCCGAACTT CAAAAATTCT TGAATGTCGT GTTCTCCGTT
GCGTATCCGG TGATTACCGG CGTTCAGACA GTCAGGTACT GA
 
Protein sequence
MGAVTLAGCS SIPTSGASGA QIARAAQSPS GIQIVDVTED VARQLFADRN TADFVTALGG 
GASFRQQLGV GDTIQVSIWE APPATLFGAA QSEGSSGPAN ARVTVLPDQA IDGDGNVNIP
FAGQVKAAGR SPTQLAREIA ARLKSMAHDP QVLVKLSRNE TSYVTVVGDV AENARMALTA
RGERLLDALA SAGGAKHPVD KVTIQITRGK TVASLPLDMV IRDPRQNVPL HAGDVVTVLF
QPYSFTVLGA TGKNDEINFE AKGITLAQAL ARAGGLQDSR ADAKGVFIFR LEDANALKWP
TAPVRTTADG KVPVVYRVNL RDPNSFFVAQ SFRVDNNDLL YVSNAPIAEL QKFLNVVFSV
AYPVITGVQT VRY