Gene BMASAVP1_A1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1917 
Symbol 
ID4679206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1897932 
End bp1898951 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content69% 
IMG OID639846181 
Producthypothetical protein 
Protein accessionYP_993236 
Protein GI121600762 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTAC TGAAGAAGTG CGCCGCGCTC GCGGCGCTCG CCGTCGTATT GCTGGGCGTC 
GCGTGCGCGG GCGGCGCCTA TTACTGGGCC ACTCGGCCGC TCGCGCTCGC CGCGCCGATC
CTCGATGTGA CGATCAAGCC CCGCAGCAGC GTGCGCAGCG TCGCGCAGCA ACTCGTGCAC
GGCGGCGTGG GCGTCGAGCC GCGCCTGTTC GTCGCGATGA CGCGCGTGCT GTTCCTGTCG
AGCCGGCTCA AGTCCGGCAA TTACGAATTC AAGACGGGCG TGAGCCCTTA CGAGGTGTTG
CAGAAGGTCG CGCGCGGGGA CGTGAACGAA TATGTCGTGA CCGTGATCGA GGGCTGGACG
TTCCGGCGCA TGCGCGCGGA GCTCGACGCG AATGCGGCGC TCGCGCATGC GAGCGCGGGG
ATGAGCGACG CGGCGCTGCT GCGCGCGATC GGCGCGCCCG CCGAAGTCGT CGCGCGCGGC
ACCGGCGAGG GGCTGTTCTT TCCGGATACC TATCTGTTCG ACAAGGGCAC GAGCGACCTG
AACGTGTATC GGCGCGCGTA CCGGCTGATG CAGGCGCGCC TGGCCGACGC GTGGACCGCC
CGTCGGCCCG GCCTGCCGTT CAAGACCCCT TACGAGGCGC TGACGGTCGC GTCGCTCGTC
GAGAAGGAGA CGGGGCACGC GTCCGACCGT GCGTTCGTGT CGGGCGTGTT CGCGAATCGC
CTGCGGGCCG GGATGCCGCT GCAGACCGAT CCCTCGGTGA TCTACGGAAT GGGCGACGCG
TACACGGGGC GGCTGCGCAA GCGCGATCTG CAGACCGACA CTCCGTACAA TACCTACACG
CGCCGCGGGC TGCCCCCGAC GCCGATCGCG CTGCCGGGCG AGGCGGCGCT CTACGCCGCG
GTGAACCCGG CGGCGACGTC CGCGCTCTAT TTCGTCGCGA GGGGCGACGG CACGAGCGTC
TTCTCGGACA CGCTCGGGGA TCACAACAAG GCCGTGGACA AATACATACG AGGTCAATGA
 
Protein sequence
MSLLKKCAAL AALAVVLLGV ACAGGAYYWA TRPLALAAPI LDVTIKPRSS VRSVAQQLVH 
GGVGVEPRLF VAMTRVLFLS SRLKSGNYEF KTGVSPYEVL QKVARGDVNE YVVTVIEGWT
FRRMRAELDA NAALAHASAG MSDAALLRAI GAPAEVVARG TGEGLFFPDT YLFDKGTSDL
NVYRRAYRLM QARLADAWTA RRPGLPFKTP YEALTVASLV EKETGHASDR AFVSGVFANR
LRAGMPLQTD PSVIYGMGDA YTGRLRKRDL QTDTPYNTYT RRGLPPTPIA LPGEAALYAA
VNPAATSALY FVARGDGTSV FSDTLGDHNK AVDKYIRGQ