Gene BMASAVP1_A1698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1698 
Symbolsbp 
ID4679603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1691359 
End bp1692396 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID639845965 
Productsulfate/thiosulfate ABC transporter, sulfate-binding protein 
Protein accessionYP_993024 
Protein GI121600537 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0701208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAAGC GCAACACGGG GCTGGCAGGC GGCGCGCGCC GTCTCATCGC ATCATTGGCG 
CTCGGCGCGG CGGCGGCGCT CGGCGCGCTC ACGCCGGCGC TCGCGGACAC GACGTTCCTG
AACGTTTCGT ACGACCCGAC GCGCGAGCTC TACCAGGACG TCAACCAGGC GTTCGGCAAG
GAATGGAAGG CGAGGACGGG CGAGACGGTG AACTTCAAGC AGTCGCACGG CGGCTCGGGC
GCGCAGGCGC GCTCGGTGCT CGACGGGCTG CAGGCCGACG TGGTCACGCT CGCGCTCGCG
TACGACATCG ACGCGCTCGC GAACAAGGGC CTCGTCAGCA AGGATTGGCA AAAGCGTCTG
CCGGACAACG CGTCGCCGTA CACGTCGACG ATCGTGTTCC TCGTGAGGAA GGGCAATCCG
AAGGGCATCA AGGATTGGGA CGATCTCGTG AAGCCGGGCG TGTCGATCGT CACGCCGAAC
CCGAAAACCT CGGGCGGCGC GCGCTGGAAC TACCTCGCCG CGTGGGCATA CGCGCAGCAC
CAGCCGGGCG GCACGGCGCA GACGGCGAAG GATTTCGTCA CGAAGCTGTA CAGGAACGCG
GGCGTGCTCG ACTCGGGCGC GCGCGGCGCG ACGACGAGCT TCGTGCAGCG CGGCATCGGC
GACGTGCTGA TCGCGTGGGA AAACGAGGCG TTCCTGTCGA TCAAGGAATT CGGCGCCGAC
AAGTTCGAGA TCGTCGTGCC GTCGGCGAGC ATTCTCGCGG AGCCGCCGGT GGCGGTGGTC
GACAAGGTGG TCGACAAGAA GGGCACGCGC AAGCTCGCCG ACGCGTACCT GAACTTCCTG
TACAGCAGGC AAGGGCAGGA GATCGCCGCG CGCAACTACT ACCGGCCGCG CTCGCGGGAC
GTGCCGGCGG CGCTCACGAA GCAGTTCCCG AAGCTCAAGC TGTACACGGT CGACGACACG
TTCGGCGGCT GGACCCAAGC GCAGAAGACG CATTTCGCCG ACGGCGGCGT GTTCGATTCG
ATCTACAAGC CGCAGTGA
 
Protein sequence
MVKRNTGLAG GARRLIASLA LGAAAALGAL TPALADTTFL NVSYDPTREL YQDVNQAFGK 
EWKARTGETV NFKQSHGGSG AQARSVLDGL QADVVTLALA YDIDALANKG LVSKDWQKRL
PDNASPYTST IVFLVRKGNP KGIKDWDDLV KPGVSIVTPN PKTSGGARWN YLAAWAYAQH
QPGGTAQTAK DFVTKLYRNA GVLDSGARGA TTSFVQRGIG DVLIAWENEA FLSIKEFGAD
KFEIVVPSAS ILAEPPVAVV DKVVDKKGTR KLADAYLNFL YSRQGQEIAA RNYYRPRSRD
VPAALTKQFP KLKLYTVDDT FGGWTQAQKT HFADGGVFDS IYKPQ