Gene BURPS668_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1847 
Symbol 
ID4883580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1808301 
End bp1809338 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID640127775 
Productsulfate/thiosulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_001058882 
Protein GI126438507 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAAGC GCAACACGGG GCTGGCAGGC GGCGCGCGCC GTCTCATCGC ATCATTGGCG 
CTCGGCGCGG CGGCGGCGCT CGGCGCGCTC ACGCCGGCGC TCGCGGACAC GACGTTCCTG
AACGTTTCGT ACGACCCGAC GCGCGAACTC TACCAGGACG TCAACCAGGC GTTCGGCAAG
GAATGGAAGG CGAGGACGGG CGAGACGGTG AACTTCAAGC AGTCGCACGG CGGCTCGGGC
GCGCAGGCGC GCTCGGTGCT CGACGGGCTG CAGGCCGACG TGGTCACGCT CGCGCTCGCG
TACGACATCG ACGCGCTCGC GAACAAGGGC CTCGTCAGCA AGGATTGGCA AAAGCGTCTG
CCGGACAACG CGTCGCCGTA CACGTCGACG ATCGTGTTCC TCGTGAGGAA GGGCAATCCG
AAGGGCATCA AGGATTGGGA CGATCTCGTG AAGCCGGGCG TGTCGATCGT CACGCCGAAC
CCGAAAACCT CGGGCGGCGC GCGCTGGAAC TACCTCGCCG CGTGGGCATA CGCGCAGCAC
CAGCCGGGCG GCACGGCGCA GACGGCGAAG GATTTCGTCA CGAAGCTGTA CAGGAACGCG
GGCGTGCTCG ACTCGGGCGC GCGCGGCGCG ACGACGAGCT TCGTGCAGCG CGGCATCGGC
GACGTGCTGA TCGCGTGGGA AAACGAGGCG TTCCTGTCGA TCAAGGAATT CGGCGCCGAC
AAGTTCGAGA TCGTCGTGCC GTCGGCGAGC ATTCTCGCGG AGCCGCCGGT GGCGGTGGTC
GACAAGGTGG TCGACAAGAA GGGCACGCGC AAGCTCGCCG ACGCGTACCT GAACTTCCTG
TACAGCAGGC AAGGGCAGGA GATCGCCGCG CGCAACTACT ACCGGCCGCG CTCGCGGGAC
GTGCCGGCGG CGCTCACGAA GCAGTTCCCG AAGCTCAAGC TGTACACGGT CGACGACACG
TTCGGCGGCT GGACCCAAGC GCAGAAGACG CATTTCGCCG ACGGCGGCGT GTTCGATTCG
ATCTACAAGC CGCAGTGA
 
Protein sequence
MVKRNTGLAG GARRLIASLA LGAAAALGAL TPALADTTFL NVSYDPTREL YQDVNQAFGK 
EWKARTGETV NFKQSHGGSG AQARSVLDGL QADVVTLALA YDIDALANKG LVSKDWQKRL
PDNASPYTST IVFLVRKGNP KGIKDWDDLV KPGVSIVTPN PKTSGGARWN YLAAWAYAQH
QPGGTAQTAK DFVTKLYRNA GVLDSGARGA TTSFVQRGIG DVLIAWENEA FLSIKEFGAD
KFEIVVPSAS ILAEPPVAVV DKVVDKKGTR KLADAYLNFL YSRQGQEIAA RNYYRPRSRD
VPAALTKQFP KLKLYTVDDT FGGWTQAQKT HFADGGVFDS IYKPQ