Gene BURPS668_1849 details


Gene Information

Locus tag: BURPS668_1849
Symbol: cysW
ID: 4884451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1810361
End bp: 1811407
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 70%
IMG OID: 640127777
Product: sulfate/thiosulfate ABC transporter, permease protein CysW
Protein accession: YP_001058884
Protein GI: 126440089
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4208] ABC-type sulfate transport system, permease component
TIGRFAM ID: [TIGR00969] sulfate ABC transporter, permease protein
            [TIGR02140] sulfate ABC transporter, permease protein CysW
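
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1810361
END_BP = 1811407


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_bp):
    """Residues encoded by a CDS of gene_bp bases.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which contributes no residue.
    """
    codons, remainder = divmod(gene_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Under these assumptions the computed values agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields.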


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCG ATACCGCTCG ATCCCGCGCG GGCGCCGCGC GCGCCGACGC CGGCCCGGAA 
TTCGGCGCTC GATTCGGCCT TGAATCCGGC CTTGAATCCG GCCTTGAATC CGGCGATGAC
GCGCGCCGCG CCGCGCGCGC CCCGGCGCGA GCGCGCCGGC TCGACCCCGT GAGCGAGCCG
CGCGCGGTGC GCTGGCTGCT CACGGGCGCG GCGCTCGCGT TCCTCGCGCT CTTTCTCGTC
GTGCCGCTCG CCGCGGTGTT CTTCGAGGCG CTGCGCAAGG GCGTCGATTT CTATCTCGAA
TCGCTCGCCG ATCCGGACGC GTGGTCGGCG ATCAAGCTCA CGCTCGTCGT CGCCGCGATC
GCCGTGCCGC TCAATCTCGT GTTCGGCGTG TGCGCGTCGT GGGCGATCGC GAAGTTCGAG
TTCAAGGGCA AGGCGGTGCT GACGACGCTC ATCGATCTGC CGTTCTCGGT GTCGCCCGTG
ATCTCGGGCC TCGTCTACGT GCTGCTGTTC GGCGCGCAGG GCTGGCTCGG CCCGTGGCTG
CAGGCGCATG ACGTGCAGAT CATCTTCGCC GTGCCGGGCA TCGTGCTCGC GACGATCTTC
GTCACGTTCC CGTTCGTCGC GCGCGAGCTG ATTCCGCTGA TGCAGGCGCA GGGCGCCGAC
GAGGAGGAGG CCGCGCGCGT GCTCGGCGCA TCCGGCTGGC AGATCTTCCG GCGCGTCACG
CTGCCGAACG TGAAGTGGGG CCTGCTGTAC GGCGTGATTC TCTGCAACGC GCGCGCGATG
GGCGAGTTCG GCGCGGTGTC GGTGGTGTCG GGCCATATCC GCGGGCAGAC CGACACGATG
CCGCTGCACG TCGAGATTCT CTACAACGAG TACAACTTCG CGGCCGCGTT CGCGGTGGCG
TCGGTACTCG CGCTGCTCGC GCTCGTCACG CTCGCGCTCA AGCTGTTCGC CGAGCGGCGG
CTGTCCGCCG AACTCGCGCA CGGGCGCGAC GACGCGAGCG CCCCCGCCGC GCACCCGGGC
GCCGCCGTCA CTTCGTCGAT TTCGTAA
 
Protein sequence
MSRDTARSRA GAARADAGPE FGARFGLESG LESGLESGDD ARRAARAPAR ARRLDPVSEP 
RAVRWLLTGA ALAFLALFLV VPLAAVFFEA LRKGVDFYLE SLADPDAWSA IKLTLVVAAI
AVPLNLVFGV CASWAIAKFE FKGKAVLTTL IDLPFSVSPV ISGLVYVLLF GAQGWLGPWL
QAHDVQIIFA VPGIVLATIF VTFPFVAREL IPLMQAQGAD EEEAARVLGA SGWQIFRRVT
LPNVKWGLLY GVILCNARAM GEFGAVSVVS GHIRGQTDTM PLHVEILYNE YNFAAAFAVA
SVLALLALVT LALKLFAERR LSAELAHGRD DASAPAAHPG AAVTSSIS
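
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). The helper `translate` is hypothetical, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence (60 bases, in frame).
first_line = "ATGAGCCGCG ATACCGCTCG ATCCCGCGCG GGCGCCGCGC GCGCCGACGC CGGCCCGGAA"
print(translate(first_line))  # MSRDTARSRAGAARADAGPE

# Last four codons of the gene sequence, ending in the TAA stop codon.
print(translate("TCGATTTCGTAA"))  # SIS
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final three residues.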