Gene BURPS1106A_1016 details

Gene Information

Locus tag: BURPS1106A_1016
Symbol: cysD
ID: 4899735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 992932
End bp: 993897
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 68%
IMG OID: 640134246
Product: sulfate adenylyltransferase subunit 2
Protein accession: YP_001065296
Protein GI: 126453969
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes
TIGRFAM ID: [TIGR02039] sulfate adenylyltransferase, small subunit
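
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record states neither, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(992932, 993897)
print(length_bp)                  # 966
print(protein_length(length_bp))  # 321
```

Under those assumptions the results agree with the listed values of 966 bp and 321 aa.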


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGA CGCTCGAACA ATCCGCTTTT GCCCCGTCCG CCGGCGCGTC GACGAGCCGG 
ATGGGCCATC TCGACTGGCT CGAGGCCGAG TCGATCCACA TCCTGCGCGA GCTCGTCGCG
GAATGCAGCA AGCCGGCGCT GCTGTTCTCG GGCGGCAAGG ATTCGGTGGT CGTGCTGCAT
CTCGCGCTCA AGGCGTTCGG GCTCGGCGCG AACCGCAGGA CGACGCTGCC GTTTCCGCTC
GTGCACATCG ACACGGGCCA CAACTACGAC GAGGTGATCG ATTTCCGCGA CCGCCGCGCG
AAGCAGATCG GCGCGGAGCT GGTGGTCGGC CACGTCGAGG ATTCGATCGC GCGCGGCACG
GTGGTGCTGC GCCGCGAGAC GGATTCGCGC AACGCCGCGC AGGCGGTCAC GCTGCTCGAG
ACGATCGAGC GGCACGGCTA CACGGCGATG ATCGGCGGGG CGCGGCGCGA CGAAGAGAAG
GCGCGGGCGA AGGAGCGGAT TTTCTCGTTT CGCGACGAAT TCGGCCAGTG GGACCCGAAG
GCGCAGCGCC CGGAGCTGTG GAGCCTGTAC AACGCGCGGC TGCACCGGGG CGAACACCTG
CGGGTGTTCC CGATCTCGAA CTGGACGGAG CTCGACGTGT GGCAGTACAT CGCGCGCGAG
AAGCTGGAAC TGCCGTCGAT CTACTACGCG CATCGCCGGG AGATCGTGCG GCGCAACGGG
CTGCTCGTGC CGGTGACGCC GCTCACGCCG ATGCGCGAGG GCGAGACGAG CGAGCAGGCG
CTGGTGCGGT TCCGCACGGT GGGGGACATT TCGTGCACGT GCCCGGTCGA GAGCGACGCG
GACGACGTGG AGAAGATCAT CGCGGAGACG GCGGTGACGG AGATCACGGA GCGCGGGGCG
ACGCGGATGG ACGACCAGGC GTCGGAGGCC GCGATGGAGC AGCGCAAGAA GCAGGGCTAT
TTCTGA
 
Protein sequence
MSTTLEQSAF APSAGASTSR MGHLDWLEAE SIHILRELVA ECSKPALLFS GGKDSVVVLH 
LALKAFGLGA NRRTTLPFPL VHIDTGHNYD EVIDFRDRRA KQIGAELVVG HVEDSIARGT
VVLRRETDSR NAAQAVTLLE TIERHGYTAM IGGARRDEEK ARAKERIFSF RDEFGQWDPK
AQRPELWSLY NARLHRGEHL RVFPISNWTE LDVWQYIARE KLELPSIYYA HRREIVRRNG
LLVPVTPLTP MREGETSEQA LVRFRTVGDI SCTCPVESDA DDVEKIIAET AVTEITERGA
TRMDDQASEA AMEQRKKQGY F
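
The sequences above are displayed in space-separated blocks of ten, so they need to be cleaned before any computation such as GC content. A minimal sketch follows, using only the first line of the gene sequence as sample input; `clean_sequence` and `gc_percent` are hypothetical helper names, and the GC figure for this one line will not necessarily match the 68% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGAGCACGA CGCTCGAACA ATCCGCTTTT GCCCCGTCCG CCGGCGCGTC GACGAGCCGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence block into `clean_sequence` in place of `first_line` would give the value to compare against the GC content field.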