Gene BURPS1710b_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1991 
SymbolssuB 
ID3691802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2163920 
End bp2164927 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content75% 
IMG OID637728447 
Productaliphatic sulfonate ABC transporter, ATP-binding protein 
Protein accessionYP_333387 
Protein GI76810819 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0937962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGGCA CCACGCTGGC CGCCACGTAC GGCCCGATCT CGGGCGCGGA CCTCGAGGCC 
GAGCTCGCGC AGCCGCGCAT CGCCGACGGT GATGCGCAGG ACGCCGCCGT GTACGAGCGC
GACGGCGGCG CGCACGCGCC GCCGTTCGCG TCGGGCGGCG CGCCGCCCGA CGGCGACCGC
GCCGATGTGC GGCGCGCAGC AGGAGCAGGC GACGCGTCGG TGCGCCTCAC GCGCGTGAGC
AAGCGCTACG GCGAGCGGGC CGTGCTCGCC GACGTCGATC TGTCGATCGG GCGCGGCAGT
TTCGTCTCGA TCGTCGGGCG CAGCGGCTGC GGGAAATCCA CGCTGCTGCG CCTCGTCGCG
GAGCTCGAGA CGCCGAGCGC CGGCACGCTC GTCAAGCGCG GCGACGGCGG CGGCGCGCTC
GATACGCGGA TCATGTATCA GGAGGCGCGC CTGTTGCCGT GGAAGACCGT GCTGCAGAAC
GTGATGCTCG GCCTCGGCCG GCGCGCGAAG GACGACGCGC GGGCGGTGCT CGACGAAGTC
GGGCTACTCG CGCGCGCGAA CGATTGGCCC GCACAACTCT CGGGCGGGCA GCGGCAGCGC
GTCGCGCTCG CGCGGGCGCT CGTCCATCGC CCGCAACTGT TGCTGCTCGA CGAGCCGCTC
GGCGCGCTCG ATGCGCTCAC GCGCATCGAA ATGCACGCGC TGATCGAGCG CCTGTGGCGC
GAGCATCGCT TCACCGCGCT GCTCGTCACG CACGACGTGC AGGAGGCGGT CGCGCTCGCC
GACAGGGTCC TGCTCATCGA AGCGGGCCGG ATCGCGTTCG ATCAGCGGGT GCCGCTCGAT
CGGCCGCGCG CGCGGGCGTC GGCGGCGTTC GCCGCGCTCG AGGATCGCGT GCTGCAGCGC
GTATTGACGG GCTCGGATGC CGCGCCCGCG GCGCCGAACG CTGCGGGCCC GGAGGGCGCG
TCGCGCGGCC GCGCCGCGCC GGCAAGCGGA TTGCGCTGGG CGGTATGA
 
Protein sequence
MTGTTLAATY GPISGADLEA ELAQPRIADG DAQDAAVYER DGGAHAPPFA SGGAPPDGDR 
ADVRRAAGAG DASVRLTRVS KRYGERAVLA DVDLSIGRGS FVSIVGRSGC GKSTLLRLVA
ELETPSAGTL VKRGDGGGAL DTRIMYQEAR LLPWKTVLQN VMLGLGRRAK DDARAVLDEV
GLLARANDWP AQLSGGQRQR VALARALVHR PQLLLLDEPL GALDALTRIE MHALIERLWR
EHRFTALLVT HDVQEAVALA DRVLLIEAGR IAFDQRVPLD RPRARASAAF AALEDRVLQR
VLTGSDAAPA APNAAGPEGA SRGRAAPASG LRWAV