Gene BURPS668_A1513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1513 
Symbol 
ID4886292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1454336 
End bp1455901 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content68% 
IMG OID640131452 
Productputative carbohydrate ABC transporter, periplasmic sugar-binding protein 
Protein accessionYP_001062509 
Protein GI126443810 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACCGT TTGCGAAACC TCCATCGAAA CGCGCCGCGC CGCCGCCGCG CCGAATCATG 
CGGCGCGGCG GCCTTTCGTA CGGGTTTTCC TGCATGCCGG CGCGGAATTT CGCCCTCGCG
GAACGTTTCC GGGAGGATTT CGTTTCATTC GATGCCGCCC GGGCGGCGGC ATTTTCCGAT
TGGCCCGCAA TCAGCCGCTT TCGATTTAAC GGAGCGTCGT CCGGTTTATT TAGCATTGCG
CTGAAGTTTA TTTGCGATTG CCCGATTCCG ATCGAGACGA CACGGCGACG ATCGCGGGTT
CCCGCCGATA TCGGCATCGG CATCGACGCC GCGAAGCCGC CGGCTCGGCG AGCGCCCGCG
CCGGCCGCGT CCGATTCCCG CCGCGCATCT TTGCTTGTGC GGCCCGCGCA TTTGCGGCCA
TCCCATAACC AAAGCACGAA TCGATCGTTC CCTTCGCGGG CGTCGTTGCA CAGAATCGTT
CGTTCGCCTC CACGTCCGAA CCCAGACCCG CCGATCATGC CACGCCCGCT TCCGTCGCTT
GCCCGCCTGT TGTCCATGCT GTTCGCCGTC GCGCTCGCGA TCGGCGCCGC CCCCGCTTGC
GCGTCGCCCG CCGCCGGCGC TGCGCCGCCC GGGCCGCGCG CGGGCCATGC GCCGCTGTCG
CTCGCCGGCA AGCGGATCGG CATCACGGCG GCCGGCACCG ATCACTACTG GGATCTGCAG
GCGTACCAGG GCGCGGTGGA CGAAGTGAAG CGCCTCGGCG GCACGCCGAT CGCGCTCGAC
GCCGGCCGCA ACGACAGCCG CCAGATCGCG CAGATCCAGA CGCTGATCGC GCAACAGCCC
GATGCGATCA TCGAGCAGCT CGGCACCGCA TCCGTGCTCG AGCCGTGGCT CAGGAAAATC
CGGCAAGCGG GCATCCCGCT TTTCACGATC GACACCGCGT CGCCGTCGAG CCTGAACGTC
GTCACGTCGG ACAATTTCGC GATCGGCTCG CAGCTCGCGC TGAAGCTCGT CAACGATATC
CGCGGCGAAG GCAACGTCCT CGTGTTCAAC GGCTTCTACG GCGTGCCCGT GTGCGCGATC
CGCTACGACC AGCTGAAAGC CGTGCTGAAG TGGTATCCGA AGGTGAAGAT CATCGAGCCC
GAGCTGCGCG ACGTGATTCC GAACACGGCG CAGAACGCGT ACGCGCAGAT CAGCCAGTTG
CTGCAGAAGT ATCCGAAAGG CGCGATCTCG GCGATCTGGG CCGCGTGGGA CATTCCGCAG
GTCGGCGCGA CGCAGGCGGT CGACGCGGCC GGCCGACGCG AGATCCGCAC GTACGCGGTG
GACGGCAGCC CCGAGGCGGT CGCGCTCGTG AGGAATCCGA CCTCGAGCGC GGCGGCCGTC
GTCGCGCAGC AGCCGGCGCT GATCGGCCGC ACCGCCGTGC GCAACGTCGC GCGCTATCTG
GCGGGCGACC GATCGCTGCC CGCGTACACG TTCGTGCCGT CGGTGCTCGT CACGAAGGAC
GACGCGGGTG TCGCGCGGCC TGCGCTCGGG CAGACGCCGG CCGCCGCCGG GCTCGCGCGG
CGATGA
 
Protein sequence
MQPFAKPPSK RAAPPPRRIM RRGGLSYGFS CMPARNFALA ERFREDFVSF DAARAAAFSD 
WPAISRFRFN GASSGLFSIA LKFICDCPIP IETTRRRSRV PADIGIGIDA AKPPARRAPA
PAASDSRRAS LLVRPAHLRP SHNQSTNRSF PSRASLHRIV RSPPRPNPDP PIMPRPLPSL
ARLLSMLFAV ALAIGAAPAC ASPAAGAAPP GPRAGHAPLS LAGKRIGITA AGTDHYWDLQ
AYQGAVDEVK RLGGTPIALD AGRNDSRQIA QIQTLIAQQP DAIIEQLGTA SVLEPWLRKI
RQAGIPLFTI DTASPSSLNV VTSDNFAIGS QLALKLVNDI RGEGNVLVFN GFYGVPVCAI
RYDQLKAVLK WYPKVKIIEP ELRDVIPNTA QNAYAQISQL LQKYPKGAIS AIWAAWDIPQ
VGATQAVDAA GRREIRTYAV DGSPEAVALV RNPTSSAAAV VAQQPALIGR TAVRNVARYL
AGDRSLPAYT FVPSVLVTKD DAGVARPALG QTPAAAGLAR R