Gene BURPS1106A_1680 details

Gene Information

Locus tag: BURPS1106A_1680
Symbol:
ID: 4902392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1635446
End bp: 1636558
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 65%
IMG OID: 640134910
Product: carbohydrate ABC transporter periplasmic sugar-binding protein
Protein accession: YP_001065951
Protein GI: 126453990
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
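
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1635446
end_bp = 1636558

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1113 bp) and Protein Length (370 aa) fields.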


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.824085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTCGTGA ACGCGGCCCG ACCCGCGTTC GCCCGAAAAC GACAGACAGA GAAAGGTGGA 
GACATGAGAC TTTGCACTGG CAAGGCCGTC CTGCGCGCAT GCGTCGCCGC GATCGCCGTC
GCGGCGGGCG TGGGCGGCGC GGCCCCGGCA GCCCAGGCGG CGGGCGCGCG CTTCGCGCTC
GTGAGCCACG CGCCCGATTC CGATTCGTGG TGGAACACGA TCAAGAACGC GATCAAGCAG
GCGGACGAGG ATTTCGATGT CACGACCGAT TACCGCAATC CGCCGAACGG CGACATCGCC
GACATGGCGC GCCTGATCGA GCAATCGGCC GCGCGCGACT ACGACGGCGT GATCACGACG
ATCGCCGATT ACGACGTGCT GAAGAATTCG CTGAGGAAAG TCACCGCGAA GAAGATCCCG
CTCGTGACGA TCAACTCCGG CACCGAAGAA CAGAGCGCGC AACTGGGCGC GATCATGCAT
GTCGGCCAGC CCGAGTACGT CGCGGGTCAC GCGGCGGGCG AGAAGGCGAA GGCGGCCGGC
GTGAAGCGCT TCCTCTGCGT GAACCACATC GCGACCAACA GCGTGTCGTT CGACCGCTGT
CGCGGCTTCG CCGACGCGAT CGGCGCCGAC TACAAGAGCT CGACGATCGA CTCCGGCCAG
GACCCGACCG AGATCCAGTC GAAGGTGAGC GCGTACCTGC GCAACCATCC GAACACGCAG
GCGATCCTCA CGCTCGGCCC GGTGCCCGCC GCGGCGTCGC TGAAGGCGGT GCAGCAGATG
GGCCTCGCGA ACAAGCTGTT CTTCGCGACG TTCGATTTCT CCGACGACAT CGCGAAGGCG
ATCCAGAGCG GCGCGATCAA GTTCGCGATC GACCAGCAGC CATACCTGCA GGGCTACATC
CCGGTGGCCG TGCTCGCGAT CGCGAAGCAG AACAAGACCA CCGATCCCGC GAAGATCCGC
CAGATCCTCG AGGCGAACCC GAAATTCCAG GCGCGGCTGT CGACCTACGG GCTGCAGCCG
TCGTACGGGC CGAAGAACAT CCGCTCGGGC CCGGGCTTCA TCACGAAGGA GAACCTCGAG
AAGGTGATCA AGTACGCGGG CCAGTACCGC TAA
 
Protein sequence
MLVNAARPAF ARKRQTEKGG DMRLCTGKAV LRACVAAIAV AAGVGGAAPA AQAAGARFAL 
VSHAPDSDSW WNTIKNAIKQ ADEDFDVTTD YRNPPNGDIA DMARLIEQSA ARDYDGVITT
IADYDVLKNS LRKVTAKKIP LVTINSGTEE QSAQLGAIMH VGQPEYVAGH AAGEKAKAAG
VKRFLCVNHI ATNSVSFDRC RGFADAIGAD YKSSTIDSGQ DPTEIQSKVS AYLRNHPNTQ
AILTLGPVPA AASLKAVQQM GLANKLFFAT FDFSDDIAKA IQSGAIKFAI DQQPYLQGYI
PVAVLAIAKQ NKTTDPAKIR QILEANPKFQ ARLSTYGLQP SYGPKNIRSG PGFITKENLE
KVIKYAGQYR
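
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it treats the listed alternative start codons as methionine when they open the gene; that start-codon set is an assumption stated from memory of table 11, not taken from this record. The function `translate` and its `is_gene_start` parameter are hypothetical helpers written for this example. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in TCAG order (standard amino acid assignments,
# which translation table 11 is assumed to share).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by table 11; each codes for M at the start.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, is_gene_start=False):
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and is_gene_start and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence in this record.
first_row = "TTGCTCGTGA ACGCGGCCCG ACCCGCGTTC GCCCGAAAAC GACAGACAGA GAAAGGTGGA"
last_row = "AAGGTGATCA AGTACGCGGG CCAGTACCGC TAA"

print(translate(first_row, is_gene_start=True))
print(translate(last_row))
```

The first row begins with TTG, which is why the protein sequence starts with M rather than L. The last row is in frame because the 18 preceding rows hold 1080 bases, a multiple of three.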