Gene BURPS1710b_1034 details


Gene Information

Locus tag: BURPS1710b_1034
Symbol:
ID: 3688161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 1082197
End bp: 1083525
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 67%
IMG OID: 637727490
Product: extracellular solute-binding protein
Protein accession: YP_332446
Protein GI: 76810460
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
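
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 1082197
end_bp = 1083525

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values agree with the listed Gene Length (1329 bp) and Protein Length (442 aa).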


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGAA AGACGCTTAC CGCCGCCGCC GCACGCGTTG CCGCGTTCGC CGCGCTCGCC 
TCGTCGGCGC TCGCCGCGCA GGCGGCGACG CTGACGATCG CGACGCTCAA CAACCCGGAC
ATGATCGAGC TGAAGAAGCT GTCGTCCGCG TTCGAGAAGG CGAACCCGGA CATCCGGCTC
AACTGGGTGA TCCTCGAGGA GAACGTGCTG CGCCAGCGCG CGACGACCGA CATCACGACG
GGCAGCGGCC AGTTCGACGT GATGGCGATC GGCACGTACG AGGCGCCGCA GTGGGGCAAG
CGCGGCTGGC TCGCGCCGAT GTCGAACCTG CCCGCCGATT ACGATCTGAA CGACGTGATC
AAGACGGCGC GCGATTCGCT GTCGTACAAC GGCCAGTTGT ACGCGCTGCC GTTCTACGTC
GAAAGCTCGA TGACGTTCTA CCGCAAGGAC CTGTTCGCCG CGAAGGGCCT GAAGATGCCC
GAGCAGCCGA CTTACGAGCA GATCGCCGAA TTCGCCGACA AACTGACCGA CCGTGCGAAC
GGCACCTACG GGATCTGCCT GCGCGGCAAG GCGGGCTGGG GCGAGAACAT GGCGTACGTG
TCGACGGTCG TCAACACGTT CGGCGGCCGC TGGTTCGACG AGAACTGGAA CGCGCAGCTC
ACGTCGCCCG AGTGGAAGAA GGCGATCAAC TTCTACGTGA ACCTGCTCAA GAAGAACGGG
CCGCCGGGCG CGAGCTCGAA CGGCTTCAAC GAGAACCTGA CGCTCACCGC ATCGGGCAAG
TGCGCGATGT GGATCGACGC GACGGTCGCG GCGGGCATGC TGTACAACAA GCAGCAGTCG
CAGGTCGCGG AGAAGATCGG TTTCGCGGCC GCGCCGGTGG CCGCGACGCC GAAGGGCTCG
CACTGGCTGT GGGCGTGGGC GCTCGCGATT CCGAAGACGT CGAAGCAGCA GGATGCGGCG
AAGAAGTTCG TCACGTGGGC GACGTCGAAG CAGTACGTCG AGATGGTCGG CAAGGACGAG
GGCTGGGCGT CGGTGCCGCC GGGCACGCGC CAGTCCACCT ATCAGCGCGC CGAGTACAAG
GCCGCCGCGC CGTTCTCCGA GTTCGTGCTG AAGGCGATCC AGACCGCCGA TCCGACCGAT
CCGTCGCTGA AGAAGGTGCC GTACACGGGC GTGCAGTACG TCGGGATTCC TGAGTTCCAG
TCGTTCGGCA CGGTGGTCGG CCAGGCGATC GCGGGCGCGG TTGCCGGGCA GACGAGCGTC
GACCAGGCGC TCGCCGCGGG CCAGGCGGCG GCCGAGCGCG CGGTGCGCCA GGCCGGCTAC
CGCAAGTGA
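
The GC content field in the record is the share of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows; the function name `gc_content` is hypothetical, and whitespace is stripped first because the sequence is printed in blocks of ten bases. How the database rounds the percentage is not stated, so no particular rounding is assumed here.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces, newlines) is ignored so that block-formatted
    sequence text can be passed in directly.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the last line of the gene sequence above.
last_line = "CGCAAGTGA"
print(f"{gc_content(last_line):.2%}")
```

Passing the whole gene sequence as one string gives the value for the full gene.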
 
Protein sequence
MQRKTLTAAA ARVAAFAALA SSALAAQAAT LTIATLNNPD MIELKKLSSA FEKANPDIRL 
NWVILEENVL RQRATTDITT GSGQFDVMAI GTYEAPQWGK RGWLAPMSNL PADYDLNDVI
KTARDSLSYN GQLYALPFYV ESSMTFYRKD LFAAKGLKMP EQPTYEQIAE FADKLTDRAN
GTYGICLRGK AGWGENMAYV STVVNTFGGR WFDENWNAQL TSPEWKKAIN FYVNLLKKNG
PPGASSNGFN ENLTLTASGK CAMWIDATVA AGMLYNKQQS QVAEKIGFAA APVAATPKGS
HWLWAWALAI PKTSKQQDAA KKFVTWATSK QYVEMVGKDE GWASVPPGTR QSTYQRAEYK
AAAPFSEFVL KAIQTADPTD PSLKKVPYTG VQYVGIPEFQ SFGTVVGQAI AGAVAGQTSV
DQALAAGQAA AERAVRQAGY RK
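
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a hedged illustration rather than the database's own method: it builds the codon table from the compact amino-acid string used for the standard assignments (which translation table 11 shares for internal codons), and it does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, dropping trailing stops.

    Whitespace is ignored and incomplete trailing codons are skipped.
    Codons containing anything other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[p:p + 3]] for p in range(0, usable, 3))
    return protein.rstrip("*")


# First and last lines of the gene sequence in this record.
first_line = "ATGCAACGAA AGACGCTTAC CGCCGCCGCC GCACGCGTTG CCGCGTTCGC CGCGCTCGCC"
last_line = "CGCAAGTGA"
print(translate(first_line))  # MQRKTLTAAAARVAAFAALA
print(translate(last_line))   # RK
```

These two outputs match the first twenty and last two residues of the protein sequence listed in the record.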