Gene BURPS1710b_A2575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2575 
Symbol 
ID3693304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp3097484 
End bp3098740 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID637732829 
Productcupin family protein 
Protein accessionYP_337725 
Protein GI76818513 
COG category[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG2140] Thermophilic glucose-6-phosphate isomerase and related metalloenzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR03404] bicupin, oxalate decarboxylase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.97464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC TGTCCCGACG CAAGATGCTA GCCGGCACGG CCGGCGCGCT CGCCGCCGCC 
GGCATCGCCG TTTCCGCGAA GGCCGCTTCG TTCGGCAATC CGGACAGCCC GGCCGAGGGC
GCGGTGAACG CGCGCAATCC GCAAAGCCTG ACCGATCCGG GGCCGCAAAA TCCGGCGCTG
ATGAAGGAGT TTCCTTCGTT TCAGAGTCCG CCGGCCACCG ACATTAACGG CATGCCGATT
TTCTGGGCGT CGTTCAATAA TGCGCACAAG CGCATTCAGA ACGGCGGCTG GGCGCGCGAA
GTCACGCAGG AGGATTTCGC GATTTCCGAA ACGATTTCCG GCGTCAACAT GCGGCTCGCG
CGCGGCGGCA TTCGCGAGAT GCACTGGCAC CAGCAGGCCG AATGGGCGTT CATGCTCGAC
GGCCGCTGCC GGATCACCGT GCTCGATGAA GAGGGGCGGC CGTCCGTGCA GGACGTGAAG
ACGGGCGACC TCTGGTATTT CCCGCCGGGG CTGCCGCATT CGCTGCAGGG GCTCGGCGTC
GACGGCGCCG AATTCCTGCT CGCGTTCGAC AACGGCCGCG CGTCCGAATT CAACACGCTG
CTCGTGACGG ACTGGATCGC GCACACGCCG CCCGACGTGC TCGCGCTGAA CTTCGGCGTG
CCCGCCGATG CATTCCGCCG CATTCCGCTC GACAATCTGT GGATCTTCCA GGGCGACGAT
CCCGGGCCGC TCGCCGCCGC GCAGCGCGCG TCGGCGTCGT CGCGCGGCGC GCCGAAGCAT
CCGTTCATCT TCTCGATGGG CGACATGAAG CCGAACGTGA AGACGCGCGG CGGCGAAGTG
CGGATCGTCG ACAGCACGAA CTTCGCTGTG TCGAAGACGA TCGCGGCCGC GCTCGTCACG
GTGAAGCCGG GCGGCATGCG CGAGCTGCAC TGGCACCCGA ACGCGGACGA GTGGCAGTAC
TACATCCGGG GCGACGCGCG CATGACGGTG TTCGACACCG GCCCGAAGGC GCAGACGGCC
GATTTCCGCG CGGGCGACGT CGGCTACGTG AAGAAGAGCC TCGGCCACTA CGTGCAGAAC
ACGGGCACGA CCGATCTCGT GTTCCTCGAG ATCTTCAAGG CGGACCGCTA CGCGGAGGTT
TCGCTGTCCG ACTGGCTCGC GCACACGCCG CCGCAGCTCG TCGAGGCGCA TCTGCATATC
GCGCCCGACG TGATCGCGCG CTTTCCGCGC AACCGGCCGG ACGTCGTGCC GGCGTAA
 
Protein sequence
MTNLSRRKML AGTAGALAAA GIAVSAKAAS FGNPDSPAEG AVNARNPQSL TDPGPQNPAL 
MKEFPSFQSP PATDINGMPI FWASFNNAHK RIQNGGWARE VTQEDFAISE TISGVNMRLA
RGGIREMHWH QQAEWAFMLD GRCRITVLDE EGRPSVQDVK TGDLWYFPPG LPHSLQGLGV
DGAEFLLAFD NGRASEFNTL LVTDWIAHTP PDVLALNFGV PADAFRRIPL DNLWIFQGDD
PGPLAAAQRA SASSRGAPKH PFIFSMGDMK PNVKTRGGEV RIVDSTNFAV SKTIAAALVT
VKPGGMRELH WHPNADEWQY YIRGDARMTV FDTGPKAQTA DFRAGDVGYV KKSLGHYVQN
TGTTDLVFLE IFKADRYAEV SLSDWLAHTP PQLVEAHLHI APDVIARFPR NRPDVVPA