Gene BURPS1106A_A1266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1266 
Symbol 
ID4905624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1198385 
End bp1199692 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID640144372 
Productmajor facilitator family transporter 
Protein accessionYP_001075301 
Protein GI126458142 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCGATC GATATCGACG ACACGCTGAC ATGATGAATA AAAAGCCCCT GAGCCTGACG 
CAAATCGTGC TCGCGACTTC GGTCGGCAAC GCGCTCGAAT GGTTCGATAT CGCCATTTAT
GCGTTCTTCG CCGTGCATAT CGCGAAGAAT TTCTTCCCGA CCGCGAACGA GACGGCGTCG
ATGCTGCTCA CGTTCGGCTC GTTCGGCGCG TCGTATCTCG TGCGGCCGAT CGGCGGCATG
GTGCTGGGCG CCTACGCGGA CAAGCGCGGG CGCAAGGCGG CACTGATGAT GTCGGTCGGT
CTGATGATGA TCGGCACGGC GATCATCGCG GTGATTCCGC CACATGCGTC GATCGGCCTG
CTCGCGCCAG CAGGCGTGTT CATCGCACGG CTGATTCAGG GTTTTTCGGC GGGCGGCGAA
TTCGGCGCTT CGACGGCGAT GCTGATCGAA CATGCACCCG AGCGTCGCGG CCTGCTCGCA
AGCTGGCAGT TCGCGACGCA AGGCCTCGCG ACCCTGCTCG CGTCGACCTT CGGCTTCGCG
CTCGCCAAGC TGATGCCCGC CAGCGAGCTC TCCGCATGGG GCTGGCGCAT GCCGTTCTTC
TTCGGGCTGC TGATCGGGCC GGCGGGTCTG TATCTGCGAC GCTTTCTCGA AGATGCGGCC
GATTACACCG AAGCCGAGCA CACGGCCACG CCGGTGCGCG ATGTACTCAC GCGGCAGAAG
GCGTTGCTGC TGACCTCGAT CGGTGCGCTG ACGGTGTCGA CGGCGGTGAA CTACCTGTTG
CAGTACGTGC CGACGTTCGC GATCCGCGAA TTGCATCTCG ATGCATCGAC GGGTTTCGCC
GCGAGCATCG TCGCGGGGCT GATGCTGACC TTGGTCACGC CGTTCGCGGG GCATCTGTCC
GACAAAATCG GCCGCGTGAA GCAGATGTCG ACTGCGGCGC TGCTACTGTT CGTGACGGGT
TATCCGGCGT TCGCGTATGT CGTGTCGCAT GTGTCGGTAG CGGCGTTGTT CGGGCTCGTC
GCGTGGCTTG CGCTGCTCAA GGCGGTGTAT TTCGGCGCGC TGCCGGCGCT CATGTCGGAG
ATTTTCCCCA CATCGACGCG CGTGACGAGC ATTTCAATCA GCTACAACAT CGGCGTGACG
GTGTTCGGCG GTTTTACGCC GGCGATCGTC ATCTGGCTGT CGAGCGCGAC GGGCAGCAAG
GCGGCACCGA GCTTCTACAT GATGTTCACG GCCGTGATCA GTCTCGCGGC GCTGGCGGCA
GTGAGCCGCG GGAAAGAGCC GCTGAGCTCG GCGGGCACGG CGGCCTGA
 
Protein sequence
MTDRYRRHAD MMNKKPLSLT QIVLATSVGN ALEWFDIAIY AFFAVHIAKN FFPTANETAS 
MLLTFGSFGA SYLVRPIGGM VLGAYADKRG RKAALMMSVG LMMIGTAIIA VIPPHASIGL
LAPAGVFIAR LIQGFSAGGE FGASTAMLIE HAPERRGLLA SWQFATQGLA TLLASTFGFA
LAKLMPASEL SAWGWRMPFF FGLLIGPAGL YLRRFLEDAA DYTEAEHTAT PVRDVLTRQK
ALLLTSIGAL TVSTAVNYLL QYVPTFAIRE LHLDASTGFA ASIVAGLMLT LVTPFAGHLS
DKIGRVKQMS TAALLLFVTG YPAFAYVVSH VSVAALFGLV AWLALLKAVY FGALPALMSE
IFPTSTRVTS ISISYNIGVT VFGGFTPAIV IWLSSATGSK AAPSFYMMFT AVISLAALAA
VSRGKEPLSS AGTAA