Gene BURPS1106A_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2044 
Symbol 
ID4900437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2027478 
End bp2028800 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content69% 
IMG OID640135274 
ProductMFS transporter, metabolite:H+ symporter (MHS) family protein 
Protein accessionYP_001066309 
Protein GI126453686 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGCAAT CCGCAGTTCC CCTCGACACG GGCACGGCCG TGTCGCCCCC TACGTCGAGC 
GCCGGGCGAG CAATCGCCGC GGCGTCGATC GGCAACGCGC TCGAGTGGTA CGACTTTTCC
GTCTATGCAT TCTTCGCCGT CTACATCGCG CGGAACTTCT TCCATCGAGG CAACACCGGC
ACCGAGCTGG TCGAGGCTTT CATGGCGTTC GGCATCGGCT TCATCGCACG GCCGCTCGGC
GCGCTCGCGA TCGGCGTATA CGGCGACCGC GCGGGCCGCA AGGCCGCGCT CACGCTGACC
ATCCTCGTGA TGGCAACCGG CACGGGCGTC ATCGCGTTCG CGCCGCCATA CGCCGCGATC
GGCGTGGGCG CGCCGCTGCT GATCCTCTGC GGGAGGCTAC TGCAGGGCTT CTCGGCGGGC
GGCGAAGTGG GCGGCGCGGC GGCGTTTCTC ATCGAGCACG CGCCGGCGGA CCGCAAGGGC
TGCTACGCGT CGTGTCTGCA GGCGAGCATG GCCGCGTCGA ACATCCTCGG CGCGCTGGTC
GCGACCGGCG TGACGCTTAC GCTGACGCGC GAACAGATCG GCGATTGGGG ATGGCGGATT
CCGTTCATCC TCGGCCTCGC GATCGCGCCG GTCGGCCTCT GGCTGCGCAG GACGCTCGAC
GAGACGCCGC ACTTCCGCGC CGAGATGGCG CGCGCGCAGC ACGCGCATGC GGAACAGAAA
GCGCCGCTTC TGCAGGTGGT GCGCGACCAC CCGCGCGCGC TCGCCGTCGG CACGGGATTC
TCGGTGCTCT GGGCCGTGTG CGTCTACGCG CTGGTGATCT ATATGCCGAC GCACGCGCAG
CGCGCACTGC ATTTCGACGG GCGCGACGCG TTCATCGCGT CGCTGGTCGG CAACTGCCTG
ATGGCCGTCA CCTGCGTGTG CGCGGGAAGC TGGTCCGACC GCCTCGGCCG GCGCACGGTG
CTCGCCGCCG GCGCGGCGCT GATGCTCGTG TCGGTCTATC CGCTGCTGCG CTGGCTGAGC
GACGTGCACA CGCTCGCCGC GCTCCTTACC GTCCAGAGCG CGTTCTGCGT GTTGGTGGCC
ATCTTCACGG GAGTGGCGCC CGCAGCGCTG TCGGAGCTGT TCCCGACCCG CGTACGTGCG
ACCGGCATGT CCCTGTCCTA CAACATCGCC ACGACGATCT TCGGCGGCTT CGCGCCCGCG
ATCCTCGCAT GGCTCACGCA ACAGACCGGC AATCCGTTTG CGCCGGCCTG GTACGTGATG
GTGGCGAGCG CCATCGCGCT CGCATCGATC GCCGCGCTTT CTTCCACGCC ACGCCACGCC
TGA
 
Protein sequence
MKQSAVPLDT GTAVSPPTSS AGRAIAAASI GNALEWYDFS VYAFFAVYIA RNFFHRGNTG 
TELVEAFMAF GIGFIARPLG ALAIGVYGDR AGRKAALTLT ILVMATGTGV IAFAPPYAAI
GVGAPLLILC GRLLQGFSAG GEVGGAAAFL IEHAPADRKG CYASCLQASM AASNILGALV
ATGVTLTLTR EQIGDWGWRI PFILGLAIAP VGLWLRRTLD ETPHFRAEMA RAQHAHAEQK
APLLQVVRDH PRALAVGTGF SVLWAVCVYA LVIYMPTHAQ RALHFDGRDA FIASLVGNCL
MAVTCVCAGS WSDRLGRRTV LAAGAALMLV SVYPLLRWLS DVHTLAALLT VQSAFCVLVA
IFTGVAPAAL SELFPTRVRA TGMSLSYNIA TTIFGGFAPA ILAWLTQQTG NPFAPAWYVM
VASAIALASI AALSSTPRHA