Gene BURPS1710b_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1044 
Symbol 
ID3689616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1091480 
End bp1092691 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content76% 
IMG OID637727500 
Productmajor facilitator family transporter 
Protein accessionYP_332456 
Protein GI76808583 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATT GCACGACGCG GCCCGCCGGC TTCGCGCGGC CGTCGCGCGA AGCCGCGCGC 
CTGCCGCTCG CGGGATTGCT CGCGCTCGCG ACGGCCGGCT TCATCACGAT CGTGACCGAG
GCGCTGCCCG CCGGGCTGCT GCCGCTGATG GGGCGCGACC TGCGCGTGTC CGATGCGCTC
GTCGGCCAGC TCGTCACAGT CTATGCGGCG GGCTCGATCG TCGCGGCGAT GCCGCTCGTC
GCGGCGACGC GCGGCATGCG CAGGCGGCCG CTGCTGCTCG CCGCGCTCGC GGGCTTCGTC
GTCGCGAACA CGGCGACGGC CGCGTCGCCG TACTACGCGC CCGTGCTCGT CGCGCGCTGC
GTCGCGGGCG TCTCGGCGGG GCTCCTGTGG GCGCTGCTCG CGGGCTACGC GAGCCGGATG
GTCGACGCGC GGCAGCGCGG CCGCGCGATC GCGATCGCGA TGCTCGGCGC GCCGGTGGCG
ATGTCGGTCG GCATTCCGCT CGGCACGGCG CTCGGCGCCG CGCTCGGCTG GCGCGCGACG
TTCGCCGGCG TGACGGCGCT CACGCTCGCG CTGATCGCGT GGGTGCGCGC GAGCCTGCCC
GATGCGCCGG GGCGGCCCTC GGGCGAGCGG CTGCCGGTCG CCCGCGTGCT GCGGATGCCG
GGCGTGCTGC CCGTGCTGGC GGTGATGTTC GCGTACGTGC TCGCGCACAA CATCCTCTAC
ACGTACATCG CGCCGTTTCT CGCGAGCGCC GGGATGGGCG CGCGCATCGA CGCGACGCTG
TTCGCGTTCG GCGCGGCGTC GTTCGCGGGC ATCGGTCTCA CGGGCGTGTG GATCGGCAAC
GGGCTGCGGC GGCTCGCGCT CGCGAGCATC GCGCTTTTCG CGCTTGCGTC CGTGCTGCTC
GGCGTGGCGA GCGGATCGCC CGCGGTCGTC TATGCGAGCG TCGCCGTGTG GGGGCTCACG
TTCGGCGGCG CGGCGACGGT CTTCCAGACC GCGTCGGCGA ACGCGGCGGG CGAGGCGGCG
GACGTCGCGC AATCGATGAT CGTCACGGTG TGGAATCTCG CGATCGCGGC CGGCGGCGTC
GCGGGCGGCG TGCTGCTCGA GCGGTTCGGC GCGGGCGCGA TGCCGTGGGC GCTCGTCGCG
CTGCTCGTGC CCGCGTGGCT CGGCGCGTGG CGCGCGCGGC GCCACGGCTT CCCGGCGGCC
CGCGCGCCGT GA
 
Protein sequence
MSDCTTRPAG FARPSREAAR LPLAGLLALA TAGFITIVTE ALPAGLLPLM GRDLRVSDAL 
VGQLVTVYAA GSIVAAMPLV AATRGMRRRP LLLAALAGFV VANTATAASP YYAPVLVARC
VAGVSAGLLW ALLAGYASRM VDARQRGRAI AIAMLGAPVA MSVGIPLGTA LGAALGWRAT
FAGVTALTLA LIAWVRASLP DAPGRPSGER LPVARVLRMP GVLPVLAVMF AYVLAHNILY
TYIAPFLASA GMGARIDATL FAFGAASFAG IGLTGVWIGN GLRRLALASI ALFALASVLL
GVASGSPAVV YASVAVWGLT FGGAATVFQT ASANAAGEAA DVAQSMIVTV WNLAIAAGGV
AGGVLLERFG AGAMPWALVA LLVPAWLGAW RARRHGFPAA RAP