Gene BURPS1106A_0885 details


Gene Information

Locus tag: BURPS1106A_0885
Symbol:
ID: 4900546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 868206
End bp: 869417
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 76%
IMG OID: 640134115
Product: major facilitator family transporter
Protein accession: YP_001065166
Protein GI: 126453044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
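
The start, end, gene length, and protein length fields above are related by simple arithmetic, sketched here in Python. The function names are hypothetical, and the sketch assumes inclusive coordinates and an unspliced CDS whose single terminal stop codon is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon.

    Assumption: the stop codon is part of the gene length but is not
    counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(868206, 869417)
print(length, protein_length_aa(length))  # prints: 1212 403
```

Both results agree with the Gene Length and Protein Length fields of this record.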


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.106194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGATT GCACGACGCG GCCCGCCGGC TTCGCGCGGC CGTCGCGCGA AGCCGCGCGC 
CTGCCGCTCG CGGGATTGCT CGCGCTCGCG ACGGCCGGCT TCATCACGAT CGTGACCGAG
GCGCTGCCCG CCGGGCTGCT GCCGCTGATG GGGCGCGACC TGCGCGTGTC CGATGCGCTC
GTCGGCCAGC TCGTCACAGT CTATGCGGCG GGCTCGATCG TCGCGGCGAT GCCGCTCGTC
GCGGCGACGC GCGGCATGCG CAGGCGGCCG CTGCTGCTCG CCGCGCTCGC GGGCTTCGTC
GTCGCGAACA CGGCGACGGC CGCGTCGCCG TACTACGCGC CCGTGCTCGT CGCGCGCTGC
GTCGCGGGCG TCTCGGCGGG GCTCCTGTGG GCGCTGCTCG CGGGCTACGC GAGCCGGATG
GTCGACGCGC GGCAGCGCGG CCGCGCGATC GCGATCGCGA TGCTCGGCGC GCCGGTGGCG
ATGTCGGTCG GCATTCCGCT CGGCACGGCG CTCGGCGCCG CGCTCGGCTG GCGCGCGACG
TTCGCCGGCG TGACGGCGCT CACGCTCGCG CTGATCGCGT GGGTGCGCGC GAGCCTGCCC
GATGCGCCGG GGCGGCCCTC GGGCGAGCGG CTGCCGGTCG CCCGCGTGCT GCGGATGCCG
GGCGTGCTGC CCGTGCTGGC GGTGATGTTC GCGTACGTGC TCGCGCACAA CATCCTCTAC
ACGTACATCG CGCCGTTTCT CGCGAGCGCC GGGATGGGCA CGCGCATCGA CGCGACGCTG
TTCGCGTTCG GCGCGGCGTC GTTCGCGGGC ATCGGTCTCA CGGGCGTGTG GATCGGCAAC
GGGCTGCGGC GGCTCGCGCT CGCGAGCATC GCGCTTTTCG CGCTCGCGTC CGTGCTGCTC
GGCGTGGCGA GCGGATCGCC CGCGGTCGTC TATGCGAGCG TCGCCGTGTG GGGGCTCACG
TTCGGCGGCG CGGCGACGGT CTTCCAGACC GCGTCGGCGA ACGCGGCGGG CGAGGCGGCG
GACGTCGCGC AATCGATGAT CGTCACGGTG TGGAATCTCG CGATCGCGGC CGGCGGCGTC
GCGGGCGGCG TGCTGCTCGA GCGGTTCGGC GCGGGCGCGA TGCCGTGGGC GCTCGTCGCG
CTGCTCGTGC CCGCGTGGCT CGGCGCGTGG CGCGCGCGGC GCCACGGCTT CCCGGCGGCC
CGCGCGCCGT GA
 
Protein sequence
MSDCTTRPAG FARPSREAAR LPLAGLLALA TAGFITIVTE ALPAGLLPLM GRDLRVSDAL 
VGQLVTVYAA GSIVAAMPLV AATRGMRRRP LLLAALAGFV VANTATAASP YYAPVLVARC
VAGVSAGLLW ALLAGYASRM VDARQRGRAI AIAMLGAPVA MSVGIPLGTA LGAALGWRAT
FAGVTALTLA LIAWVRASLP DAPGRPSGER LPVARVLRMP GVLPVLAVMF AYVLAHNILY
TYIAPFLASA GMGTRIDATL FAFGAASFAG IGLTGVWIGN GLRRLALASI ALFALASVLL
GVASGSPAVV YASVAVWGLT FGGAATVFQT ASANAAGEAA DVAQSMIVTV WNLAIAAGGV
AGGVLLERFG AGAMPWALVA LLVPAWLGAW RARRHGFPAA RAP