Gene BURPS1106A_A2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2507 
Symbol 
ID4905291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2469163 
End bp2470368 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content74% 
IMG OID640145611 
Productmajor facilitator transporter 
Protein accessionYP_001076538 
Protein GI126457528 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCCGC TCCGATCGAT TCTCCCGTTG GCCCTGTTCA CCGCCGTCGG GCTGCTCGCC 
ACCGACCTCT ATCTCCCCGC CGTGCCGTCG TTGCCGCAGC AGCTCGGCGG CTCCATCGAA
AGCGCGCAGG CGACGCTCGC CGCGTTCTCG GCCGCGCTCG CCGTGTCGCA GCTCGTCTGG
GGCGCGGCCG CCGACCGCTT CGGGCACCGC CGCACGCTCG CGTTCGCGGT GCTGCTGCAA
CTCGTCGCGG GCGCCGCGTG CGCGCTCGCG CCTTCGATGG GCGCGCTGAT CGGCGCGCGC
CTCGCGCAGG GCTTCGGCGT CGGCGCGGCG ATGGTCATCG TCCCCGCGCT CGTGCGACAG
TCGTTCGGCG ACGGCGGCGC GGTCCGCGCG CTCGCATGGC TCGGCATCGT CGAAAGCGCG
GTGCCCGGAC TCGCGCCCCT CGTCGGCGCG GCGCTGCTCG TCGTGGCCGA CTGGCGAACG
AGCTTCTGGA TCATCGTCGC GTTGTCCGCC ATCGCGGCGC CGCTCGTGTT CCGCGTGATT
CCGACGGCTC GCGCGATGCG CGCGTGTGCG CCGGCGAACG TCGGCGCACA CGCGGGCGGC
TATCGGCGGC TGCTGCGCTC GCCCGTCTAT CTCGGCTACG CGCTCGGCCA CGCGCTCTGC
TTCGCCGCGC TGCTCGCGTT CGTCGCGAGC GCGCCGCAAG TCGTCGAGAT CTGGCTCGGC
GCGGGGCCGT CGACGTTCAG CCTGATGCAG GCGTGCGGGG TCGCCGCGTT CATGCTGAGC
GCCGCGCGCA GCGGCAAATG GTCCGACGCG CTCGGCCTCG ACCGGATCAT CGCGCTCGGC
GCGCTGCTGC AGTTCGCGGC GTCGGCCGCG TTCCTGCTGC TCGCGTATGC CGATTGGCGC
TCGACGCCGC TCGTCGTCGC ATCGTGGATG CTGTTCTGCG GCTCGCTCGG CCTGCGCGGG
CCGGCGTCGA TGGCGCGCGC GCTCGCGGCC GAGCCCGCCG TCGCGGGACG CGCGGCCGGG
CTGCTGATGT TCTTCGGGCT CGGCGGCGCG GCGCTCGCGA CACAGGCCGT CGCGCCGTTC
CTGCGGCTGG GGCTCGCGCC CGTCGCGTGG ATGTGCGCGG GCTTCACGCT CGCGAGCGGC
GCGGTCGTGC TGTGGGGCAT CGCGATACGC GGCCGACATC GCGCGGCGGC CACGGAAATC
GCGTGA
 
Protein sequence
MHPLRSILPL ALFTAVGLLA TDLYLPAVPS LPQQLGGSIE SAQATLAAFS AALAVSQLVW 
GAAADRFGHR RTLAFAVLLQ LVAGAACALA PSMGALIGAR LAQGFGVGAA MVIVPALVRQ
SFGDGGAVRA LAWLGIVESA VPGLAPLVGA ALLVVADWRT SFWIIVALSA IAAPLVFRVI
PTARAMRACA PANVGAHAGG YRRLLRSPVY LGYALGHALC FAALLAFVAS APQVVEIWLG
AGPSTFSLMQ ACGVAAFMLS AARSGKWSDA LGLDRIIALG ALLQFAASAA FLLLAYADWR
STPLVVASWM LFCGSLGLRG PASMARALAA EPAVAGRAAG LLMFFGLGGA ALATQAVAPF
LRLGLAPVAW MCAGFTLASG AVVLWGIAIR GRHRAAATEI A