Gene BURPS1106A_A2864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2864 
Symbol 
ID4905068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2805898 
End bp2807139 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content66% 
IMG OID640145967 
Productmajor facilitator family transporter 
Protein accessionYP_001076893 
Protein GI126455534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAT CCACCATGCC GGCCGGCGGC GTCGCGATCC GGCTTGGTCT CAAGGAAAAC 
TGGCGCCAGT TCGCACTACT CGTGCTGATC AATGCGTTCG TGGGTGGCAT GGTGGGGATC
GAGCGGACGG TCGTGCCGCT GATCGGCTCC GAGACGTTTC ACATCCAGTC CACGACGCTC
ATTACATCGT TCATCGTCAG CTTCGGGCTG GTGAAAGCCG TGGCGAACCT GATTTCCGGT
CAACTGGCGG ACACCTGGGG CCGCAAGCGC GTGCTTGTGG CCGGCTGGCT GCTCGGGTTG
CCGGTGCCGT TCATGATCAT CGCCGCGCCG AACTGGGAAT GGGTGATCGC GGCCAATGTG
TTGCTGGGCC TCAGCCAGGG TTTTGCGTGG TCGATGACCG TGATCATGAA AGTGGATCTC
GTGGGGCCGA AGGCGCGCGG GCTCGCGGTC GGGCTCAACG AGTTCGCGGG CTATTTCGCG
GTGGGCCTGA CCGCGTTTCT GACCGGCTAC CTGGCGAGCC GCCACGGCCT GCGGCCGGCG
CCGATCTATC TCGGCGTCGC GTATGCGATC GCCGGCCTGA CCCTGTCGAT TCTCGTCGTG
CGCGATACGC GCGATCACGT TTGCCTGGAG GCCGGCAAGC CGAAAGAAGC AACGTCGCTG
TCGTTCCACG ACGTGTTCCT GCTCGCGTCG CTGAAGGACC GCAACCTGTT CGCGGCGTCG
CAGGCCGGGC TAATCAACAA CCTGAACGAC GGGATGAGTT GGGGCATCTT CCCGCTGTTT
TTCACGGGAC TCGGGCTCGG CGTCGAACGG ATCGGCATCC TCAAGGCCGC CTATCCGATC
GTGTGGGGCG TGTTTCAGGT CGTCACCGGC CCGTTGAGCG ACCGCTGGGG CCGCAAGGGG
CTGATCGTCG CCGGGATGTG GGTTCAGGCG GCCGGCCTGG TGCTGACCGC GTCGATGGGC
GAGTTCCGGT GGTGGCTGGT TGCCAGCGTG CTGCTCGGCC TCGGCACCGC GATGGTCTAC
CCGAGCCTGA TCGCGGCCGT CTCCGATGCG TCGGATCCGC GCTGGCGTGC CCGGGCGCTG
AGCGTGTACC GGTTCTGGCG TGACCTCGGC TATGCGATCG GCGCGCTGTC GGCGGGTCTC
ATCGCGGACC GCTTCGGCTT CGCCGATGCG ATCCTGTCGA TCGCGGCCGT CACGTTCCTG
TCAGGCGCCG TGGTGGCGAT CGTCATGCAC GCGCGCCACT GA
 
Protein sequence
MSKSTMPAGG VAIRLGLKEN WRQFALLVLI NAFVGGMVGI ERTVVPLIGS ETFHIQSTTL 
ITSFIVSFGL VKAVANLISG QLADTWGRKR VLVAGWLLGL PVPFMIIAAP NWEWVIAANV
LLGLSQGFAW SMTVIMKVDL VGPKARGLAV GLNEFAGYFA VGLTAFLTGY LASRHGLRPA
PIYLGVAYAI AGLTLSILVV RDTRDHVCLE AGKPKEATSL SFHDVFLLAS LKDRNLFAAS
QAGLINNLND GMSWGIFPLF FTGLGLGVER IGILKAAYPI VWGVFQVVTG PLSDRWGRKG
LIVAGMWVQA AGLVLTASMG EFRWWLVASV LLGLGTAMVY PSLIAAVSDA SDPRWRARAL
SVYRFWRDLG YAIGALSAGL IADRFGFADA ILSIAAVTFL SGAVVAIVMH ARH