Gene BURPS668_A2196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2196 
Symbol 
ID4888069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2127697 
End bp2129016 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content67% 
IMG OID640132133 
ProductMFS transporter, metabolite:H+ symporter (MHS) family protein 
Protein accessionYP_001063190 
Protein GI126445410 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.563103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCAAG TCCGCTCTCT TGCTACTGCC CCCCATTGCC CGCCGGCCGG CACCCTCGCC 
CGCCTTCGTT CGATCTTCAG CGGATCGGTC GGCAATCTGA TCGAATACTA CGACTGGTAC
GTGTACTCGG CGTTCTCGCT GTACTTCGCG AAGGTGTTCT TTCCGTCCGG CAGCCAGACC
GTGCAGTTGC TCAACACCGC GGCGATCTTC GCCGTCGGCT TCGTGATGCG GCCCATCGGC
GGCTGGCTCG TCGGCCTGTA CGCGGACCGC AAGGGGCGCA AGGCCGCGCT GCTGGTCTCG
GTGCTCGCGA TGTGCGCCGG CTCGCTGATC ATCGGCCTCA CGCCCGGCTA CGGCAGCATC
GGCATCGCGG CGCCCGTGCT GCTCGTTCTC GCGCGGCTGC TGCAGGGCTT GAGCCTCGGC
GGCGAATACG CGAGCTCGGC CACCTATCTG AGCGAGATGG CCGACAAGAC CAACCGCGGC
TTCTATTCGA GCTTCCTGTT CGCGACGCTG TCGCTCGGCC AGTTGCTCGC GATGGCGGTG
CTCGTCGCGC TGCAGCAGTT CTTCCTGAGC GCCGCGCAAC TCGAAAGCTG GGGCTGGCGC
ATTCCGTTCC TGATCGGCTC GCTCGCGGCG GGCGTCGCGA TCTTCCTGCG CCGGAACATG
GAGGAAACGG AATCGTTCGA GCAGCACCGG CAGAGCCGGC GCAGCCGCAC GTCGGTCGCC
GAGCTGTTCC GGCACAAGCG CGCGTGCCTG ATCGTCGCGG GCCTGACGCT CGGCGGCACC
GTCGCGTTCT ACGCATACAC GACGTACATG CAGAAATTCC TCGTCAACAG CGCGGGGATG
AGCAAGGCGG ATGCGTCGAT GGTGTCCGTC GCGAGCCTGA TCGCGTTCGT GCTGATGCAG
CCGGTGTTCG GCAGCCTGTC GGACCGCGTC GGGCGGCGCC CGCTGCTGAT CGCCTTCGGC
GTGCTCGGCA CGCTGTGCAC GGTTCCGATC TTCAGCGCGC TGACGACGGT CAGGACGATG
GGCGGCGCGC TCGCGCTGAT CTCCGCGGCG CTGCTGATCG TGAGCCTCTA TTCGTCCGTC
AGCGCGGTCG CGAAAGCCGA GCTGTTCCCG GTCGAGATCC GCGCGCTCGG CGTCGGGCTG
CCCTACGCGA TCACGGTGTC GCTGTTCGGC GGCACGGCCG AGTACATCGC GCTGTGGACC
AAGAGCATCG GCCACGAGAC CTGGTTCTTC TGGTACGTGT CCGGGTGCGT GCTGGTGTCG
CTGCTGTGCT ATCTGTGGAT GCCCGATCCG AAGACGGTCT CCTGCATCGA TCGGGACTGA
 
Protein sequence
MSQVRSLATA PHCPPAGTLA RLRSIFSGSV GNLIEYYDWY VYSAFSLYFA KVFFPSGSQT 
VQLLNTAAIF AVGFVMRPIG GWLVGLYADR KGRKAALLVS VLAMCAGSLI IGLTPGYGSI
GIAAPVLLVL ARLLQGLSLG GEYASSATYL SEMADKTNRG FYSSFLFATL SLGQLLAMAV
LVALQQFFLS AAQLESWGWR IPFLIGSLAA GVAIFLRRNM EETESFEQHR QSRRSRTSVA
ELFRHKRACL IVAGLTLGGT VAFYAYTTYM QKFLVNSAGM SKADASMVSV ASLIAFVLMQ
PVFGSLSDRV GRRPLLIAFG VLGTLCTVPI FSALTTVRTM GGALALISAA LLIVSLYSSV
SAVAKAELFP VEIRALGVGL PYAITVSLFG GTAEYIALWT KSIGHETWFF WYVSGCVLVS
LLCYLWMPDP KTVSCIDRD