Gene BTH_II2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2072 
Symbol 
ID3845128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2525888 
End bp2527288 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content69% 
IMG OID637839373 
Productmajor facilitator family transporter 
Protein accessionYP_440260 
Protein GI83716115 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.264718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCCGTG CGCCATCCCG CGCGTTGCAC CGTCAGTCCG ACGACGAGGC CCGCCGCCCG 
GCCCCATCCG CGCGGTACAC AACCATTCGA ACCAGGATCG CCCCGACCAT GCCTGCTTCC
GAACTGAGCG CCGTCTCAAG CGCCCCGCCC CGCGCCCTGA CGGGCCGCGA CTACAAAACC
CTCGGCCTCG CCGCGTTGGG CGGCGCGCTG GAGTTCTACG ATTTCATCAT CTTTGTGTTC
TTCGCGCCCG CGATCGGCCA GCTGTTCTTC CCGCACGACA TCCCCGACTG GCTTCGCCAG
TTGCAGACGT TCGGCATCTT CGCGGCCGGC TATCTCGCGC GGCCGCTCGG CGGCATCGTG
ATGGCGCACT TCGGCGACCT CGTCGGCCGC AAGCGGATGT TCACGCTGAG CGTGCTGCTG
ATGTCGGTGC CGACGCTGCT GATGGGCCTG CTGCCCACCT ACGACAGCGT CGGCATCCTC
GCGCCGGTCG CGCTGCTGCT GTTCCGCGTG CTGCAGGGCG CGGCGGTGGG CGGCGAAGTG
CCCGGCGCGT GGGTGTTCGT GTCCGAGCAC GTGCCGTCGC AGCGCATCGG CTACGCGTGC
GGCACGCTGA CGGCGGGCCT CACGATCGGC ATCCTGCTCG GCTCGCTCGT CGCGACGGCC
ATCAACAGCC GCTTCTCGAC AGCCGAAGTC GCCGCGTTCG CGTGGCGCAT CCCGTTCCTG
CTCGGCGGCG TGTTCGGCCT CTTCTCCGTC TACCTGCGCC GCTGGCTGCA CGAGACGCCC
GTGTTCGCCG AGATGAAGGC GCGCAAGACG CTCGCGGCCG AGATCCCGCT GAAGGCGGTG
ATTCGCGACC ACGGCCGCGC GGTGATCGTG TCGATGCTGA TCACGTGGAT GCTGTCGGCG
GCGATCGTCG TCGTGATCCT GATGACGCCG ACGCTGCTGC AAAAGCAGTT TCATATCGCA
CCCGCGACCG CGCTGTTCGC GAACAGCATC GCGACGCTGT GCCTGACGGC CGGCTGCATC
ACCGCCGGCT CGCTCGCGGA CCGCTTCGGC GCGAAGGCGG TGCTGTCGAT CGGCGGCATC
GCGCTCGCCG CGTGCTACTA CGCGATGTAC ACGCAGATCG CCGTCGACGC CTCGCGCCTC
GTGCCGCTCT ACGGGCTCGC GGGCTTCGCC GTCGGCACGA TCGGCGCGGT GCCGTTCGTG
CTGGTGAAGA GCTTTCCGGC CGTCGTGCGC TTCTCGGGCA TCTCGTTCTC TTACAACGTC
GCGTACGCGG TGTTCGGCGG GCTCACGCCG GTGATCGTGT CGCTGCTGAT GAAATCGAGC
CCGCTCGCGC CGGCCTACTA TGTCGCGGCG ATCTGCGTGC TCGGCGCGGT CGCGATGCCG
TTCGCGAAGG ACGCCGAATA A
 
Protein sequence
MSRAPSRALH RQSDDEARRP APSARYTTIR TRIAPTMPAS ELSAVSSAPP RALTGRDYKT 
LGLAALGGAL EFYDFIIFVF FAPAIGQLFF PHDIPDWLRQ LQTFGIFAAG YLARPLGGIV
MAHFGDLVGR KRMFTLSVLL MSVPTLLMGL LPTYDSVGIL APVALLLFRV LQGAAVGGEV
PGAWVFVSEH VPSQRIGYAC GTLTAGLTIG ILLGSLVATA INSRFSTAEV AAFAWRIPFL
LGGVFGLFSV YLRRWLHETP VFAEMKARKT LAAEIPLKAV IRDHGRAVIV SMLITWMLSA
AIVVVILMTP TLLQKQFHIA PATALFANSI ATLCLTAGCI TAGSLADRFG AKAVLSIGGI
ALAACYYAMY TQIAVDASRL VPLYGLAGFA VGTIGAVPFV LVKSFPAVVR FSGISFSYNV
AYAVFGGLTP VIVSLLMKSS PLAPAYYVAA ICVLGAVAMP FAKDAE