Gene BURPS668_A1342 details

Gene Information

Locus tag: BURPS668_A1342
Symbol:
ID: 4887894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1264039
End bp: 1265316
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 64%
IMG OID: 640131281
Product: major facilitator family transporter
Protein accession: YP_001062339
Protein GI: 126445565
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
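
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(1264039, 1265316)
print(length, protein_length_aa(length))  # 1278 425
```

Under those assumptions the result agrees with the Gene Length (1278 bp) and Protein Length (425 aa) fields.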


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGAATA GAAAGCCCCT GAGCCTGACG CAAATCGTGC TCGCGACTTC GGTCGGCAAC 
GCGCTCGAAT GGTTCGATAT CGCCATTTAT GCGTTCTTCG CCGTGCATAT CGCGAAGAAT
TTCTTCCCGA CCGCGAACGA GACGGCGTCG ATGCTGCTCA CGTTCGGCTC GTTCGGCGCG
TCGTATCTCG TGCGGCCGAT CGGCGGCATG GTGCTGGGCG CCTACGCGGA CAAGCGCGGG
CGCAAGGCGG CACTGATGAT GTCGGTCGGT CTGATGATGA TCGGCACGGC GATCATCGCG
GTGATTCCGC CACATGCGTC GATCGGCCTG CTCGCGCCAG CAGGCGTGTT CATCGCACGG
CTGATTCAGG GTTTTTCGGC GGGCGGCGAA TTCGGCGCTT CGACGGCGAT GCTGATCGAA
CATGCACCCG AGCGTCGCGG CCTGCTCGCG AGCTGGCAGT TCACGACGCA AGGCCTCGCG
ACCCTGCTCG CGTCGACCTT CGGCTTCGCG CTCGCCAAGC TGATGCCCGC CAGCGAGCTC
TCCGCATGGG GCTGGCGCAT GCCGTTCTTC TTCGGGCTGC TGATCGGGCC GGCGGGTCTG
TATCTGCGAC GCTTTCTCGA AGATGCGGCC GATTACACCG AAGCCGAGCA CACGGCCACG
CCGGTGCGCG ACGTACTCAC GCGGCAGAAG GCGTTGCTGC TGACCTCGAT CGGTGCGCTG
ACGGTGTCGA CGGCGGTGAA CTACCTGTTG CAGTACGTGC CGACGTTCGC GATCCGCGAA
TTGCATCTCG ATGCATCGAC GGGTTTCGCC GCGAGCATCG TCGCGGGGCT GATGCTGACC
TTGGTCACGC CGTTCGCGGG GCATCTGTCC GACAAAATCG GCCGCGTGAA GCAGATGTCG
ACCGCGGCGC TGCTACTGTT CGTGACGGGT TATCCGGCGT TCGCGTATGT CGTGTCGCAT
GTGTCGGTAG CGGCGCTGTT CGGGCTCGTC GCGTGGCTTG CGCTGCTCAA GGCGGTGTAT
TTCGGCGCGC TGCCGGCGCT CATGTCGGAG ATTTTCCCCA CATCGACGCG CGTGACGGGC
ATTTCAATCA GCTACAACAT CGGCGTGACG GTGTTCGGCG GTTTTACGCC GGCGATCGTC
ATCTGGCTGT CGAGCGCGAC GGGCAGCAAG GCGGCACCGA GCTTCTACAT GATGTTCACG
GCCGTGATCA GTCTCGCGGC GCTGGCGGCT GTGAGCCGCG GGAAAGAGCC GCTGAGCTCG
GCGGGCACGG CGGCCTGA
 
Protein sequence
MTNRKPLSLT QIVLATSVGN ALEWFDIAIY AFFAVHIAKN FFPTANETAS MLLTFGSFGA 
SYLVRPIGGM VLGAYADKRG RKAALMMSVG LMMIGTAIIA VIPPHASIGL LAPAGVFIAR
LIQGFSAGGE FGASTAMLIE HAPERRGLLA SWQFTTQGLA TLLASTFGFA LAKLMPASEL
SAWGWRMPFF FGLLIGPAGL YLRRFLEDAA DYTEAEHTAT PVRDVLTRQK ALLLTSIGAL
TVSTAVNYLL QYVPTFAIRE LHLDASTGFA ASIVAGLMLT LVTPFAGHLS DKIGRVKQMS
TAALLLFVTG YPAFAYVVSH VSVAALFGLV AWLALLKAVY FGALPALMSE IFPTSTRVTG
ISISYNIGVT VFGGFTPAIV IWLSSATGSK AAPSFYMMFT AVISLAALAA VSRGKEPLSS
AGTAA
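
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced this record: the helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical, and the codon table is written out by hand in the usual T, C, A, G order. Translation table 11 assigns the same amino acids to codons as the standard code, so that assignment is used here. The sketch assumes the input contains only the bases A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in T, C, A, G order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base groups of the gene sequence above.
first_bases = clean_sequence("ATGACGAATA GAAAGCCCCT GAGCCTGACG")
print(translate(first_bases))  # MTNRKPLSLT
```

The ten residues printed match the start of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.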