Gene BURPS668_2112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2112 
Symbol 
ID4882006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2102559 
End bp2103776 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content67% 
IMG OID640128040 
Productmajor facilitator superfamily permease 
Protein accessionYP_001059147 
Protein GI126439977 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.954954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCAAG CAAAGGCAAG ACACCCGCTC CTCGTCTGGC TGCTGATCGT CGGCACCGGC 
TTCGTCGTGA TGGCGCGCGC GATGAGCCTG CCGTTTCTGG CGATCTACCT GCACGAACGG
ATGGGGCTCG ACGCGGCGAC GATCGGCCTG CTGCTCGGCA CGGGCGCACT CGTCGGCACG
TTCGGCGGCT TCTTCGGCGG CCATCTGTCC GACGTGCTCG GCCGGCGCAA GGTGCTGACC
GGCTGCCTGC TCGTATCGAG CCTGTCGTTC GCCGCGCTTC ATTTCGCGGC CGACGCGTGG
CAGGTCTTCG TGATCAACCT CTTCATCAAT CTCGCGAGCT CGTTCTACGA TCCGGTCTCG
AAAGCGACCA TCAGCGACAA TCTGCCGCCC GAGCAGCGGC TGCGCGCATT CGCGCGGCGC
TACGTGGCGA TCAACATCGG CTTCGCGATC GGGCCGCTGC TCGGCGCGTC GCTCGGCCTG
CTCGACAAAT CCCCCGTGTT CCTCATCACG GGCGCCGTCT ACCTGCTGTT CTCGATCGCG
ATCTACGCGA TCACGGCCCG GCTCGTGTTC GGCCGCGCGC CGCACGAAGC GGCCGCCTCC
GAGTTGCCGC TCGCCGCGAA GCTGCGCGTC ATCGGCACCG ACCGGCGCCT CGTCCTCTTC
ACCGCGGGTA GCATGCTCGC GATCGCCGTG CACGGCGAAA TGTCGGTCAC GTTCTCGCAA
TACCTGATCG GCGCGTTCGA CGACGGGCTC AAGATGTTCG CCTGGCTGAT GAGCACGAAC
GCGATCACGG TCGTCTTGAG CCAGCCGTTG CTGAACCGCA TCGGCGAACG GCGCGGGCCG
TTCACGTCGC TCACGCTCGG CGCGATCCTG CTCGCGATCG GCGCGGCCGG CTTCGCGAAT
TCGCCGAACA TGATCGCGCT CGTCGTGTCG ATGGTCGTCT TCACGTGGGG CGAAGTGCTG
CTGATCCCGT CGGAATACGC GGTCCTCGAC AGCATCACGC CCGAGCCGCT GCGCGGCATC
TATTACGGCG CGCATTCGCT CAGCAACGTC GGCAACCTGC TCGGGCCCTG GCTTGGCGGC
CTCGTGCTGC TGCACTACGG CGGCGCCGCG ATGTTCTACG GCATGGGCTT CATCGCGCTG
CTCAGCCTGC TCACGTTCGC CGTCGGCTCG CAGATCAAGC CCGCGCCGGC GGGCCGGCTC
GAAGTCCAGA ACCGCTGA
 
Protein sequence
MSQAKARHPL LVWLLIVGTG FVVMARAMSL PFLAIYLHER MGLDAATIGL LLGTGALVGT 
FGGFFGGHLS DVLGRRKVLT GCLLVSSLSF AALHFAADAW QVFVINLFIN LASSFYDPVS
KATISDNLPP EQRLRAFARR YVAINIGFAI GPLLGASLGL LDKSPVFLIT GAVYLLFSIA
IYAITARLVF GRAPHEAAAS ELPLAAKLRV IGTDRRLVLF TAGSMLAIAV HGEMSVTFSQ
YLIGAFDDGL KMFAWLMSTN AITVVLSQPL LNRIGERRGP FTSLTLGAIL LAIGAAGFAN
SPNMIALVVS MVVFTWGEVL LIPSEYAVLD SITPEPLRGI YYGAHSLSNV GNLLGPWLGG
LVLLHYGGAA MFYGMGFIAL LSLLTFAVGS QIKPAPAGRL EVQNR