Gene BURPS668_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1208 
Symbol 
ID4885110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1189465 
End bp1190733 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content71% 
IMG OID640127136 
Productmajor facilitator transporter 
Protein accessionYP_001058257 
Protein GI126441788 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTCC GGCATACGGT CAGCGCGCGT AGCCTGCGCG CCCTCGACTG GCTCAACTTC 
TTCGTCGCGA ACGTGCAGAC AGGCTTCGGT CCGTTCATCG CGTCGTATCT CGCGTCGCAC
AAGTGGACGC AGGGCGAAAT CGGCATGGTG CTGTCGATCG GCACGATCAG CGCGATGGTG
AGCCAGGTGC CCGGCGGCGC GGCCGTCGAT GCGCTGAAGA ACAAGAAAGG CGCCGCCGCG
TGGGCGATCG CCGCGATCAT CCTGTCCGCG GTGCTGCTCG CCGCGAGCCC GACCGTCGTG
CCCGTGATCG CGGCCGAGGT GTTCCACGGC TTCGCGAGCT GCATGCTCGT GCCGGCAATG
GCGGCGATCT CGTTCGCGCT CGTCGGCCGC GAGAGCCTGG GCGACCGGCT CGGCCGCAAC
GCGCGCTGGG CGTCGCTCGG CAGCGCGGTC GCGGCGGGTC TGATGGGGCT CACGGGCGAG
TACTTCTCCG CGCGCGCGGT GTTCTGGCTG ACGGCGGCCC TCGCGCTGCC CGCGCTCGTC
GCGCTCGCGA TGATCGAGCC GACGCACCAT CATCATCACG CGGCGCCACG CGCGTCGGCG
CGAAGCGACG AAGCCGACGA AGACGAAGAC GAAGAACGCG AAACGCTGCG CGAACTGCTG
CGCGACAAGC GGATGCTGAT CTTCGCCGCC TGCGTCGTGT TGTTCCATCT GTCGAACGCG
GCGATGCTGA ACCTCGCCGC GGGCGAAGTG ACGGCGGGCA TGGGCGAGAA CGTGCAGCTC
GTGATCGCCG CGTGCATCAT CGTCCCGCAG GCGATCGTCG CGATGCTCTC GCCGTGGGTC
GGACGCTCCG CGCAGCGCTG GGGCCGCCGG CCGATCCTGC TGCTCGGTTT CGCCGCGCTG
CCGCTGCGCG CGCTGCTGTT CGCCGGCGTC TCGAGCCCGT ACCTGCTCGT GCCGGTGCAG
ATGCTCGACG GCATCAGCGC CGCCGTGTTC GGCGTGATGC TGCCGCTCAT CGCGGCGGAC
GTCGCGGGCG GCAAGGGGCG CTACAACCTG TGCATCGGGC TCTTCGGACT CGCGGCGGGC
GTCGGCGCGA CGCTCAGCAC CGCGCTCGCC GGCTTCGCGG CCGACCACTT CGGCAACGCG
ATGAGCTTCT TCGGGCTCGC CGCCGCGGGC GCGCTCGCGA CGCTGCTCGT GTGGTTCGCG
ATGCCCGAGA CGCGCGACGC GGCGCTCGCC GAAGACGCTC GGCACTCGAG CGCCGAGCCG
GCGCAGTAA
 
Protein sequence
MTVRHTVSAR SLRALDWLNF FVANVQTGFG PFIASYLASH KWTQGEIGMV LSIGTISAMV 
SQVPGGAAVD ALKNKKGAAA WAIAAIILSA VLLAASPTVV PVIAAEVFHG FASCMLVPAM
AAISFALVGR ESLGDRLGRN ARWASLGSAV AAGLMGLTGE YFSARAVFWL TAALALPALV
ALAMIEPTHH HHHAAPRASA RSDEADEDED EERETLRELL RDKRMLIFAA CVVLFHLSNA
AMLNLAAGEV TAGMGENVQL VIAACIIVPQ AIVAMLSPWV GRSAQRWGRR PILLLGFAAL
PLRALLFAGV SSPYLLVPVQ MLDGISAAVF GVMLPLIAAD VAGGKGRYNL CIGLFGLAAG
VGATLSTALA GFAADHFGNA MSFFGLAAAG ALATLLVWFA MPETRDAALA EDARHSSAEP
AQ