Gene BURPS668_A0478 details

Gene Information

Locus tag: BURPS668_A0478
Symbol:
ID: 4885913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 434399
End bp: 435736
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 70%
IMG OID: 640130419
Product: major facilitator family transporter
Protein accession: YP_001061484
Protein GI: 126445072
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
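
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 434399
END_BP = 435736
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming a complete, unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the stated gene length and protein length agree with the start and end coordinates.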


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.138863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGAAG ACAACCTGAC GATGCCCGCC GACGGTGCGG GCGTCGCCCC GGCCGACGCC 
GTATCGGCGG CGGGCTTGCG CACCGCGCGC GGCGGTGCGA TCGCGGCCGC GGTGATCGGC
AACTGGCTCG AATTCTTCGA TTTCACCGTC TACGGTTTTT TCGCGGTGAT CATCGGCAAG
CTGTATTTCC CGTCGCACGA CGCGACGACG TCGCTGCTGC TGTCGGTCGC GACGTTCGCC
GCGGGCTTCT TCACGCGTCC GCTCGGCAGC ATCGTGCTCG GCGTCTACGC GGACCGCCGC
GGCCGCAAGG CGGCGCTGAA CCTGACGATC CTGCTGATGG CGCTCGGCAC CGGCATGATC
GCGCTCGCGC CGACCTACGC GCAGATCGGC GTGCTCGCGC CGGTGATCGT CGTGTGCGCG
CGGCTGATGC AGGGCTTTTC GCAAGGCGGC GAGTTCGGCG CGGCGACGTC GACGCTCGTC
GAGCACGGCG GCGCCGCGCG CCGGGGCTTT CGCGCGAGCT GGCAGCTCGC GACACAGGGC
GGCGCGGCGC TGATGGGCTC GGGGTTCGCG GCGCTGCTGT CGAACACGCT CGCGAAGGAC
GCGCTGGAAA GCTGGGGCTG GCGCGTGCCG TTCGCGCTCG GTGTGCTGAT CGCGCCCGTC
GGCATGTATC TGCGCCGCCG CCTGGCCGAC GATGCGCCGG GCGCCGGCCA CCACGGCATC
GACGGCGGCG TGCTGCGCGA GCTGTTCGCG CGGCACGGCC GCACGGTGCT GCTGCTGACG
CTCACGGTGA TGGGCGGCAC CGTATCGACG TACATCCTGA CGTTCTACAT GCCGACCTAC
GCGATCCATA CGCTCGGGCT GCCGATGAAG CTGTCGATGT TCGTCGGCGT CGCGTCCGGC
TGCGTGATGC TCGTCACCTG CCCGTTGTTC GGCTGGCTGT CCGACCGGAT CGGCAGCCGG
CGCCTGCCGA TCTTCGTCGG GCGCGGCGTG CTCGTCGTGC TGCTGTTTCC GGCGTTCGTG
CTGATGAACC GCTATCCGTC GCTCGCGGTC GTGATGCCGC TCACCGGGCT GATGCTGCTG
TTCTATTCGA TGGGCTCGGC GTCCGAGTTC GCGCTGATGT GCGAATCGTT TCCGCGCCGC
GTGCGTGCGA CGGGCATCTC GATCGCCTAT GCGCTCGCGG TGACGCTGTT CGGCGGCACC
GCCCAGCTCG TCGCGACGTG GCTCGTGCGC GTGACGGGCA GCACGCTCGC GCCGGCCGCT
TACGTGGCCG CGTGCGTGAT CGTGTCGCTC GTCGCGGTCG GGATGCTGTG CGAGACGGCG
ACGGAAACGG CCGACTGA
 
Protein sequence
MHEDNLTMPA DGAGVAPADA VSAAGLRTAR GGAIAAAVIG NWLEFFDFTV YGFFAVIIGK 
LYFPSHDATT SLLLSVATFA AGFFTRPLGS IVLGVYADRR GRKAALNLTI LLMALGTGMI
ALAPTYAQIG VLAPVIVVCA RLMQGFSQGG EFGAATSTLV EHGGAARRGF RASWQLATQG
GAALMGSGFA ALLSNTLAKD ALESWGWRVP FALGVLIAPV GMYLRRRLAD DAPGAGHHGI
DGGVLRELFA RHGRTVLLLT LTVMGGTVST YILTFYMPTY AIHTLGLPMK LSMFVGVASG
CVMLVTCPLF GWLSDRIGSR RLPIFVGRGV LVVLLFPAFV LMNRYPSLAV VMPLTGLMLL
FYSMGSASEF ALMCESFPRR VRATGISIAY ALAVTLFGGT AQLVATWLVR VTGSTLAPAA
YVAACVIVSL VAVGMLCETA TETAD
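
The gene and protein sequences above can be related to each other, and the GC content field recomputed, with a short script. This is an illustrative sketch only, not the database's own method. It builds the codon-to-amino-acid mapping used by translation table 11 and strips the spaces and line breaks that this page inserts every 10 characters. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. All function names are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGCACGAAG ACAACCTGAC GATGCCCGCC"
last_line = "ACGGAAACGG CCGACTGA"
print(translate(first_blocks))  # MHEDNLTMPA
print(translate(last_line))     # TETAD
```

The two printed fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.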