Gene BURPS668_A1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1518 
Symbol 
ID4888080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1458934 
End bp1460403 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content70% 
IMG OID640131457 
Productmajor facilitator superfamily permease 
Protein accessionYP_001062514 
Protein GI126442482 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.28243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACAC TCGATCTCGA CGCGCCGATC TCGACGCCGA TCCGCACCGC GAGCGACGTT 
TCGCGCCTCA TCAACGCATC GGGCGCGCGC GCGAACCACG CGCGCGTGAT CGTCCTGCTC
GCGCTCGGCG GCGTGTTCCT CGATGCGTAC GATCTGACCA CGCTGTCGTA CGGCATCGAG
GACGTGACCC GCGAGTTCGG CCTGACGCCG GCGCTGAGCG GGCTCGTCAG CGCGTCGATC
ATGATCGGCA CGATCCTCGG CAGCCTGCTC GGCGGCTGGT TCACCGACAA GGTCGGCCGC
TACCGCGTGT TCATGGCCGA CATGCTCTGC TTCGTCGTCG CCGCGATCGT GGCCGGACTC
GCGCCGAACG TAGAGGTGCT GATCGCCGCG CGCTTCGTGA TGGGGCTCGG CGTCGGCATC
GATCTGCCGG TCGCGATGGC GTTCCTGGCC GAGTTCTCGA AATTCGGCGG GCGCGGCAAC
AAGGCGTCGC GGCTCGCCGC GTGGTGCCCG ATGTGGTACG CGGCGTCGTC CGTGTGCTTC
CTGATCGTGT TCGGGCTGTA TTTCGCGCTG CCCGCCGAGC ACGCGCGCTG GCTGTGGCGC
GCGTCGCTGA TCTTCGGCGC GGTGCCGGCG CTCGCGATCA TCGCGGTACG CGGCCGCTAC
ATGAACGAAT CGCCGCTGTG GGCCGCGAAT CAAGGCAAGC TGCGCGACGC CGCGCGCATC
CTGCGCGAGT CGTACGGGAT CCGCGCGCAT GCGGCCGACG ACACGCCGCG CGCGGCACCG
TCGCAGCCGC CCGTCAGTTT TCGCGTGCTG TTCAGGCAGC CGTACCTGCC GCGCACGCTC
GTCGCGAGCG CGATGAATCT GTGCATCCCG TTCGAATACA CGGCGATCGC GTTCTTCCTG
CCGACGATCC TGACGCAGTT CCTCGGCGCG GGCGTGTTCG AGACGATCGC CGCGACGCTC
GCGCTCAACG TGCTGTTCGC GCTCACGGGC GGGTTGCTCG GCATGCGCCT CGCGTACCGG
CTGCCGTCGC GGCGCGTCGC GATCGCCGGC TTCGCGTTGC AGGCCGCGGC GCTCGTCACG
CTCGCGCTGC TCGGCCATCC GCGGACCGCG CTCGGCATCG GCGCCGCGGT GCTGATGCTC
GGCCTGTGGC TGTTCGCCGA AGGCTTCGGC CCCGGCGCGC AGATGATGAT CTATCCGGCG
CTGTCGTATC CGGCGTCGAT TCGCGGCACG GGGCTCGGCT TCGGGCGTGC GCTGACGGGC
ATCGGCAGCG CGTTCGCGCT GTTCGTGCTG CCGATCCTCA ACGCGCGCCT CGGCGCCGGC
ATGTTCTGGA TCGTGGCGAT CGCGGCATTC GTGCCGATCG TGTTTCTCGC GGCGATTCGC
TTCGAGCCGA CCGCGCGCGA TGTCGATGTC GACATCGATG CCGGCCCGCG TGCGCAAGGC
GATCGAGCGC GCGTGCGCGC CGCCGCGTAG
 
Protein sequence
MSTLDLDAPI STPIRTASDV SRLINASGAR ANHARVIVLL ALGGVFLDAY DLTTLSYGIE 
DVTREFGLTP ALSGLVSASI MIGTILGSLL GGWFTDKVGR YRVFMADMLC FVVAAIVAGL
APNVEVLIAA RFVMGLGVGI DLPVAMAFLA EFSKFGGRGN KASRLAAWCP MWYAASSVCF
LIVFGLYFAL PAEHARWLWR ASLIFGAVPA LAIIAVRGRY MNESPLWAAN QGKLRDAARI
LRESYGIRAH AADDTPRAAP SQPPVSFRVL FRQPYLPRTL VASAMNLCIP FEYTAIAFFL
PTILTQFLGA GVFETIAATL ALNVLFALTG GLLGMRLAYR LPSRRVAIAG FALQAAALVT
LALLGHPRTA LGIGAAVLML GLWLFAEGFG PGAQMMIYPA LSYPASIRGT GLGFGRALTG
IGSAFALFVL PILNARLGAG MFWIVAIAAF VPIVFLAAIR FEPTARDVDV DIDAGPRAQG
DRARVRAAA