Gene BURPS668_A1918 details


Gene Information

Locus tag: BURPS668_A1918
Symbol:
ID: 4887611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1864048
End bp: 1865634
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 70%
IMG OID: 640131856
Product: major facilitator superfamily permease
Protein accession: YP_001062913
Protein GI: 126442520
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
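
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1864048, 1865634)
print(length)                           # 1587
print(expected_protein_length(length))  # 528
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.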


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00659527
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCGC CGGCGAGACC GGCGGCAGCG GCCGCGAACG GCGCGGGGGC GCACGAGCCG 
GCCGCGGCGC GCCCGCTGCG CGGCGCGAAG CTCGCGCTGC TGACGTTCGC GCTGTCGCTC
GCGACGTTCA TCGAAGTGCT GGACTCGACG GTGGCGAACG TCGCGGTGCC GGCGATCTCG
GGCAGCCTCG GGGTATCGAA CAGCCAGGGC ACGTGGGTGA TCAGCTCGTA CTCGGTGGCC
GCGGCGATCG CGGTGCCGCT GACGGGCTGG CTCGCGCGGC GGGTGGGCGA GCAGCGGCTG
TTCGTCGCGT CGGTGATTCT GTTCACGCTG ACGTCGCTGC TGTGCGGGCT CGCGCGGGAC
CTGGAGGTGC TGGTGGCGTG CCGGGCGCTG CAGGGGCTGT TCTCGGGGCC GATGGTGCCG
CTGTCGCAGA CGATCCTGAT GCGCGCGTTT CCGCCGGCGA AGCGCACGCT CGCGCTGGCG
CTGTGGGGGA TGACGGTGCT GCTCGCGCCG ATCTTCGGGC CGGTGGTGGG CGGCTGGCTG
ATCGACAACT TCTCGTGGCC GTGGATCTTC CTGATCAACC TGCCGATCGG GCTGTTCTCG
TTCGCGGTGT GCACGCTGAT GCTGCGGCCG CGGGCCTCGC GCGGCGAGGC GAGCCCGATC
GACGTGCCGG GGATCGTGCT GCTGGTGATC GGCGTGGGCT CGCTGCAGGC GATGCTGGAC
CTGGGGCATG ACCGGGGGTG GTTCGATTCG TCGCTGATCA CGGCGCTGGC GATCGCGGCG
GGGGTGTCGC TCGTGTCGCT GCTGATCTGG GAGCTGGGCG AGGCGCACCC GGTGGTGGAG
CTGAGCCTGT TCCGGGAGCG GACCTTCACG TTCTGCGTGG TGATCATCTC GCTGGGGATG
ATGAGCTTCT CGGTGGTGGG GGTGGTGTTT CCGCTGTGGC TGCAGGCGGT GATGGGATAC
ACGGCGTACC AGGCGGGGCT GGCGACGGCG TCGATGGGGC TGCTGGCGCT GGTGTTCTCG
ATCCTGGTGG GGGTGTATGC GAGCCGGGTG GACGCACGGG TGCTGGTGAC GTTCGGGTTC
GGGGTGTTCG CGGCGGTGAT GGGGTGGAGC ACGCACTTCA CGCTGTCGAT GACGTTCGCG
CAGGTGGTGA CGCCGCGGCT GATCCAGGGG ATGGGGCTGC CGTGCTTCTT CATTCCGCTG
ACGGCGGCGA CGCTGTCGCG GGTGGCGGAC GACAAGCTGG CGGCGGCGTC GAGCCTGTCG
AACTTCCTGA GGACGTTGTC GGCGGCGTTC GGCACGGCGC TGAGCGTGAC GTGGTGGGAC
AACCGGGCGA CGTATCACTA CGCGGTGGTG TCGCAGGCGG TGACGCGGGC CTCGGAGAAC
ACGCAGCGGT ACGTGGACGC GCTGCACGCG ATGGGGCTGC ACGGCGCGCG CGAGCTGAGC
TCGCTGCACC AGGTGGTGCG GCAGCAGGCG TACATGATGG CGACGAACGA CATGTTCTAC
ATGGCGAGCG TGACGTGCGT GCTGCTGGCG GGGCTGATGT GGCTGACGCG GCCGAAGCGG
GGCGCGGCGG CGACGATGGG GCATTGA
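
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before the sequence is analysed. A minimal sketch follows, assuming the GC content field is the fraction of G and C bases over the full gene sequence; the helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as printed (blocks of 10 bases).
first_row = "ATGAGCGCGC CGGCGAGACC GGCGGCAGCG GCCGCGAACG GCGCGGGGGC GCACGAGCCG"
row = clean_sequence(first_row)
print(len(row))                        # 60
print(f"{gc_content(row):.0%}")        # GC fraction of this row only
```

Passing all rows of the gene sequence through `clean_sequence` in one string gives the full 1587 bp sequence, which can then be compared against the reported GC content.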
 
Protein sequence
MSAPARPAAA AANGAGAHEP AAARPLRGAK LALLTFALSL ATFIEVLDST VANVAVPAIS 
GSLGVSNSQG TWVISSYSVA AAIAVPLTGW LARRVGEQRL FVASVILFTL TSLLCGLARD
LEVLVACRAL QGLFSGPMVP LSQTILMRAF PPAKRTLALA LWGMTVLLAP IFGPVVGGWL
IDNFSWPWIF LINLPIGLFS FAVCTLMLRP RASRGEASPI DVPGIVLLVI GVGSLQAMLD
LGHDRGWFDS SLITALAIAA GVSLVSLLIW ELGEAHPVVE LSLFRERTFT FCVVIISLGM
MSFSVVGVVF PLWLQAVMGY TAYQAGLATA SMGLLALVFS ILVGVYASRV DARVLVTFGF
GVFAAVMGWS THFTLSMTFA QVVTPRLIQG MGLPCFFIPL TAATLSRVAD DKLAAASSLS
NFLRTLSAAF GTALSVTWWD NRATYHYAVV SQAVTRASEN TQRYVDALHA MGLHGARELS
SLHQVVRQQA YMMATNDMFY MASVTCVLLA GLMWLTRPKR GAAATMGH
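
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons), and it stops at the first stop codon. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied as printed.
print(translate("ATGAGCGCGC CGGCGAGACC GGCGGCAGCG GCCGCGAACG GCGCGGGGGC GCACGAGCCG"))
# MSAPARPAAAAANGAGAHEP
print(translate("GGCGCGGCGG CGACGATGGG GCATTGA"))
# GAAATMGH
```

Both outputs match the start and end of the protein sequence listed above; the final TGA codon is the stop and is not represented in the 528 aa protein.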