Gene BURPS668_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1865 
Symbol 
ID4885419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1828042 
End bp1829346 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content67% 
IMG OID640127793 
ProductMFS transporter, metabolite:H+ symporter (MHS) family protein 
Protein accessionYP_001058900 
Protein GI126438364 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGACC AGACCGCCGT CGCCGCCGCT GCGCGGCAGG ACGTCCGACG CCGCGTGCTC 
GCGATCGTCG GCGCTTCGTC GGGCAACCTC GTCGAGTGGT TCGACTTCTA CATCTACTCG
TTCTGCGCGC TGTATTTCGC GCCCGCGTTT TTCCCGAGCG GGAACACGAC CACGCAGCTT
CTCAACACCG CGGGCGTGTT CGCCGCGGGC TTCCTGATGC GGCCGATCGG CGGCTGGCTG
TTCGGCCGGA TCGCCGACAA GCACGGCCGG CGCGCCGCGA TGATGATCTC GGTGCTGATG
ATGTGCGGCG GCTCGCTCGT GATCGCGGTG CTGCCGACGT ATGCGCAGAT CGGCGCGCTC
GCGCCGTTGC TGCTGCTCGT CGCACGGCTG TTCCAGGGGC TCTCGGTGGG CGGCGAGTAC
GGCACGAGCG CGACGTACAT GAGCGAGGTC GCGCTCCAGG GCCGGCGCGG CTTCTTCGCG
TCGTTCCAGT ACGTGACGCT GATCGGCGGC CAGCTCTGCG CGCTGCTCGT GCTCGTGATC
CTGCAGCAGA CGCTTTCGAG CGACGCGCTC AAGGCTTGGG GGTGGCGGAT TCCGTTCGTC
GTCGGCGCGG CGGCCGCGCT GATCTCGCTG TATCTGCGCA AGTCGCTCGA CGAGACGTCG
ACGAGCGAAT CGCGCAAGGC GAAGGACGCC GGCACGATCC GCGGCGTGTG GCAGCACAAG
GGCGCGTTCC TGACGGTGGT CGGCTTCACG GCGGGCGGCT CGCTGATCTT CTACACGTTC
ACGACCTACA TGCAGAAGTA CCTCGTCAAC ACGGCCGGCA TGCATGCGAA GACGGCGAGC
AACGTGATGA CGGCCGCGCT CTTCGTCTAC ATGCTGATGC AGCCGGTGTT CGGCGCGCTG
TCCGACAAGA TCGGCCGCCG CATGTCGATG ATCCTGTTCG GCACGGGCGC CGTGATCGGC
ACGGTGCCGC TGATGCATGC GCTGGGCGGC GTGACGAGCC CGCTCGCCGC ATTCGGGCTG
ATCGTCGTCG CGCTCGCGAT CGTCAGCTTC TACACGTCGA TCAGCGGCCT CATCAAGGCC
GAAATGTTTC CGCCCGAGGT GCGCGCGATG GGCGTCGGCC TGTCGTACGC GGTCGCCAAC
GCGATCTTCG GCGGCTCGGC CGAATATGTC GCGCTGTGGT TCAAGTCGGT CGGCAGCGAA
TCGAGCTTCT ACTGGTACGT GACCGTGCTC TGCGCGATCT CGCTGCTCGT GTCGTGGCGG
ATGCGCGATC CGAGCCGGGA AGGCTACCTG CGCAACGAGC CCTGA
 
Protein sequence
MNDQTAVAAA ARQDVRRRVL AIVGASSGNL VEWFDFYIYS FCALYFAPAF FPSGNTTTQL 
LNTAGVFAAG FLMRPIGGWL FGRIADKHGR RAAMMISVLM MCGGSLVIAV LPTYAQIGAL
APLLLLVARL FQGLSVGGEY GTSATYMSEV ALQGRRGFFA SFQYVTLIGG QLCALLVLVI
LQQTLSSDAL KAWGWRIPFV VGAAAALISL YLRKSLDETS TSESRKAKDA GTIRGVWQHK
GAFLTVVGFT AGGSLIFYTF TTYMQKYLVN TAGMHAKTAS NVMTAALFVY MLMQPVFGAL
SDKIGRRMSM ILFGTGAVIG TVPLMHALGG VTSPLAAFGL IVVALAIVSF YTSISGLIKA
EMFPPEVRAM GVGLSYAVAN AIFGGSAEYV ALWFKSVGSE SSFYWYVTVL CAISLLVSWR
MRDPSREGYL RNEP