Gene BURPS1710b_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2378 
SymbolemrY 
ID3688601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2650910 
End bp2652466 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID637728834 
Producthypothetical protein 
Protein accessionYP_333773 
Protein GI76809632 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC 
TTTCCATGGG TGCTCGCGAT CGTCACCGGC CTCGATTACT TCGACAACGC CGCGTTCTCG
TTCTTCGCGA GCTACATCGC GGGCGGAATC AACGCGTCGC CGGACGAGCT CGTGTGGGCG
TCGAGCGCTT ACGCGGTGAC GGCCGTGCTC GGCATCCTGC AGCAGCAATG GTGGGTCGAC
CGGCTCGGTC ACCGGCGTTA CGTCGCCGGC TGCATGCTGA TGTTCTCGCT TGGCGCAATG
GCCGCGGCGG CGGCCGACAC GTCGCTGCAG CTCGCGTTCG CGCGCGGCTT TCAGGGCTAT
TTCATCGGCC CCATGATGGG CGCGTGCCGG ATCCTGATCC AGGTCAGCTT CGCGCCGAAG
GATCGCCCGC CCGCGACGCG CGCGTTCCTC ATCATGCTGC TGCTCGGCAG CGCGCTCGCG
CCGATCGCGG GCGGCCTGCT CGTCGCGCAC TTCACATGGC GCGCGCTGTT CGCCTGCACG
GTGCCGGCCG GCATCCTGTT CGCGGCGCTC GCGTTCGTCG CGCTGCCCGA TTCCGGCCAC
ACGCCGCCCG ACGAACGCGG CGGCGCGCAT TTCTGGCCGT ACGTGATCTT CGCGCTCGCG
CAAGGCGCGC TGCAGATCGT CATGCAGCAG GTGCGCTTCC AGCTCTTCGC CGGCTCGCCG
CTGCTCGTGC TGCTCGCCGC CGGCGGCCTC GCGGCGCTCG CGTGGTTCGG CCATCATCAG
TGGCATCATC CGGCGCCGCT CGTGCGGCTG CACGCGTTTC GCGAGCGCAC GTTCCGGGTC
GGCCTGCTGC TCTACCTGTT CTATTACTAC GAGACGACGG GCTACAGCTA TCTGATCTCC
CGCTTCCTCG AAACCGGGCT CGGCTATCCG GTCGAGAACG CCGGGCGGCT CGTCGGCACG
ATGTCGCTGA TCTCCGCGAG CGCGCTCTTC GTCTACCTGC GCTACGCGAA GCTTCTCACG
CACAAGAAAT GGATCATCGT GCCCGGCTTC GCGCTCGCCG CGTTCGCCGC GCTATGGATG
ACGCGGATGT CGCCCGAGGT CGGCGAAGCG GCGCTCGTCG CGCCGCTCCT GATGCGCGGG
CTGCTGCTCC TGTTCATCGT GCTGCCCGTC GCGAACCTGA CGTTTCGCGT GTTCGCGATC
GACGAGTATT CGCACGGCTA CCGGCTGAAG AACATCGTCC GGCAACTGAC GATTTCGTTT
GCGACCGCCT CCGTCATCAT CGTCGAGCAG CATCGGCTCG CCGTGCATCA GACGCGGCTT
GTCGAGCAGG CGAACGTCTA CAATCCGCTG TTCCGGCAAA CCCTCGACAC GCTCACGCGC
GGCTTCGCGG CCGCGGGCCA CGCGTTCTCC GACGCGCACG CGCTCGCGCT CGTCACCGTG
AGCCGCACCA TCGCGCAACA GGCGTCGTTC CTGGCGTCGC TCGACGGCTT CTACTTCCTC
GCGGGCGTCG CGATCTGCGG CGGCCTGTTC GCCGCCTGGC AAAAAGAGAT CGATTGA
 
Protein sequence
MSAAPGRPPL WSAANLRGDF FPWVLAIVTG LDYFDNAAFS FFASYIAGGI NASPDELVWA 
SSAYAVTAVL GILQQQWWVD RLGHRRYVAG CMLMFSLGAM AAAAADTSLQ LAFARGFQGY
FIGPMMGACR ILIQVSFAPK DRPPATRAFL IMLLLGSALA PIAGGLLVAH FTWRALFACT
VPAGILFAAL AFVALPDSGH TPPDERGGAH FWPYVIFALA QGALQIVMQQ VRFQLFAGSP
LLVLLAAGGL AALAWFGHHQ WHHPAPLVRL HAFRERTFRV GLLLYLFYYY ETTGYSYLIS
RFLETGLGYP VENAGRLVGT MSLISASALF VYLRYAKLLT HKKWIIVPGF ALAAFAALWM
TRMSPEVGEA ALVAPLLMRG LLLLFIVLPV ANLTFRVFAI DEYSHGYRLK NIVRQLTISF
ATASVIIVEQ HRLAVHQTRL VEQANVYNPL FRQTLDTLTR GFAAAGHAFS DAHALALVTV
SRTIAQQASF LASLDGFYFL AGVAICGGLF AAWQKEID