Gene BURPS1710b_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1667 
SymbolproP 
ID3691329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1777544 
End bp1778878 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content56% 
IMG OID637728123 
Productgp59 
Protein accessionYP_333070 
Protein GI76811195 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00012359 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTGATA GCCGTAATAA CTCAATAATC CGAGGGCACG AAATAATGCA AACATCAACG 
TACGCGCGCG AGGCCGCGCC GAGTGCGAGC TCCGACACGC ATCGGAGAGC TGTAATCGCC
GTCATCGTCG GAAATGGTTT CGAATGGTTC GATTTCATTT CGTATAGTTT CTTCTCAGTC
ATTATTGCGA AACTATTTTT CCCGTCGACG GACGACAACC TGTCTCTGTT GCTGTCGGTT
TCGACGATTG GCGTAGGCTT CTTTATGCGT CCGATCGGTG GCATCGTGAT TGGCGGAATT
GCGGACAAAG TGGGGCGCCG AGCAGCACTT ACGGTCACGA TTGCATTGAT GACCGCCGGG
ACGGCGATGA TTGGATTCGC GCCGACATAC AAAGATGCAG GGCTTGGTGC GCCACTGATG
ATTGTCGTCG CGCGTCTACT TCAGGGATTT TCGGCTGGAG GGGAAATGGG AGGTGCGACA
GCGTATCTTC GCGAGCGCGT GTCGGCCGAG CGGCATGGAT ACTACACGAG CTGGATTCAG
GCGAGTATCG GGTTCGCGAT TATCCTTGCG TCAGTTCTTG CGGTGTTTAT CGTGAAGTGC
CTCGATGAGC AGCAGATCGA ATCTTGGGGC TGGCGAATTC CCTTCCTTCT CGGACTCGGT
CTCGGCCCGG TCGGGATTTA TATCCGCAGT AGGTTGAACG ACCCTGGCTT TCCCGCGGAC
GAGCGTTTGG GCGAGTGTGC GCCGGTCGTC GAGGTCGTCA GGAGCTTTTC GCGTGAGGCG
CTTGTCGGAT TTGGTTTAGT CGTTTTCTGG ACGGTTTGCT CTTATGTCCT ACTGTTCTAC
ATCCCGACCT ACGCTTCGAA GGTTCTGAGA CTCCCGTCTT CTACGGGTTT CATCGCAGTG
CTTGTCGGCG CGTCAATTGT TCTCTTCGTC ACGCCTTTGA TTGGACACTT TTCCGATCTG
TTTGGGCGCC GCTGGTTCCT TGCGGGAGCG TTGCTCGTTG CGATCGTCGC GGCTTATCCG
CTGTTCGCTA TGTTGAATGC CGCACCAGGG TTGAAGTCGT TGCTCGTGTT CCAGGTGGTG
TTCGGGCTCG TTATCGCCAG CTACGAGGGG CCAATCCTGG CGGCGCTTAG CGACATGTTT
CCAGATGGGG TTCTGTCGAC TGGGATTTCG ATCTCGTACA ACCTCGCCGT GATCACGTTT
GGTGGATTCT CCGCCGCGAT CATTACGTGG GCGATTGCGA CCACGCACAA CAACCTCGCG
CCGGCATTCT ACGTGATAGC AGCGGCCATC GTGAGCTTGA TATCCGTGTC TCTCTGGCAA
CCTCGCAGGA AGTAG
 
Protein sequence
MFDSRNNSII RGHEIMQTST YAREAAPSAS SDTHRRAVIA VIVGNGFEWF DFISYSFFSV 
IIAKLFFPST DDNLSLLLSV STIGVGFFMR PIGGIVIGGI ADKVGRRAAL TVTIALMTAG
TAMIGFAPTY KDAGLGAPLM IVVARLLQGF SAGGEMGGAT AYLRERVSAE RHGYYTSWIQ
ASIGFAIILA SVLAVFIVKC LDEQQIESWG WRIPFLLGLG LGPVGIYIRS RLNDPGFPAD
ERLGECAPVV EVVRSFSREA LVGFGLVVFW TVCSYVLLFY IPTYASKVLR LPSSTGFIAV
LVGASIVLFV TPLIGHFSDL FGRRWFLAGA LLVAIVAAYP LFAMLNAAPG LKSLLVFQVV
FGLVIASYEG PILAALSDMF PDGVLSTGIS ISYNLAVITF GGFSAAIITW AIATTHNNLA
PAFYVIAAAI VSLISVSLWQ PRRK