Gene BURPS1106A_1492 details

Gene Information

Locus tag: BURPS1106A_1492
Symbol:
ID: 4902298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1448522
End bp: 1450126
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 70%
IMG OID: 640134723
Product: putative proline/betaine transporter
Protein accession: YP_001065766
Protein GI: 126452173
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00883] metabolite-proton symporter
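
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1448522, 1450126)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Under these assumptions the computed values match the listed Gene Length (1605 bp) and Protein Length (534 aa).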


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCTT TCACGCTCGC GACCGACCCT CGTCCGATCG CGAGCGGCAC TCCCGGCAAA 
CCGTGGCGAA CCGCGCCCCG TCGCGCGGTT CGGCCGCCGT CACAGGAGTT TTCGACCTTG
ACTGCAACAC CCGCCCCCTC CAGTTCGTCC AGCGCGCCCA CCGAAGGCGC GCTTCCCGCC
GCTGCGCACG AGATCACCGT CGTCGATCAG GGCCTGCTCA AGCGCGCCGT CGGCGCGATG
GCGCTCGGCA ACGCGATGGA ATGGTTCGAC TTCGGCGTCT ACAGCTACAT CGCCGTCACG
CTCGGCCAGG TGTTCTTCCC GTCGAGCAGC CCGTCCGCGC AGTTGCTCGC GACGTTCGGC
ACGTTCGCCG CCGCCTTCCT CGTGCGCCCG CTCGGCGGGA TGGTGTTCGG GCCGCTCGGC
GATCGCATCG GCCGCCAGCG CGTGCTCGCG ATGACGATGA TCATGATGGC GGTCGGCACG
TTCGCGATCG GCCTGATCCC GAGCTACGAC TCGATCGGCC TCCTCGCGCC CGTGCTGCTC
CTCGTCGCGC GTCTCGTGCA AGGCTTCTCG ACGGGCGGCG AGTACGGCGG CGCGGCAACC
TTCATCGCCG AGTTCTCGAC CGACAAGCGC CGCGGCTTCA TGGGCAGCTT CCTCGAGTTC
GGCACGCTGA TCGGCTATGT GATGGGCGCG GGCGTCGTCG CGCTGCTGAC GGCTTCGCTG
TCGCACGACG CGCTGCTGTC GTGGGGCTGG CGCGTGCCGT TCCTGATCGC CGGCCCGCTC
GGCCTGATCG GCCTGTACAT CCGGATGAGG CTCGAGGAAA CGCCCGCGTT CAAGCGGCAG
GCCGAAGCGC GCGAAGCGCA GGACAAGGCC GTGCCGAAGG CGCATTTCCG CCGACAGCTC
GCGCGGCACT GGCGCGCGCT GCTGCTGTGC GTCGGCCTCG TGCTGATCTT CAACGTCACC
GATTACATGG CGCTGTCGTA CCTGCCGAGC TATCTGTCGT CGACGCTGCA CTTCGACGAG
GCGCACGGCC TCGTGCTGAT CCTGATCGTG ATGGTGCTGA TGATGCCGAT GACGCTCGCC
ACGGGCCGCC TGTCGGACGC CGTCGGCCGC AAGCCGGTGA TGCTCGCCGG CTGCGTCGGG
CTCTTCGCGC TCGCGATTCC CGCGCTGCTC CTGATCCGCA CCGGCGAGAC GGCGCTCGTG
TTCGGCGGCC TGCTGATCCT CGGCGCACTG CTGTCGTGCT TCACGGGCGT GATGCCGTCG
GCGCTGCCCG CGCTCTTTCC GACCGAGATC CGCTACGGCG CGCTCGCGAT CGGCTTCAAC
GTGTCGGTGT CGCTGTTCGG CGGCACGACG CCGCTCGCCG CCGCGTGGCT CGTCGACGCG
ACGGGCAACC TGATGATGCC CGCGTACTAC CTGATGGGCG CGGCCGTGAT CGGCGCGATC
TCGGTGCTCG CGCTGCCCGA GAGCGCGCGC CAGCCGCTCA AGGGCTCGCC GCCCGCCGTC
GCGTCGCACC GCGAGGCACA CGCGCTCGCG CGCGAGATCA AGCGCCGCGA GGCGGCCGAG
CGCGACGACA GCGGCTACCC GTCGGCCGCG GCGTTGCGCG CGTGA
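
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. As a minimal sketch (the helper names are hypothetical and not part of the source page), GC content could be computed as follows. Only the first row of the sequence is used here to keep the example short; pasting the full sequence in its place would allow a comparison with the listed GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, as printed (blocks of 10 bases).
first_row = "ATGCGCCCTT TCACGCTCGC GACCGACCCT CGTCCGATCG CGAGCGGCAC TCCCGGCAAA"

seq = clean_sequence(first_row)
print(len(seq))                              # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")   # GC content of this row only
```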
 
Protein sequence
MRPFTLATDP RPIASGTPGK PWRTAPRRAV RPPSQEFSTL TATPAPSSSS SAPTEGALPA 
AAHEITVVDQ GLLKRAVGAM ALGNAMEWFD FGVYSYIAVT LGQVFFPSSS PSAQLLATFG
TFAAAFLVRP LGGMVFGPLG DRIGRQRVLA MTMIMMAVGT FAIGLIPSYD SIGLLAPVLL
LVARLVQGFS TGGEYGGAAT FIAEFSTDKR RGFMGSFLEF GTLIGYVMGA GVVALLTASL
SHDALLSWGW RVPFLIAGPL GLIGLYIRMR LEETPAFKRQ AEAREAQDKA VPKAHFRRQL
ARHWRALLLC VGLVLIFNVT DYMALSYLPS YLSSTLHFDE AHGLVLILIV MVLMMPMTLA
TGRLSDAVGR KPVMLAGCVG LFALAIPALL LIRTGETALV FGGLLILGAL LSCFTGVMPS
ALPALFPTEI RYGALAIGFN VSVSLFGGTT PLAAAWLVDA TGNLMMPAYY LMGAAVIGAI
SVLALPESAR QPLKGSPPAV ASHREAHALA REIKRREAAE RDDSGYPSAA ALRA
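
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it uses the codon-to-amino-acid mapping of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in which codons may initiate translation; this gene starts with ATG, so no special handling is included). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the matching amino acid
# string of the standard genetic code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base block formatting) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence; it should give the first 20 residues.
first_row = "ATGCGCCCTT TCACGCTCGC GACCGACCCT CGTCCGATCG CGAGCGGCAC TCCCGGCAAA"
print(translate(first_row))  # MRPFTLATDPRPIASGTPGK
```

The output matches the first two blocks of the protein sequence (MRPFTLATDP RPIASGTPGK). Applied to the end of the gene, the final codons GCG TTG CGC GCG TGA give ALRA followed by a stop, matching the last four residues listed.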