Gene BURPS1710b_1596 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1596 
SymbolproP 
ID3690476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1676950 
End bp1678836 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content69% 
IMG OID637728052 
Productputative proline/betaine transporter 
Protein accessionYP_333000 
Protein GI76809832 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTTC TTTCAACAGT CTACGCCTGT CATTTATTTC GGTCACCAGA CCGTTCGGCC 
GCGTGCCGCG ACTGGTCGCC CGGAATTTCG CTCACAAGAA AGCGCGCCTT CGGGCAGCTT
TTTTCGTTTC GACGCCCTCG AAGGGCATTC AACCGGCCGC GTACGCCACG CGATGCGCCG
GTTCGCCGAC CGTCCCCATG CGGCGGTCTT TTCTCACGAG TGCAAGCGCG CGACGACCCT
CGGAGCCGCG CCGATCGGTT TCCGTTATGG ATTAAGCGTT TCATGCGCCC TTTCACGCTC
GCGACCGACC CTCGTCCGAT CGCGAGCGGC ACTCCCGGCA AACCGTGGCG AACCGCGCCC
CGTCGCGCGG TTCGGCCGCC GTCACAGGAG TTTTCGACCT TGACTGCAAC ACCCGCCCCC
TCCAGTTCGT CCAGCGCGCC CACCGAAGGC GCGCTTCCCG CCGCTGCGCA CGAGATCACC
GTCGTCGATC AGGGCCTGCT CAAGCGCGCC GTCGGCGCGA TGGCGCTCGG CAACGCGATG
GAATGGTTCG ACTTCGGCGT CTACAGCTAC ATCGCCGTCA CGCTCGGCCA GGTGTTCTTC
CCGTCGAGCA GCCCGTCCGC GCAGTTGCTC GCGACGTTCG GCACGTTCGC CGCCGCCTTC
CTCGTGCGCC CGCTCGGCGG GATGGTGTTC GGGCCGCTCG GCGATCGCAT CGGCCGCCAG
CGCGTGCTCG CGATGACGAT GATCATGATG GCGGTCGGCA CGTTCGCGAT CGGCCTGATC
CCGAGCTACG ACTCGATCGG CCTCCTCGCG CCCGTGCTGC TCCTCGTCGC GCGTCTCGTG
CAAGGCTTCT CGACGGGCGG CGAGTACGGC GGCGCGGCAA CCTTCATCGC CGAGTTCTCG
ACCGACAAGC GCCGCGGCTT CATGGGCAGC TTCCTCGAGT TCGGCACGCT GATCGGCTAT
GTGATGGGCG CGGGCGTCGT CGCGCTGCTG ACGGCTTCGC TGTCGCACGA CGCGCTGCTG
TCGTGGGGCT GGCGCGTGCC GTTCCTGATC GCCGGCCCGC TCGGCCTGAT CGGCCTGTAC
ATCCGGATGA GGCTCGAGGA AACGCCCGCG TTCAAGCGGC AGGCCGAAGC GCGCGAAGCG
CAGGACAAGG CCGTGCCGAA GGCGCATTTC CGCCGACAGC TCGCGCGGCA CTGGCGCGCG
CTGCTGCTGT GCGTCGGCCT CGTGCTGATC TTCAACGTCA CCGATTACAT GGCGCTGTCG
TACCTGCCGA GCTATCTGTC GTCGACGCTG CACTTCGACG AGGCGCACGG CCTCGTGCTG
ATCCTGATCG TGATGGTGCT GATGATGCCG ATGACGCTCG CCACGGGCCG CCTGTCGGAC
GCCGTCGGCC GCAAGCCGGT GATGCTCGCC GGCTGCGTCG GGCTCTTCGC GCTCGCGATT
CCCGCGCTGC TCCTGATCCG CACCGGCGAG ACGGCGCTCG TGTTCGGCGG CCTGCTGATC
CTCGGCGCAC TGCTGTCGTG CTTCACGGGC GTGATGCCGT CGGCGCTGCC CGCGCTCTTT
CCGACCGAGA TCCGCTACGG CGCGCTCGCG ATCGGCTTCA ACGTGTCGGT GTCGCTGTTC
GGCGGCACGA CGCCGCTCGC CGCCGCGTGG CTCGTCGACG CGACGGGCAA CCTGATGATG
CCCGCGTACT ACCTGATGGG CGCGGCCGTG ATCGGCGCGA TCTCGGTGCT CGCGCTGCCC
GAGAGCGCGC GCCAGCCGCT CAAGGGCTCG CCGCCCGCCG TCGCGTCGCA CCGCGAGGCA
CACGCGCTCG CGCGCGAGAT CAAGCGCCGC GAGGCGGCCG AGCGCGACGA CAGCGGCTAC
CCGTCGGCCG CGGCGTTGCG CGCGTGA
 
Protein sequence
MSLLSTVYAC HLFRSPDRSA ACRDWSPGIS LTRKRAFGQL FSFRRPRRAF NRPRTPRDAP 
VRRPSPCGGL FSRVQARDDP RSRADRFPLW IKRFMRPFTL ATDPRPIASG TPGKPWRTAP
RRAVRPPSQE FSTLTATPAP SSSSSAPTEG ALPAAAHEIT VVDQGLLKRA VGAMALGNAM
EWFDFGVYSY IAVTLGQVFF PSSSPSAQLL ATFGTFAAAF LVRPLGGMVF GPLGDRIGRQ
RVLAMTMIMM AVGTFAIGLI PSYDSIGLLA PVLLLVARLV QGFSTGGEYG GAATFIAEFS
TDKRRGFMGS FLEFGTLIGY VMGAGVVALL TASLSHDALL SWGWRVPFLI AGPLGLIGLY
IRMRLEETPA FKRQAEAREA QDKAVPKAHF RRQLARHWRA LLLCVGLVLI FNVTDYMALS
YLPSYLSSTL HFDEAHGLVL ILIVMVLMMP MTLATGRLSD AVGRKPVMLA GCVGLFALAI
PALLLIRTGE TALVFGGLLI LGALLSCFTG VMPSALPALF PTEIRYGALA IGFNVSVSLF
GGTTPLAAAW LVDATGNLMM PAYYLMGAAV IGAISVLALP ESARQPLKGS PPAVASHREA
HALAREIKRR EAAERDDSGY PSAAALRA