Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1596 |
Symbol | proP |
ID | 3690476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | - |
Start bp | 1676950 |
End bp | 1678836 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637728052 |
Product | putative proline/betaine transporter |
Protein accession | YP_333000 |
Protein GI | 76809832 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTTC TTTCAACAGT CTACGCCTGT CATTTATTTC GGTCACCAGA CCGTTCGGCC GCGTGCCGCG ACTGGTCGCC CGGAATTTCG CTCACAAGAA AGCGCGCCTT CGGGCAGCTT TTTTCGTTTC GACGCCCTCG AAGGGCATTC AACCGGCCGC GTACGCCACG CGATGCGCCG GTTCGCCGAC CGTCCCCATG CGGCGGTCTT TTCTCACGAG TGCAAGCGCG CGACGACCCT CGGAGCCGCG CCGATCGGTT TCCGTTATGG ATTAAGCGTT TCATGCGCCC TTTCACGCTC GCGACCGACC CTCGTCCGAT CGCGAGCGGC ACTCCCGGCA AACCGTGGCG AACCGCGCCC CGTCGCGCGG TTCGGCCGCC GTCACAGGAG TTTTCGACCT TGACTGCAAC ACCCGCCCCC TCCAGTTCGT CCAGCGCGCC CACCGAAGGC GCGCTTCCCG CCGCTGCGCA CGAGATCACC GTCGTCGATC AGGGCCTGCT CAAGCGCGCC GTCGGCGCGA TGGCGCTCGG CAACGCGATG GAATGGTTCG ACTTCGGCGT CTACAGCTAC ATCGCCGTCA CGCTCGGCCA GGTGTTCTTC CCGTCGAGCA GCCCGTCCGC GCAGTTGCTC GCGACGTTCG GCACGTTCGC CGCCGCCTTC CTCGTGCGCC CGCTCGGCGG GATGGTGTTC GGGCCGCTCG GCGATCGCAT CGGCCGCCAG CGCGTGCTCG CGATGACGAT GATCATGATG GCGGTCGGCA CGTTCGCGAT CGGCCTGATC CCGAGCTACG ACTCGATCGG CCTCCTCGCG CCCGTGCTGC TCCTCGTCGC GCGTCTCGTG CAAGGCTTCT CGACGGGCGG CGAGTACGGC GGCGCGGCAA CCTTCATCGC CGAGTTCTCG ACCGACAAGC GCCGCGGCTT CATGGGCAGC TTCCTCGAGT TCGGCACGCT GATCGGCTAT GTGATGGGCG CGGGCGTCGT CGCGCTGCTG ACGGCTTCGC TGTCGCACGA CGCGCTGCTG TCGTGGGGCT GGCGCGTGCC GTTCCTGATC GCCGGCCCGC TCGGCCTGAT CGGCCTGTAC ATCCGGATGA GGCTCGAGGA AACGCCCGCG TTCAAGCGGC AGGCCGAAGC GCGCGAAGCG CAGGACAAGG CCGTGCCGAA GGCGCATTTC CGCCGACAGC TCGCGCGGCA CTGGCGCGCG CTGCTGCTGT GCGTCGGCCT CGTGCTGATC TTCAACGTCA CCGATTACAT GGCGCTGTCG TACCTGCCGA GCTATCTGTC GTCGACGCTG CACTTCGACG AGGCGCACGG CCTCGTGCTG ATCCTGATCG TGATGGTGCT GATGATGCCG ATGACGCTCG CCACGGGCCG CCTGTCGGAC GCCGTCGGCC GCAAGCCGGT GATGCTCGCC GGCTGCGTCG GGCTCTTCGC GCTCGCGATT CCCGCGCTGC TCCTGATCCG CACCGGCGAG ACGGCGCTCG TGTTCGGCGG CCTGCTGATC CTCGGCGCAC TGCTGTCGTG CTTCACGGGC GTGATGCCGT CGGCGCTGCC CGCGCTCTTT CCGACCGAGA TCCGCTACGG CGCGCTCGCG ATCGGCTTCA ACGTGTCGGT GTCGCTGTTC GGCGGCACGA CGCCGCTCGC CGCCGCGTGG CTCGTCGACG CGACGGGCAA CCTGATGATG CCCGCGTACT ACCTGATGGG CGCGGCCGTG ATCGGCGCGA TCTCGGTGCT CGCGCTGCCC GAGAGCGCGC GCCAGCCGCT CAAGGGCTCG CCGCCCGCCG TCGCGTCGCA CCGCGAGGCA CACGCGCTCG CGCGCGAGAT CAAGCGCCGC GAGGCGGCCG AGCGCGACGA CAGCGGCTAC CCGTCGGCCG CGGCGTTGCG CGCGTGA
|
Protein sequence | MSLLSTVYAC HLFRSPDRSA ACRDWSPGIS LTRKRAFGQL FSFRRPRRAF NRPRTPRDAP VRRPSPCGGL FSRVQARDDP RSRADRFPLW IKRFMRPFTL ATDPRPIASG TPGKPWRTAP RRAVRPPSQE FSTLTATPAP SSSSSAPTEG ALPAAAHEIT VVDQGLLKRA VGAMALGNAM EWFDFGVYSY IAVTLGQVFF PSSSPSAQLL ATFGTFAAAF LVRPLGGMVF GPLGDRIGRQ RVLAMTMIMM AVGTFAIGLI PSYDSIGLLA PVLLLVARLV QGFSTGGEYG GAATFIAEFS TDKRRGFMGS FLEFGTLIGY VMGAGVVALL TASLSHDALL SWGWRVPFLI AGPLGLIGLY IRMRLEETPA FKRQAEAREA QDKAVPKAHF RRQLARHWRA LLLCVGLVLI FNVTDYMALS YLPSYLSSTL HFDEAHGLVL ILIVMVLMMP MTLATGRLSD AVGRKPVMLA GCVGLFALAI PALLLIRTGE TALVFGGLLI LGALLSCFTG VMPSALPALF PTEIRYGALA IGFNVSVSLF GGTTPLAAAW LVDATGNLMM PAYYLMGAAV IGAISVLALP ESARQPLKGS PPAVASHREA HALAREIKRR EAAERDDSGY PSAAALRA
|
| |