Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1710b_1667 |
Symbol | proP |
ID | 3691329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 1777544 |
End bp | 1778878 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637728123 |
Product | gp59 |
Protein accession | YP_333070 |
Protein GI | 76811195 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00012359 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTGATA GCCGTAATAA CTCAATAATC CGAGGGCACG AAATAATGCA AACATCAACG TACGCGCGCG AGGCCGCGCC GAGTGCGAGC TCCGACACGC ATCGGAGAGC TGTAATCGCC GTCATCGTCG GAAATGGTTT CGAATGGTTC GATTTCATTT CGTATAGTTT CTTCTCAGTC ATTATTGCGA AACTATTTTT CCCGTCGACG GACGACAACC TGTCTCTGTT GCTGTCGGTT TCGACGATTG GCGTAGGCTT CTTTATGCGT CCGATCGGTG GCATCGTGAT TGGCGGAATT GCGGACAAAG TGGGGCGCCG AGCAGCACTT ACGGTCACGA TTGCATTGAT GACCGCCGGG ACGGCGATGA TTGGATTCGC GCCGACATAC AAAGATGCAG GGCTTGGTGC GCCACTGATG ATTGTCGTCG CGCGTCTACT TCAGGGATTT TCGGCTGGAG GGGAAATGGG AGGTGCGACA GCGTATCTTC GCGAGCGCGT GTCGGCCGAG CGGCATGGAT ACTACACGAG CTGGATTCAG GCGAGTATCG GGTTCGCGAT TATCCTTGCG TCAGTTCTTG CGGTGTTTAT CGTGAAGTGC CTCGATGAGC AGCAGATCGA ATCTTGGGGC TGGCGAATTC CCTTCCTTCT CGGACTCGGT CTCGGCCCGG TCGGGATTTA TATCCGCAGT AGGTTGAACG ACCCTGGCTT TCCCGCGGAC GAGCGTTTGG GCGAGTGTGC GCCGGTCGTC GAGGTCGTCA GGAGCTTTTC GCGTGAGGCG CTTGTCGGAT TTGGTTTAGT CGTTTTCTGG ACGGTTTGCT CTTATGTCCT ACTGTTCTAC ATCCCGACCT ACGCTTCGAA GGTTCTGAGA CTCCCGTCTT CTACGGGTTT CATCGCAGTG CTTGTCGGCG CGTCAATTGT TCTCTTCGTC ACGCCTTTGA TTGGACACTT TTCCGATCTG TTTGGGCGCC GCTGGTTCCT TGCGGGAGCG TTGCTCGTTG CGATCGTCGC GGCTTATCCG CTGTTCGCTA TGTTGAATGC CGCACCAGGG TTGAAGTCGT TGCTCGTGTT CCAGGTGGTG TTCGGGCTCG TTATCGCCAG CTACGAGGGG CCAATCCTGG CGGCGCTTAG CGACATGTTT CCAGATGGGG TTCTGTCGAC TGGGATTTCG ATCTCGTACA ACCTCGCCGT GATCACGTTT GGTGGATTCT CCGCCGCGAT CATTACGTGG GCGATTGCGA CCACGCACAA CAACCTCGCG CCGGCATTCT ACGTGATAGC AGCGGCCATC GTGAGCTTGA TATCCGTGTC TCTCTGGCAA CCTCGCAGGA AGTAG
|
Protein sequence | MFDSRNNSII RGHEIMQTST YAREAAPSAS SDTHRRAVIA VIVGNGFEWF DFISYSFFSV IIAKLFFPST DDNLSLLLSV STIGVGFFMR PIGGIVIGGI ADKVGRRAAL TVTIALMTAG TAMIGFAPTY KDAGLGAPLM IVVARLLQGF SAGGEMGGAT AYLRERVSAE RHGYYTSWIQ ASIGFAIILA SVLAVFIVKC LDEQQIESWG WRIPFLLGLG LGPVGIYIRS RLNDPGFPAD ERLGECAPVV EVVRSFSREA LVGFGLVVFW TVCSYVLLFY IPTYASKVLR LPSSTGFIAV LVGASIVLFV TPLIGHFSDL FGRRWFLAGA LLVAIVAAYP LFAMLNAAPG LKSLLVFQVV FGLVIASYEG PILAALSDMF PDGVLSTGIS ISYNLAVITF GGFSAAIITW AIATTHNNLA PAFYVIAAAI VSLISVSLWQ PRRK
|
| |