Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1492 |
Symbol | |
ID | 4902298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 1448522 |
End bp | 1450126 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640134723 |
Product | putative proline/betaine transporter |
Protein accession | YP_001065766 |
Protein GI | 126452173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCCTT TCACGCTCGC GACCGACCCT CGTCCGATCG CGAGCGGCAC TCCCGGCAAA CCGTGGCGAA CCGCGCCCCG TCGCGCGGTT CGGCCGCCGT CACAGGAGTT TTCGACCTTG ACTGCAACAC CCGCCCCCTC CAGTTCGTCC AGCGCGCCCA CCGAAGGCGC GCTTCCCGCC GCTGCGCACG AGATCACCGT CGTCGATCAG GGCCTGCTCA AGCGCGCCGT CGGCGCGATG GCGCTCGGCA ACGCGATGGA ATGGTTCGAC TTCGGCGTCT ACAGCTACAT CGCCGTCACG CTCGGCCAGG TGTTCTTCCC GTCGAGCAGC CCGTCCGCGC AGTTGCTCGC GACGTTCGGC ACGTTCGCCG CCGCCTTCCT CGTGCGCCCG CTCGGCGGGA TGGTGTTCGG GCCGCTCGGC GATCGCATCG GCCGCCAGCG CGTGCTCGCG ATGACGATGA TCATGATGGC GGTCGGCACG TTCGCGATCG GCCTGATCCC GAGCTACGAC TCGATCGGCC TCCTCGCGCC CGTGCTGCTC CTCGTCGCGC GTCTCGTGCA AGGCTTCTCG ACGGGCGGCG AGTACGGCGG CGCGGCAACC TTCATCGCCG AGTTCTCGAC CGACAAGCGC CGCGGCTTCA TGGGCAGCTT CCTCGAGTTC GGCACGCTGA TCGGCTATGT GATGGGCGCG GGCGTCGTCG CGCTGCTGAC GGCTTCGCTG TCGCACGACG CGCTGCTGTC GTGGGGCTGG CGCGTGCCGT TCCTGATCGC CGGCCCGCTC GGCCTGATCG GCCTGTACAT CCGGATGAGG CTCGAGGAAA CGCCCGCGTT CAAGCGGCAG GCCGAAGCGC GCGAAGCGCA GGACAAGGCC GTGCCGAAGG CGCATTTCCG CCGACAGCTC GCGCGGCACT GGCGCGCGCT GCTGCTGTGC GTCGGCCTCG TGCTGATCTT CAACGTCACC GATTACATGG CGCTGTCGTA CCTGCCGAGC TATCTGTCGT CGACGCTGCA CTTCGACGAG GCGCACGGCC TCGTGCTGAT CCTGATCGTG ATGGTGCTGA TGATGCCGAT GACGCTCGCC ACGGGCCGCC TGTCGGACGC CGTCGGCCGC AAGCCGGTGA TGCTCGCCGG CTGCGTCGGG CTCTTCGCGC TCGCGATTCC CGCGCTGCTC CTGATCCGCA CCGGCGAGAC GGCGCTCGTG TTCGGCGGCC TGCTGATCCT CGGCGCACTG CTGTCGTGCT TCACGGGCGT GATGCCGTCG GCGCTGCCCG CGCTCTTTCC GACCGAGATC CGCTACGGCG CGCTCGCGAT CGGCTTCAAC GTGTCGGTGT CGCTGTTCGG CGGCACGACG CCGCTCGCCG CCGCGTGGCT CGTCGACGCG ACGGGCAACC TGATGATGCC CGCGTACTAC CTGATGGGCG CGGCCGTGAT CGGCGCGATC TCGGTGCTCG CGCTGCCCGA GAGCGCGCGC CAGCCGCTCA AGGGCTCGCC GCCCGCCGTC GCGTCGCACC GCGAGGCACA CGCGCTCGCG CGCGAGATCA AGCGCCGCGA GGCGGCCGAG CGCGACGACA GCGGCTACCC GTCGGCCGCG GCGTTGCGCG CGTGA
|
Protein sequence | MRPFTLATDP RPIASGTPGK PWRTAPRRAV RPPSQEFSTL TATPAPSSSS SAPTEGALPA AAHEITVVDQ GLLKRAVGAM ALGNAMEWFD FGVYSYIAVT LGQVFFPSSS PSAQLLATFG TFAAAFLVRP LGGMVFGPLG DRIGRQRVLA MTMIMMAVGT FAIGLIPSYD SIGLLAPVLL LVARLVQGFS TGGEYGGAAT FIAEFSTDKR RGFMGSFLEF GTLIGYVMGA GVVALLTASL SHDALLSWGW RVPFLIAGPL GLIGLYIRMR LEETPAFKRQ AEAREAQDKA VPKAHFRRQL ARHWRALLLC VGLVLIFNVT DYMALSYLPS YLSSTLHFDE AHGLVLILIV MVLMMPMTLA TGRLSDAVGR KPVMLAGCVG LFALAIPALL LIRTGETALV FGGLLILGAL LSCFTGVMPS ALPALFPTEI RYGALAIGFN VSVSLFGGTT PLAAAWLVDA TGNLMMPAYY LMGAAVIGAI SVLALPESAR QPLKGSPPAV ASHREAHALA REIKRREAAE RDDSGYPSAA ALRA
|
| |