Gene Information |
Locus tag | BURPS1106A_1216 |
Symbol | |
ID | 4900898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 1199772 |
End bp | 1201034 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640134446 |
Product | major facilitator transporter |
Protein accession | YP_001065495 |
Protein GI | 126451518 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
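The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the coding sequence ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1199772
END_BP = 1201034
GENE_LENGTH_BP = 1263
PROTEIN_LENGTH_AA = 420


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1201034 - 1199772 + 1 gives 1263 bp, and 1263 / 3 - 1 gives 420 aa, matching the table.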
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0301654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCC GGCATGCGGT CAGCGCGCGT AGCCTGCGCG CCCTCGACTG GCTCAACTTC TTCGTCGCGA ACGTGCAGAC AGGCTTCGGT CCGTTCATCG CGTCGTATCT CGCGTCGCAC AAGTGGACGC AGGGCGAAAT CGGCATGGTG CTGTCGATCG GCACGATCAG CGCGATGGTG AGCCAGGTGC CCGGCGGCGC GGCCGTCGAT GCGCTGAAGA ACAAGAAAGG CGCCGCCGCG TGGGCGATCG CCGCGATCAT CCTGTCCGCG GTGCTGCTCG CCGCGAGCCC GACCGTCGTG CCCGTGATCG CGGCCGAGGT GTTCCACGGC TTCGCGAGCT GCATGCTCGT GCCGGCAATG GCGGCGATCT CGTTCGCGCT CGTCGGCCGC GAGAGCCTGG GCGACCGGCT CGGCCGCAAC GCGCGCTGGG CGTCGCTCGG CAGCGCGGTC GCGGCGGGTC TGATGGGGCT CACGGGCGAG TACTTCTCCG CGCGCGCGGT GTTCTGGCTG ACGGCGGCCC TCGCGCTGCC CGCGCTCGTC GCGCTCGCGA TGATCGAGCC GACGCACCAT CATCATCACG CGGCGCCACG CGCGTCGGCG CCACGCGCCG ACGAAGACGA AGACGAAGAA CGCGAAACGC TGCGCGAACT GCTGCGCGAC AAGCGGATGC TGATCTTCGC CGCCTGCGTC GTGTTGTTCC ATCTGTCGAA CGCGGCGATG CTGAACCTCG CCGCGGGCGA AGTGACGGCG GGCATGGGCG AGAACGTGCA GCTCGTGATC GCCGCGTGCA TCATCGTCCC GCAGGCGATC GTCGCGATGC TTTCGCCGTG GGTCGGACGC TCCGCGCAGC GCTGGGGCCG CCGGCCGATC CTGCTGCTCG GTTTCGCCGC GCTGCCGCTG CGCGCGCTGC TGTTCGCCGG CGTCTCGAGC CCGTACCTGC TCGTGCCGGT GCAGATGCTC GACGGCATCA GCGCCGCCGT GTTCGGCGTG ATGCTGCCGC TCATCGCGGC GGACGTCGCG GGCGGCAAGG GGCGCTACAA CCTGTGCATC GGGCTCTTCG GACTCGCGGC GGGCGTCGGC GCGACGCTCA GCACCGCGCT CGCCGGCTTC GCGGCCGACC ACTTCGGCAA CGCGATGAGC TTCTTCGGGC TCGCCGCCGC GGGCGCGCTC GCGACGCTGC TCGTGTGGTT CGCGATGCCC GAGACGCGCG ACGCGGCGCT CGCCGAAGAC GCTCGGCACT CGAGCGCCGA GCCGGCGCAG TAA
|
Protein sequence | MTVRHAVSAR SLRALDWLNF FVANVQTGFG PFIASYLASH KWTQGEIGMV LSIGTISAMV SQVPGGAAVD ALKNKKGAAA WAIAAIILSA VLLAASPTVV PVIAAEVFHG FASCMLVPAM AAISFALVGR ESLGDRLGRN ARWASLGSAV AAGLMGLTGE YFSARAVFWL TAALALPALV ALAMIEPTHH HHHAAPRASA PRADEDEDEE RETLRELLRD KRMLIFAACV VLFHLSNAAM LNLAAGEVTA GMGENVQLVI AACIIVPQAI VAMLSPWVGR SAQRWGRRPI LLLGFAALPL RALLFAGVSS PYLLVPVQML DGISAAVFGV MLPLIAADVA GGKGRYNLCI GLFGLAAGVG ATLSTALAGF AADHFGNAMS FFGLAAAGAL ATLLVWFAMP ETRDAALAED ARHSSAEPAQ
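The relationship between the gene sequence and the protein sequence can be sketched in a few lines of plain Python. This is an illustrative sketch, not the method the database used: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene is on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names, and only the first 30 bases of the record are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces used to group bases in tens and upper-case the result."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    prefix = "ATGACGGTCC GGCATGCGGT CAGCGCGCGT"
    print(translate(prefix))  # MTVRHAVSAR, the first 10 residues of the protein
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence, and `gc_fraction` on the full sequence can be compared with the GC content field; neither full-length result is asserted here.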