Gene Information |
Locus tag | BURPS668_A1918 |
Symbol | |
ID | 4887611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1864048 |
End bp | 1865634 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640131856 |
Product | major facilitator superfamily permease |
Protein accession | YP_001062913 |
Protein GI | 126442520 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
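The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1864048
END_BP = 1865634


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1587, matches "Gene Length"
print(protein_length(length_bp))  # 528, matches "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.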
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00659527 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGC CGGCGAGACC GGCGGCAGCG GCCGCGAACG GCGCGGGGGC GCACGAGCCG GCCGCGGCGC GCCCGCTGCG CGGCGCGAAG CTCGCGCTGC TGACGTTCGC GCTGTCGCTC GCGACGTTCA TCGAAGTGCT GGACTCGACG GTGGCGAACG TCGCGGTGCC GGCGATCTCG GGCAGCCTCG GGGTATCGAA CAGCCAGGGC ACGTGGGTGA TCAGCTCGTA CTCGGTGGCC GCGGCGATCG CGGTGCCGCT GACGGGCTGG CTCGCGCGGC GGGTGGGCGA GCAGCGGCTG TTCGTCGCGT CGGTGATTCT GTTCACGCTG ACGTCGCTGC TGTGCGGGCT CGCGCGGGAC CTGGAGGTGC TGGTGGCGTG CCGGGCGCTG CAGGGGCTGT TCTCGGGGCC GATGGTGCCG CTGTCGCAGA CGATCCTGAT GCGCGCGTTT CCGCCGGCGA AGCGCACGCT CGCGCTGGCG CTGTGGGGGA TGACGGTGCT GCTCGCGCCG ATCTTCGGGC CGGTGGTGGG CGGCTGGCTG ATCGACAACT TCTCGTGGCC GTGGATCTTC CTGATCAACC TGCCGATCGG GCTGTTCTCG TTCGCGGTGT GCACGCTGAT GCTGCGGCCG CGGGCCTCGC GCGGCGAGGC GAGCCCGATC GACGTGCCGG GGATCGTGCT GCTGGTGATC GGCGTGGGCT CGCTGCAGGC GATGCTGGAC CTGGGGCATG ACCGGGGGTG GTTCGATTCG TCGCTGATCA CGGCGCTGGC GATCGCGGCG GGGGTGTCGC TCGTGTCGCT GCTGATCTGG GAGCTGGGCG AGGCGCACCC GGTGGTGGAG CTGAGCCTGT TCCGGGAGCG GACCTTCACG TTCTGCGTGG TGATCATCTC GCTGGGGATG ATGAGCTTCT CGGTGGTGGG GGTGGTGTTT CCGCTGTGGC TGCAGGCGGT GATGGGATAC ACGGCGTACC AGGCGGGGCT GGCGACGGCG TCGATGGGGC TGCTGGCGCT GGTGTTCTCG ATCCTGGTGG GGGTGTATGC GAGCCGGGTG GACGCACGGG TGCTGGTGAC GTTCGGGTTC GGGGTGTTCG CGGCGGTGAT GGGGTGGAGC ACGCACTTCA CGCTGTCGAT GACGTTCGCG CAGGTGGTGA CGCCGCGGCT GATCCAGGGG ATGGGGCTGC CGTGCTTCTT CATTCCGCTG ACGGCGGCGA CGCTGTCGCG GGTGGCGGAC GACAAGCTGG CGGCGGCGTC GAGCCTGTCG AACTTCCTGA GGACGTTGTC GGCGGCGTTC GGCACGGCGC TGAGCGTGAC GTGGTGGGAC AACCGGGCGA CGTATCACTA CGCGGTGGTG TCGCAGGCGG TGACGCGGGC CTCGGAGAAC ACGCAGCGGT ACGTGGACGC GCTGCACGCG ATGGGGCTGC ACGGCGCGCG CGAGCTGAGC TCGCTGCACC AGGTGGTGCG GCAGCAGGCG TACATGATGG CGACGAACGA CATGTTCTAC ATGGCGAGCG TGACGTGCGT GCTGCTGGCG GGGCTGATGT GGCTGACGCG GCCGAAGCGG GGCGCGGCGG CGACGATGGG GCATTGA
|
Protein sequence | MSAPARPAAA AANGAGAHEP AAARPLRGAK LALLTFALSL ATFIEVLDST VANVAVPAIS GSLGVSNSQG TWVISSYSVA AAIAVPLTGW LARRVGEQRL FVASVILFTL TSLLCGLARD LEVLVACRAL QGLFSGPMVP LSQTILMRAF PPAKRTLALA LWGMTVLLAP IFGPVVGGWL IDNFSWPWIF LINLPIGLFS FAVCTLMLRP RASRGEASPI DVPGIVLLVI GVGSLQAMLD LGHDRGWFDS SLITALAIAA GVSLVSLLIW ELGEAHPVVE LSLFRERTFT FCVVIISLGM MSFSVVGVVF PLWLQAVMGY TAYQAGLATA SMGLLALVFS ILVGVYASRV DARVLVTFGF GVFAAVMGWS THFTLSMTFA QVVTPRLIQG MGLPCFFIPL TAATLSRVAD DKLAAASSLS NFLRTLSAAF GTALSVTWWD NRATYHYAVV SQAVTRASEN TQRYVDALHA MGLHGARELS SLHQVVRQQA YMMATNDMFY MASVTCVLLA GLMWLTRPKR GAAATMGH
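The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so its ends can be checked against the protein sequence directly. The following is a minimal sketch, not a full translator: the codon table is a hand-written subset covering only the codons in the two excerpts, and the `translate` helper is hypothetical. For these particular codons, translation table 11 gives the same amino acids as the standard code.

```python
# Partial codon table (assumption: only the codons needed for the excerpts below).
CODONS = {
    "ATG": "M", "AGC": "S", "GCG": "A", "CCG": "P", "AGA": "R",
    "GCA": "A", "GGG": "G", "CAT": "H", "TGA": "*",
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODONS[seq[i:i + 3]]  # KeyError for codons outside the subset
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence, as printed in the record.
print(translate("ATGAGCGCGC CGGCGAGACC GGCGGCAGCG"))  # MSAPARPAAA
# Last 12 nt of the gene sequence (in frame), ending in the TGA stop codon.
print(translate("ATGGGGCATTGA"))  # MGH
```

The two results match the first ten and last three residues of the protein sequence above.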
|