Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0882 |
Symbol | |
ID | 4884230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 860229 |
End bp | 861440 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640126810 |
Product | major facilitator family transporter |
Protein accession | YP_001057933 |
Protein GI | 126441928 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.158047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT GCACGACGCG GCCCGCCGGC TTCGCGCGGC CGTCGCGCGA AGCCGCGCGC CTGCCGCTCG CGGGATTGCT CGCGCTCGCG ACGGCCGGCT TCATCACGAT CGTGACCGAG GCGCTGCCCG CCGGGCTGCT GCCGCTGATG GGGCGCGACC TGCGCGTGTC CGATGCGCTC GTCGGCCAGC TCGTCACAGT CTATGCGGCG GGCTCGATCG TCGCGGCGAT TCCGCTCGTC GCGGCGACGC GCGGCATGCG CAGGCGGCCG CTGCTGCTCG CCGCGCTCGC GGGCTTCGTC GTCGCGAACA CGGCGACGGC CGCGTCGCCG TACTACGCGC CCGTGCTCGT CGCGCGCTGC GTCGCGGGCG TCTCGGCGGG GCTCCTGTGG GCGCTGCTCG CGGGCTACGC GAGCCGGATG GTCGACGCGC GGCAGCGCGG CCGCGCGATC GCGATCGCGA TGCTCGGCGC GCCGGTGGCG ATGTCGGTCG GCATTCCGCT CGGCACGGCG CTCGGCGCCG CGCTCGGCTG GCGCGCGACG TTCGCCGGCG TGACGGCGCT CACGCTCGCG CTAATCGCGT GGGTGCGCGC GAGCCTGCCC GATGCGCCGG GGCGGCCCTC GGGCGAGCGG CTGCCGGTCG CCCGCGTGCT GCGGATGCCG GGCGTGCTGC CCGTGCTGGC GGTGATGTTC GCGTACGTGC TCGCGCACAA CATCCTCTAC ACGTACATCG CGCCGTTTCT CGCGAGCGCC GGGATGGGCG CGCGCATCGA CGCGACGCTG TTCGCGTTCG GCGCGGCGTC GTTCGCGGGC ATCGGTCTCA CGGGCGTGTG GATCGGCAAC GGGCTGCGGC GGCTCGCGCT CGCGAGCATC GCGCTTTTCG CGCTCGCGTC CGTGCTGCTC GGCGTGGCGA GCGGATCGCC CGCGGTCGTC TATGCGAGCG TCGCCGTGTG GGGGCTCACG TTCGGCGGCG CGGCGACGGT CTTCCAGACC GCGTCGGCGA ACGCGGCGGG CGAGGCGGCG GACGTCGCGC AATCGATGAT CGTCACGGTG TGGAATCTCG CGATCGCGGC CGGCGGCGTC GCGGGCGGCG TGCTGCTCGA GCGGTTCGGC GCGGGCGCGA TGCCGTGGGC GCTCGTCGCG CTGCTCGTGC CCGCGTGGTT CGGCGCGTGG CGCGCGCGGC GCCACGGCTT CCCGGCGGCC CGCGCGCCGT GA
|
Protein sequence | MSDCTTRPAG FARPSREAAR LPLAGLLALA TAGFITIVTE ALPAGLLPLM GRDLRVSDAL VGQLVTVYAA GSIVAAIPLV AATRGMRRRP LLLAALAGFV VANTATAASP YYAPVLVARC VAGVSAGLLW ALLAGYASRM VDARQRGRAI AIAMLGAPVA MSVGIPLGTA LGAALGWRAT FAGVTALTLA LIAWVRASLP DAPGRPSGER LPVARVLRMP GVLPVLAVMF AYVLAHNILY TYIAPFLASA GMGARIDATL FAFGAASFAG IGLTGVWIGN GLRRLALASI ALFALASVLL GVASGSPAVV YASVAVWGLT FGGAATVFQT ASANAAGEAA DVAQSMIVTV WNLAIAAGGV AGGVLLERFG AGAMPWALVA LLVPAWFGAW RARRHGFPAA RAP
|
| |