Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_3799 |
Symbol | |
ID | 4884898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 3712743 |
End bp | 3714044 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 640129727 |
Product | major facilitator transporter |
Protein accession | YP_001060794 |
Protein GI | 126440344 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000291886 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCTC GTTCGTTTCC CGAGATCCTT TCGATGAACT GCCCTGCCGA TCTTCGCACC GGCCCGGCCG CCGGCCGACC GCGCACCGCT CCCGCCCCGG CGCTCACGCC CGCGCTCACC ATGTTCTTCT CGGCAACGGT CGGCGTGATC GTGCTCAACC TGTTCGCCGC GCAGCCGCTG ACGGGCCCGA TCGCGGCCGA ACTGCGGCTG CCCGCCAGTC TGACGGGGCT CGTCGCGATG CTGCCGCAGC TCGGCTACGC GGCGGGCCTC GTGCTGCTCG TGCCGCTCGT CGACCTGCTC GAAAACCGCC GGCTCATCGT GACGACGCTC GCCGTCTGTG CGGCGACGCT CGCGCTGCCC GCCGTCACGC GCTCCGGCGC CGTGTACCTC GCGGCGGTGT TCGCCGCCGG GGCGGCGTCG AGCGTGATTC AGATGCTGGT GCCGATGGCG GCGTCGATGG CCCCCGACGA ACGGCGCGGC CGCGCGGTCG GCAACGTGAT GAGCGGCCTG ATGCTCGGCA TCCTGCTGTC GCGGCCGCTC GCGAGCCTGA TCGCCGGCGC GGCCGGCTGG CGCGCGTTCT ATGGCGCGGC CGCGGCCGCC GATATCGCGA TCGCCGCGGT GCTCGCCGCG AAGCTGCCGC TGCGCGCGCC GCAGCTGTCG ACCCGCTACG CGGCGCTGCT CCGCTCGCTC TGGACGCTCG TCGCGACCGA GCGCGTGCTG CAGCGGCGCG CGCTGTCCGC GGCGCTGTCG ATGGCCGCGT TCAGCGCGTT CTGGACCGCG ATCGGCCTGC GTCTCGCCGC CGCGCCGTTC GATCTCGGCC TGCACGGCAT CGCGATGTTC GCGTTCGCCG GCGCCACCGG CGCGATCGTC ACGCCGTTCG CGGGCCTGGC CGGCGACCGC GGCCGGGAGC GCGACGCACT GCGCGGCGCG CACGTGGCGA TGCTCGCCGC GTTGGCCGCG CTCGGCGTCG CGGGGGCCGG CTGGGGCGGA TTCGACGCGG CCGCGCATCC GGCGCTCGCG CTCGCGCTGC TCGTCGCCGG TGCGGCGGCG CTCGACGCGG GCGTCGTCGC CGATCAGACG CTCGGCCGGC GCGCGATCAA TCTGCTCGAT CCCGCCATGC GCGGGCGGCT CAACGGGCTG TTCGTCGGCC TGTTCTTCGT CGGCGGCTCG CTCGGCGCCG TGCTCGCCGG CGCGGCATGG GCCTGGGCGG GCTGGGGCGC GGTGTGCGCG GTAGGGCTCG TGTTTGCGGG CGCCGCGTTC GCGCTGGACT GGATCGGCGC GCACGGGCCG GCGCCCCGCT GA
|
Protein sequence | MGARSFPEIL SMNCPADLRT GPAAGRPRTA PAPALTPALT MFFSATVGVI VLNLFAAQPL TGPIAAELRL PASLTGLVAM LPQLGYAAGL VLLVPLVDLL ENRRLIVTTL AVCAATLALP AVTRSGAVYL AAVFAAGAAS SVIQMLVPMA ASMAPDERRG RAVGNVMSGL MLGILLSRPL ASLIAGAAGW RAFYGAAAAA DIAIAAVLAA KLPLRAPQLS TRYAALLRSL WTLVATERVL QRRALSAALS MAAFSAFWTA IGLRLAAAPF DLGLHGIAMF AFAGATGAIV TPFAGLAGDR GRERDALRGA HVAMLAALAA LGVAGAGWGG FDAAAHPALA LALLVAGAAA LDAGVVADQT LGRRAINLLD PAMRGRLNGL FVGLFFVGGS LGAVLAGAAW AWAGWGAVCA VGLVFAGAAF ALDWIGAHGP APR
|
| |