Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0562 |
Symbol | |
ID | 4888842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 527785 |
End bp | 529329 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640130503 |
Product | major facilitator superfamily permease |
Protein accession | YP_001061568 |
Protein GI | 126443791 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACGC ACCTTCCCCA CCCCACCCTT TCCCGCGCCG CGCTCGTGCG GATCGTGTCG ACCGTCAGCG CGGGCTTCGT GATCACGCAG CTCGACGTGA CGATCGTCAA CGTCGCGCTC GCGCGCATCG GCATCGATCT GCATACGAGC GTCGCGGGCC TGCAATGGAT CGTCGACGCC TACACGCTCG CGCTCGCGGG CCTGATGCTG TCGGCCGGCG CGCTCGGCGA CCGGTTCGGC GCGCGCCGGC TGTTCGCCGC CGGGCTCGCG CTCTTCGCCG TCGCATCGTT CGTCTGCGGC ATCGCCGCGA ACGCGGCGAC GCTCATCGCC GCGCGCGCGC TGCAAGGGCT CGCGGCGGCC GCGATGCTCC CGAACTCGCT CGCGCTGCTC AACCATGCGT GCGCGCACGA TCCACGGCTG CGGGCGCGCG CGGTCGGCTG GTGGACCGCG TCGGGCGCGA TCTCGATCGC GGCGGGCCCG GTGATCGGCG GCGTGCTGAT CGCGCAGTTC GGCTGGCGCA GCATCTTCTT CGTCAACCTG CCGCTGTGCG CGGCGGGCCT CGCCGCGACG CGCCTGTGGA TCGACAGCGA CGACACGCGC GCGGCGGCCG CTTCGTCGGC CTCGTCCGCT TCGTCCGCCT CGTCCGCTTC ATCCGCCGCC GACGGCGCGC ACCATTCGCA CCCGGCGGCG GACCGGCGCG CATCCGGCCC GAATGCCTGC GCCCCCGCGC CCAAGCGCGC CGCCGGCGCC GGAACACGCG GCCTCGACCT GCCCGGCCAG TGCCTCGCCG TCGTCGCGCT CACGCTCTTC ACCGGCGCGG TGATCGACTG GCGTCCGGCG CTCGTCGCGA TCGCGCTCGC GGCGGGCGCC GCGTTCGTGT TCGTCGAATC GCGCAGCGCG CACCCGATGA TGCCGCTCGC GCTGTTCAGG CGGCGCACGT TCAGCGTCGC CGTGCTGTTC GGTGTCTGCA TGAATCTGTC GTACTACGGG ATCATCTTCG TGCTGAGCCT GTATTTGCAG CGCGTGCGCC ACGACACGCC GCTCGAGGCG GGGCTCGCGT TCCTGCCGCT CACGGGGGGC TTCCTGCTGT CGAACGTCGC GAGCGGCTGG GCGACCGCGC ACTACGGCCC GCGCCGGCCG ATGATCGCCG GCGCGCTGAT CGGCGCGACC GGTTTCGCGC TGCTGAGCGC GGTGCGTGCC GATACGCCGA TCGCGATGCT CGTCGTGCCG TTCCTGCTGA TTCCGGGCGG CATGGGGCTC GCGGTGCCCG CGATGACGAC GACGGTGCTC GCCTCGGTCG AACGCGCGCG CGCCGCGACC GCGTCGGCCG TGCTCAATAC CGCGCGGCAA GCGGGCGGCG CGATCGGCGT CGCCGCGTTC GGCGCGCTCG CGAGCGGCGC GCGGCCGCCG GACATCGTGT CGGGATTGCA CGCGTCGGCG TACGTATCGG CCGCGCTCTT CGTGTTCGCC TCGGCGATGG CGGCCTTCGT GCACGGCGCG CCGCAGCCGG CGGCGGGCAA GCCGGCGAAC GCGAACGCAC GCTGA
|
Protein sequence | MDTHLPHPTL SRAALVRIVS TVSAGFVITQ LDVTIVNVAL ARIGIDLHTS VAGLQWIVDA YTLALAGLML SAGALGDRFG ARRLFAAGLA LFAVASFVCG IAANAATLIA ARALQGLAAA AMLPNSLALL NHACAHDPRL RARAVGWWTA SGAISIAAGP VIGGVLIAQF GWRSIFFVNL PLCAAGLAAT RLWIDSDDTR AAAASSASSA SSASSASSAA DGAHHSHPAA DRRASGPNAC APAPKRAAGA GTRGLDLPGQ CLAVVALTLF TGAVIDWRPA LVAIALAAGA AFVFVESRSA HPMMPLALFR RRTFSVAVLF GVCMNLSYYG IIFVLSLYLQ RVRHDTPLEA GLAFLPLTGG FLLSNVASGW ATAHYGPRRP MIAGALIGAT GFALLSAVRA DTPIAMLVVP FLLIPGGMGL AVPAMTTTVL ASVERARAAT ASAVLNTARQ AGGAIGVAAF GALASGARPP DIVSGLHASA YVSAALFVFA SAMAAFVHGA PQPAAGKPAN ANAR
|
| |