Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1404 |
Symbol | |
ID | 4881806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1373574 |
End bp | 1374770 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640127332 |
Product | major facilitator family transporter |
Protein accession | YP_001058447 |
Protein GI | 126442269 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCGC AGCCTCGCCA CTCCGCCGGC ACGCCCGGCC ACTATTCACG CAGCCTGTTG CTGCTGCTCG CGACGATCGC CGGCGTCTCC GTCGCGAATA TCTATTACAA CCAGCCGCTG CTCGACGCAT TCCGCGCATC GTTCCCGGGC AGCGCGTCAT GGATCGGCGT CGTGCCGACC GCGACGCAGC TCGGCTACGC AACCGGCATG CTCGTCCTCG CGCCGCTCGG CGACCGCTTC GACCGGCGCA CGCTGATCCT GCTGCAGATC GCCGGGCTGT CGGCCGCGCT CGTCGTCGCG GCGGCCGCGC CGACGCTCGG CGTGCTCGCC GCGGCAAGCC TCGCGATCGG CATCCTCGCG ACGATCGCGC AGCAGGCGGT GCCGTTCGCC GCCGAGATCG CGCCGCCCGC CGCGCGCGGG CAGGCGGTCG GCACCGTGAT GAGCGGCCTG CTGCTCGGCA TCCTGCTCGC GCGCACGGCG GCGGGCTTCG TCGCCGAATA CTTCGGCTGG CGCGCGGTGT TCGCCGTATC GGTCGCGGCG CTCGCCGCGC TCGCGGCCGT GATCGTCGCG CGCCTGCCGC GCAGCTCGCC GACATCGACG CTGCCGTACG GCAAGCTGCT CGCATCGATG TGGCAGCTCG TGCGCGAGTT GCGCGGACTG CGCGAGGCGT CGATGACGGG CGGCGCGATC TTCGCCGCGT TCAGCGCGTT CTGGCCGGTG CTCACGCTGC TGCTCGCGGG CGCGCCGTTT CATCTGGGCC CGCAGGCGGC GGGGCTCTTC GGGATCGTCG GCGCGGCGGG CGCGCTCGCC GCGCCGTACG CGGGCCGCTT CGCCGACAAG CGCGGCCCGC GCGCGATCAT CTCGCTCGCG ATCGCGCTGA TCGCCGCGTC GTTCGCGATC TTCGCGCTGT CGGGCGCGAG CCTCATCGGG CTCGTGATCG GCGTGATCGT GCTCGACGTC GGCGTGCAGG CCGCGCAGAT CTCGAACCAG TCGCGCATCT ACGCGCTGAA GCCGGACGCG CGCAGCCGCG TGAACACGGT GTTCATGGTC TGCTACTTCA TCGGCGGCGC GATCGGCTCG TCCGCGGGCG TCGCCGCATG GCGCGCGACG GGCTGGCTCG GCATGTGCGC GGTCGGCCTG CTGTTCTCGA TCGTCGCGGC GATCGTGCAT TTCCGCGGCG GCGCGGGCGC GCGATAA
|
Protein sequence | MSSQPRHSAG TPGHYSRSLL LLLATIAGVS VANIYYNQPL LDAFRASFPG SASWIGVVPT ATQLGYATGM LVLAPLGDRF DRRTLILLQI AGLSAALVVA AAAPTLGVLA AASLAIGILA TIAQQAVPFA AEIAPPAARG QAVGTVMSGL LLGILLARTA AGFVAEYFGW RAVFAVSVAA LAALAAVIVA RLPRSSPTST LPYGKLLASM WQLVRELRGL REASMTGGAI FAAFSAFWPV LTLLLAGAPF HLGPQAAGLF GIVGAAGALA APYAGRFADK RGPRAIISLA IALIAASFAI FALSGASLIG LVIGVIVLDV GVQAAQISNQ SRIYALKPDA RSRVNTVFMV CYFIGGAIGS SAGVAAWRAT GWLGMCAVGL LFSIVAAIVH FRGGAGAR
|
| |