Gene Information |
Locus tag | BURPS1106A_A0754 |
Symbol | |
ID | 4904360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 746998 |
End bp | 748035 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640143860 |
Product | quaternary amine ABC transporter periplasmic substrate-binding protein |
Protein accession | YP_001074790 |
Protein GI | 126457569 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | [TIGR03414] choline ABC transporter, periplasmic binding protein |
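The length fields in this record follow from its coordinates. The sketch below checks that arithmetic. It assumes that Start bp and End bp are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(746998, 748035)
print(length_bp)                  # 1038
print(protein_length(length_bp))  # 345
```

Both results agree with the Gene Length and Protein Length rows above.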
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGCCCG GCGTCACGCA TTTTCAAATG GAGCAAGCGA TGAAACGATA CGAATCCATT GCGCGGCGGC TCGCGCGCCG CGCGGCAGCC GCATCGCCGG CGTTCGCGGC GTTGGCATGG TGCGCCGCGG CGGCCGCCGC CACGACCACG GCGGCCGCGG CGGAGCCGGC CGCCTGTCGC GACGTGCGGA TGGCCGGCCC CGGCTGGACC GATATCGAAG CGACGAACGC GCTCGCGGGC GTCGTGCTGA AGGCGCTCGG TTACCGGCAG AGCGTGTCGA ACCTGTCGGT GCCGATCACG TATCAAGGTC TGAAGAAAGG GCAGCTCGAC GTGTTCCTCG GCAACTGGAT GCCGGCGCAG GCGCCGCTCG TCAAGCCGTT CGTCGACGCG CGCGCGATCG ACGTGCTCCA CGCGAACCTG AGCCATGCGA AATTCACGCT CGCGGTGCCG GACTACGTGG CGGCGGCGGG CGTGCATTCG TTCGCCGACC TCGCGAAGTA CGCGCAGCGC TTCGGCGCGA AGATCTACGG CATCGAGCCG GGCGCGCCGG CCAATCAGAA CATCTCGCGC ATGCTCGCCG ACAAGGCGCT CGGGCCGGCG AACTGGCAGC TCGTCGAATC GAGCGAGACA GGGATGCTGA CGCAGGTCGA GCGCGCGGTG CGCGAGCGCC AGTGGATCGT GTTTCTCGGC TGGGAGCCGC ACCTGATGAA CACGAAATTC CATCTCGTTT ATCTGTCGGG CGGCGACGCG TATTTCGGGC CGGACTACGG CGGCGCGACC GTCAACACCG TCGCGCGCGC GGATTTCGCG AGCCAGTGCG CGAATCTCGC GCGGCTGTTC CGACAAATGA CGTTCACCGT CGATCTGGAG AACGGAATGA TCGCCGCGAT GCTGCAGGGC AAGCGCTCCG CCGTGGATGC CGCGCAACAC GCGCTGCGTG CGAACCCGTC GCTCGTCGAA GCATGGCTCG ACGGCGTGCG CACCGCGAGC GGCGCGCCAG GCTTGCCTGC GGTGCGCGCG GCGCTCGATG CGCAATGA
Protein sequence | MTPGVTHFQM EQAMKRYESI ARRLARRAAA ASPAFAALAW CAAAAAATTT AAAAEPAACR DVRMAGPGWT DIEATNALAG VVLKALGYRQ SVSNLSVPIT YQGLKKGQLD VFLGNWMPAQ APLVKPFVDA RAIDVLHANL SHAKFTLAVP DYVAAAGVHS FADLAKYAQR FGAKIYGIEP GAPANQNISR MLADKALGPA NWQLVESSET GMLTQVERAV RERQWIVFLG WEPHLMNTKF HLVYLSGGDA YFGPDYGGAT VNTVARADFA SQCANLARLF RQMTFTVDLE NGMIAAMLQG KRSAVDAAQH ALRANPSLVE AWLDGVRTAS GAPGLPAVRA ALDAQ
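Both sequences are printed in space-separated blocks of ten. The sketch below joins such blocks and translates DNA to protein. It assumes the standard sense-codon assignments, which translation table 11 shares with the standard code (the tables differ in which codons may initiate translation; this gene starts with ATG, so that difference does not matter here). The helper names `join_blocks`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first, second, third base each cycle through TCAG,
# with the third base changing fastest. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
prefix = join_blocks("ATGACGCCCG GCGTCACGCA TTTTCAAATG")
print(translate(prefix))  # MTPGVTHFQM
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `join_blocks` and then to `gc_fraction` or `translate` would allow the GC content and protein rows to be checked the same way.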