Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2780 |
Symbol | |
ID | 4885688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2653766 |
End bp | 2655331 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132716 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_001063772 |
Protein GI | 126444225 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein [COG4564] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACGC TCAGCTTCAG ACAAAAGCTC TGGCTGCCGC TTGTTATCAG CCTGATCGCC CTCCTGCTCG TCTCGATCAG CTCCGCCTGG CTTTCCTACC AGACGCGGAT GGAAGAGCGC CGCAACGATC TGACGAACGT CGCGCACGTC GGCCTGAGCA TCGTTCGGGA GTATGCGGCG CTCGCGCAGC GCGGCGTGCT CACCGATGCG CAGGCGCGCA AGGAAGCGCT CGAGCGGCTG CGCAGCGTGC GCTACGGCAG CGACGGCTAT TTCATCGTGA TCGATTCGTC GCCCCGCATG ATCATGCACC CGATCAAGCC GGATACGGTC GGCAAGGACT TGCGACACGT CGCCGACGCC GACGGCCGCC ATCACTACCT GACGTTCGCG TCCGTCGCGC AATCCCCGCA AGGCGGCTTC GTCGACTATG TCTTTGCGCA TCCGAACGCG CATCCCGCGA AAGCCGTCGA CAAGCTCGGC TATGTGATCC GCTACGCGCC GTGGGACTGG ATCATCTCGA CAGGCGCATA CATCGACGAC ATCCAGGCGG CGTTCGCGAA GTCGCTCTGC CTGTCCGCCG GCGTGTTCGC GTTGCTCGCC GCGCTGCTCG CGCTGAGTGT CGTCTACACG AATCGCGGCA TCGAGCGCAT GATCGGCGGC GATCCGCGCA TCGCCGCGCA CATGGCGGGC GTCATCGCGT CGGGCGATCT CGCGACGCCG TTCGCCACGC GCCGGGAGGA CCGCAGCAGC CTGATGTTCG CGATCAGCCA GATGCGCGAC GCGCTCGCGA ACGCCGTCGC CCAGATCCGG GCAAGCGCCG GATATGTCGC GACCACGGCG GGCGAAATCG CGGACAGCAA CATGGATCTG TCTTCGCACA CCGAGTGCCA GGCGACGGCG CTGCAACAGG CGGCGTCGAG CATGCAGCGG CTCACGGAGA GGGTGCGCGA CACGGCGGAC CGCGCACGCG CCGCGAGCGA GCTGGCGGGC AGCGCCGCGC GGATCACGGA TCGCGGCGGC GACATGGTCG TGCGGGTCGT CGCGGCGATG AACGACATCC GCGCCGAGTC GCGGAAGATG GTCGACATCA TCGGTGTCAT CGAGGGCATC GCGTTCCAGA CCAACATCCT CGCGCTGAAC GCCGCGGTCG AAGCGGCGCG CGCGGGCAAC GAAGGGCGGG GCTTCGCGGT CGTCGCGAGC GAGGTGCGCG TGCTTGCGCA GCGCAGCGCG AGCGCGGCGA AGGACATTCG CGCGCTGATC GGCCGTGCGG CCGAGCGCGT CGGCAACGGG GCGGAACTCG TCGAGGCGAC GGGTGCGACG ATCGGCGAAG CGCAGCACGC GATCTGCCGC GTGACCGGCA TCGTGCAGGA TATCGCGGCG GCCGCGACGG ATCAGAGCCA GGGGCTCGAG CAGATCAACG CCGCCGTCTC GCAAATGGAC AGCGTGACGC GGCGCAACGC GACGCTGGTC GAGCACGCCG CCATCGCCGC GCAATCGCTC AACGAGCAAT CCCGGTGTCT TCAGGACGCG GTGGCCGCAT TCAGGACGGA AGCGCTCGGC TGCTGA
|
Protein sequence | MPTLSFRQKL WLPLVISLIA LLLVSISSAW LSYQTRMEER RNDLTNVAHV GLSIVREYAA LAQRGVLTDA QARKEALERL RSVRYGSDGY FIVIDSSPRM IMHPIKPDTV GKDLRHVADA DGRHHYLTFA SVAQSPQGGF VDYVFAHPNA HPAKAVDKLG YVIRYAPWDW IISTGAYIDD IQAAFAKSLC LSAGVFALLA ALLALSVVYT NRGIERMIGG DPRIAAHMAG VIASGDLATP FATRREDRSS LMFAISQMRD ALANAVAQIR ASAGYVATTA GEIADSNMDL SSHTECQATA LQQAASSMQR LTERVRDTAD RARAASELAG SAARITDRGG DMVVRVVAAM NDIRAESRKM VDIIGVIEGI AFQTNILALN AAVEAARAGN EGRGFAVVAS EVRVLAQRSA SAAKDIRALI GRAAERVGNG AELVEATGAT IGEAQHAICR VTGIVQDIAA AATDQSQGLE QINAAVSQMD SVTRRNATLV EHAAIAAQSL NEQSRCLQDA VAAFRTEALG C
|
| |