Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0485 |
Symbol | |
ID | 4886651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 440377 |
End bp | 441921 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640130426 |
Product | aerotaxis sensor receptor |
Protein accession | YP_001061491 |
Protein GI | 126443318 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.537813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAACA ACCAACCCGT CACCCAACAC GAATTCGAGC TTCCCGACGA CGCGACATTG ATGTCGACGA CCGATCCGCA CGGCCGCATC ACCTATGCGA ACGCGACGTT CGTGCACGTC AGCGGCTTTT CGAGCGACGA GATCGTCGGC GCGCCGCACA ACGTCGTGCG CCATCCCGAC ATGCCGCGCG ATGCGTTCGC CGACATGTGG GCGACGTTAA AGCGCGGCGA GCCGTGGACC GCGCTCGTCA AGAACCGCCG CAAGAACGGC GATCACTACT GGGTGCGCGC GAACGCGGTG CCGGTGATTC GCGGCGGGCA GACGCAGGGC TACATGTCGG TGCGCACGAA GCCCGCGCGC GCCGAGACCG CCGCCGCCGA CGCGCTCTAT CGCGATTTTC GCGAGGGCCG CGCGGGCAGC CGGCGCTTTC ACAAGGGGCT GATCGTGCGC ACCGGGCTGC TGCGTGCGTG CTCGCTGCTG CAGACGATGT CGGTGCGCGC GCGCATCCAT CTGCCGATCG TCGCGCTGAC GCCGGCGATC GTCGGCGCTG CCTGGGCGGC CGGCGTGGCG GGCGCGCCGC TCGCGCAGCT CGCGGGCGCG ACGCTCGGCG GCGCGGCGCT CGCCGCGTGG TGGCTCGACG CGCAGATCGC GCGCCCGCTG CGCACGTTGC GCCGGCAGGC GCTCGACGTC GCGACCGGGG CGAGCCGCCG GGGCGTCAAC ATGAATCGCG TCGACGAAAT CGGCATGTCG CTGCGCACGA TCAATCAGCT CGGGCTGATG TTTCGCTGGC TGATCGACGA CGTGAGCGAA CAGGTCTTGA CCGTGCAGCG CGCGGTCAAC GAGATCGCGC AGGGCAATCA CGATCTGAGC GCGCGCACCG AGCAGGCGGC GACGAGCGTT CAGCAGACGG CCGCGTCGAT GGCGCAGATG ACGGCGACCG TGTCGAGCAA CGCGCAGACC GCGACGCAGG CGAACCGGCT GTCCGAATCG GCGAGCCATG CGGCGGAGCG CGGCGGCCAG GCGGTGCGCG AGGTGGTGAG CACGATGGGC GAGATCACCG AGAGCTCGCG CCGGATCTCG GAGATCATCG GCGTGATCGA CGGCATCGCG TTCCAGACCA ACATCCTCGC GCTGAACGCG GCCGTCGAGG CCGCGCGCGC GGGCGAGCAG GGCCGCGGCT TCGCGGTGGT CGCGGGCGAG GTGCGCGCGC TCGCGCAGCG CAGCGCGAAC GCGGCGAAGG AGATCAAGGC GCTGATCGGT GCGAGCGTCG AGCGGGTCGA ATCCGGCGCG CAGACGGTCG ACTACGCGGG CAGGACGATG GGCGAGATCG TCTCGCAGGT GAAGCGCGTG TCCGATCTGA TCGCCGAGAT CAGCGCATCG ACGAGCGAGC AGCGCGCGGG CGTCACGCAG GTCGACGACG CGGTCGTCCA TCTCGACAGC ATCACGCAGC AGAACGCCGC GCTCGTCGAG CAGAGCGCGG CGGCCTCGGA GAGCCTGCGG CAGCAGGCGA CGCTGCTCGT CGACGCGGTC GGCGTGTTTC GCTGA
|
Protein sequence | MRNNQPVTQH EFELPDDATL MSTTDPHGRI TYANATFVHV SGFSSDEIVG APHNVVRHPD MPRDAFADMW ATLKRGEPWT ALVKNRRKNG DHYWVRANAV PVIRGGQTQG YMSVRTKPAR AETAAADALY RDFREGRAGS RRFHKGLIVR TGLLRACSLL QTMSVRARIH LPIVALTPAI VGAAWAAGVA GAPLAQLAGA TLGGAALAAW WLDAQIARPL RTLRRQALDV ATGASRRGVN MNRVDEIGMS LRTINQLGLM FRWLIDDVSE QVLTVQRAVN EIAQGNHDLS ARTEQAATSV QQTAASMAQM TATVSSNAQT ATQANRLSES ASHAAERGGQ AVREVVSTMG EITESSRRIS EIIGVIDGIA FQTNILALNA AVEAARAGEQ GRGFAVVAGE VRALAQRSAN AAKEIKALIG ASVERVESGA QTVDYAGRTM GEIVSQVKRV SDLIAEISAS TSEQRAGVTQ VDDAVVHLDS ITQQNAALVE QSAAASESLR QQATLLVDAV GVFR
|
| |