Gene Information |
Locus tag | BURPS1106A_1872 |
Symbol | |
ID | 4902904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1825565 |
End bp | 1827445 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640135102 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_001066137 |
Protein GI | 126451531 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
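The coordinate and length fields above are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below is illustrative only: the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is counted in the gene length but not in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1825565, 1827445)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1881 bp) and Protein Length (626 aa).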
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCTGA AGAACATCAC GATCCGAGCG CGCCTCGCCG TCACGATGGC GCTCCTTTCC GTCCTGCTGT GCGCCGTCGG TGCGCTCGGC CTCTTCGGCA TGAGCCGGGC CAACGACGCG AACGAGCAAA CGTCGTCGAA TCAACTGCCG AGCGCGATCG ACGTGGCGAG CGCGGAGTTG TTCGCCGCCC GAGAGCGCCT CGTGCTCGAT CGCGCGGCGC TCCTCGCGGG CACGCCGGAC GTCGCGCCGA CGATCGAGCG CGCGCGCCAG ATGCGCGGAG TCTCCGATTC GTGGTGGAAA AAATACCTCG ACCTGCCGCG CGACGCGCAG GAGGATCGCC TCGCGCAGGA CCTCGCGTCG AAACGGCAGA TTCTGCAGCG TGAGCTCGAC GCGGCCGCCG CGTTGATCAA CGCGAACGAT CGCGACAGAA TCCTCGAATC CGCCAAGCGG ATGCAGAACG TATTCAACGA TTTCTCGCTG GCGAGCGAGG CGCTGCGCGC GTTTCAGCTC AAGCAGGCGA GCGTCAACTT CAGCGATGCG CAAAGCGTAT ACGCGGCGAG CCGGGTCGCC AGCATCGCCG CGCTCGCGCT CGGCCTCGCG CTCTCGCTGT ACTGCTTCCT GAGCCTGCGC GCGGCGATCG CCCGGCCGCT CGCCGACGCG CTCGGTCATT TCGACGCGAT CGCCGCGGGC GAGCTGCGCC GCCCGATCGT CGTCGAACGG CGCGACGAGA TGGGCCTGCT GCTCGAAGGG CTCGCGAAGA TGCAGGCAAG CCTCGTGGGC ACCGTGCGCG CGGTGCGCGT CGGCAGCGAA TCGATCGCGA CGGCGGCCCG GCAGATCGCG GCGGGCAACA TCGATCTGTC CTCGCGCACC GAGGAGCAGG CGGCGGCGCT CGAACAGACG GCGTCGAGCA TGGAGGAACT GACGGGCACG GTGCGGCGCA ACGCGGATAA CGCGCGGCAG GCGAGCGCGC TCGCGGCGAG CGCGTCGGAG ATCGCGAACA AGGGCAACGC GGTGGTCGGC CAGGTGGTCG GCACGATGGG CGACATCAAC CAGAGCTCCG CGAAGATCGC CGACATCATC ACGATCATCG AGGGAATCGC GTTCCAGACC AACATCCTCG CGCTGAACGC GGCGGTCGAG GCGGCGCGCG CGGGCGAGGA AGGGCGCGGC TTCGCGGTCG TCGCGGGCGA GGTGCGCAGC CTCGCGCAGC GTTCGTCGAC GGCCGCCAAG GAGATCAAGG AGCTGATCGA CACGTCGGTC GAGCGCGTGC GCACGGGCTC CATGCTCGTC GACGACGCGG GACGCACGAT GAGCGAGGTG ATCGGCGCGG TGCGGCGCGT GACCGACATC ATGGGCGAGA TCGCGGCGGC CTCGGACGAG CAGAGCACCG GCATCGACCA GGTGTCGCGC GCCGTATCGC AGATGGACGA GGTCACGCAG CAGAACGCGG CGCTCGTCGA GCAGGCCGCG GCGGCCGCGC AATCGCTCGA CGAGCAGGCC GCGCGGCTGC GCGCGACCGT TTCGGTGTTC CATGTCGATG ACGGCGATGC GCGCGATGTC GCCTCGCGCC CGCCGGCCCG GCCGGGGGCC GGCGCCGTGC ACCGGGCGGC GGGCGCGGCC GCGCCGGCGC AGGCCGGCGC GTCTTCGGGC GCGCACGCCG CGCCGGCGGC GCATGCGGCG CGCAAGCCCG CGCCGTCCGC TGCGCTCGCG AAAGCCGCGC CTGTTGCGTC CGTTGCATCC GCTGCATCCG TTACGCCCGC TGCGCCTGCG CCGACGCTTG CCGCGGCAGC GGCGACGCCC 
AAACCCGCGC CCGCCGCGCC GCGCGCCGAG TCGGTTACTG CGGCGTCCGC CGCGAGCGAC GACGATTGGG AGACCTTCTA A
|
Protein sequence | MLLKNITIRA RLAVTMALLS VLLCAVGALG LFGMSRANDA NEQTSSNQLP SAIDVASAEL FAARERLVLD RAALLAGTPD VAPTIERARQ MRGVSDSWWK KYLDLPRDAQ EDRLAQDLAS KRQILQRELD AAAALINAND RDRILESAKR MQNVFNDFSL ASEALRAFQL KQASVNFSDA QSVYAASRVA SIAALALGLA LSLYCFLSLR AAIARPLADA LGHFDAIAAG ELRRPIVVER RDEMGLLLEG LAKMQASLVG TVRAVRVGSE SIATAARQIA AGNIDLSSRT EEQAAALEQT ASSMEELTGT VRRNADNARQ ASALAASASE IANKGNAVVG QVVGTMGDIN QSSAKIADII TIIEGIAFQT NILALNAAVE AARAGEEGRG FAVVAGEVRS LAQRSSTAAK EIKELIDTSV ERVRTGSMLV DDAGRTMSEV IGAVRRVTDI MGEIAAASDE QSTGIDQVSR AVSQMDEVTQ QNAALVEQAA AAAQSLDEQA ARLRATVSVF HVDDGDARDV ASRPPARPGA GAVHRAAGAA APAQAGASSG AHAAPAAHAA RKPAPSAALA KAAPVASVAS AASVTPAAPA PTLAAAAATP KPAPAAPRAE SVTAASAASD DDWETF
|
| |
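The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as GC content. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input string is just the first two blocks of the gene sequence, not the full gene, so its GC percentage will differ from the whole-gene value listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGCTGCTGA AGAACATCAC"
print(len(clean_sequence(first_blocks)))
print(gc_percent(first_blocks))
```

Passing the full gene sequence through `gc_percent` in the same way would give a value that can be compared with the GC content field of the record.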