Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2627 |
Symbol | cheA |
ID | 6969829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2481582 |
End bp | 2483540 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386490 |
Product | chemotaxis protein CheA |
Protein accession | YP_002270972 |
Protein GI | 209397867 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0596753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000000000635957 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATAA GCGATTTTTA TCAGACATTT TTTGATGAAG CGGACGAACT GTTGGCTGAT ATGGAGCAGC ATCTGCTGGT TTTGCAGCCG GAAGCGCCAG ATGCCGAACA ATTGAATGCC ATCTTTCGGG CTGCCCACTC GATCAAAGGA GGGGCAGGAA CTTTTGGCTT CAGCGTTTTG CAGGAAACCA CGCATCTGAT GGAAAACCTG CTCGATGAAG CCAGACGAGG TGAGATGCAA CTCAACACCG ACATTATCAA TCTGTTTTTG GAAACGAAGG ACATCATGCA AGAACAGCTC GACGCTTATA AACAGTCGCA AGAGCCGGAT GCCGCCAGCT TCGATTATAT CTGCCAGGCC TTGCGTCAAC TGGCATTAGA AGCGAAAGGC GAAACGCCAT CCGCAGTGAC CCGATTAAGT GTGGTTGCCA AAAGTGAACC GCAAGATGAG CAGAGTCGCA GTCAGTTGCC GCGACGAATT ATCCTTTCGC GCCTGAAGGC CAGCGAAGTC GACCTGCTGG AAGAAGAGCT GGGGCATCTG ACAACGTTAA CTGACGTGGT GAAAGGGGCG GATTCTCTCT CGGCAATATT ACCGGGCGAT ATCGCCGAAG ATGACATCAC AGCGGTACTC TGTTTTGTGA TTGAAGCCGA TCAGATTACC TTTGAAACAG TAGAAGTCTC GCCAAAAATA TCCACCCCAC CAGTGCTTAA ACTGGCAGCC GAACAAGCGC CAACCGGTCG CGTGGAGCGG GAAAAAACGA CGCGTAGCAG TGAATCCACC AGCATCCGTG TAGCGGTAGA AAAGGTTGAT CAATTAATTA ACCTCGTCGG CGAGCTGGTT ATCACCCAGT CCATGCTTGC CCAGCGTTCC AGCGAACTGG ACCCGGTTAA TCATGGTGAT TTGATTACCA GCATGGGGCA GTTACAACGT AACGCCCGTG ATTTGCAGGA ATCAGTGATG TCGATTCGCA TGATGCCGAT GGAATATGTC TTTAGTCGCT ATCCCCGGCT GGTGCGTGAT CTGGCGGGAA AACTCGGCAA GCAGGTAGAA CTGACGCTGG TGGGCAGTTC CACCGAGCTC GACAAGAGCC TGATAGAACG CATTATCGAC CCGCTGACCC ACCTGGTACG CAATAGCCTC GATCACGGTA TTGAACTGCC AGAAAAACGG CTCGCCGCAG GTAAAAACAG CGTCGGAAAT TTAATTCTGT CTGCCGAACA TCAGGGCGGC AACATTTGCA TTGAAGTGAC CGACGATGGG GCGGGGCTAA ACCGTGAGCG AATTCTGGCA AAAGCGGCCT CGCAAGGTTT GACTGTCAGC GAAAACATGA GCGACGACGA AGTCGCGATG CTGATATTTG CACCAGGCTT CTCCACGGCA GAGCAGGTCA CCGACGTCTC CGGGCGCGGC GTCGGCATGG ACGTCGTTAA ACGTAATATC CAGGAGATGG GCGGTCATGT TGAAATCCAG TCGAAGCAGG GTACTGGCAC TACGATCCGC ATTTTACTGC CGCTGACGCT GGCCATCCTC GACGGCATGT CCGTACGCGT TGCGGATGAA GTATTCATTC TGCCGCTGAA TGCTGTTATG GAATCACTGC AACCCCGTGA AGCCGATCTG CATCCACTGG CCGGCGGCGA GCGGGTGCTG GAAGTGCGGG GTGAATATCT GCCCATCGTC GAACTGTGGA AAGTGTTCAA CGTCGCGGGC GCGAAAACCG AAGCTACCCA GGGAATTGTG GTGATCTTAC AAAGTGGCGG TCGCCGCTAC GCCTTGCTGG TGGATCAATT AATTGGTCAA CATCAGGTTG TAGTTAAAAA CCTTGAAAGT AACTATCGCA AAGTCCCCGG CATTTCTGCT GCGACCATTC TTGGCGACGG CAGCGTGGCA CTGATTGTTG ATGTCTCCGC CTTGCAGGCG ATAAACCGCG AACAACGTAT GGCGAACACC GCCGCCTGA
|
Protein sequence | MDISDFYQTF FDEADELLAD MEQHLLVLQP EAPDAEQLNA IFRAAHSIKG GAGTFGFSVL QETTHLMENL LDEARRGEMQ LNTDIINLFL ETKDIMQEQL DAYKQSQEPD AASFDYICQA LRQLALEAKG ETPSAVTRLS VVAKSEPQDE QSRSQLPRRI ILSRLKASEV DLLEEELGHL TTLTDVVKGA DSLSAILPGD IAEDDITAVL CFVIEADQIT FETVEVSPKI STPPVLKLAA EQAPTGRVER EKTTRSSEST SIRVAVEKVD QLINLVGELV ITQSMLAQRS SELDPVNHGD LITSMGQLQR NARDLQESVM SIRMMPMEYV FSRYPRLVRD LAGKLGKQVE LTLVGSSTEL DKSLIERIID PLTHLVRNSL DHGIELPEKR LAAGKNSVGN LILSAEHQGG NICIEVTDDG AGLNRERILA KAASQGLTVS ENMSDDEVAM LIFAPGFSTA EQVTDVSGRG VGMDVVKRNI QEMGGHVEIQ SKQGTGTTIR ILLPLTLAIL DGMSVRVADE VFILPLNAVM ESLQPREADL HPLAGGERVL EVRGEYLPIV ELWKVFNVAG AKTEATQGIV VILQSGGRRY ALLVDQLIGQ HQVVVKNLES NYRKVPGISA ATILGDGSVA LIVDVSALQA INREQRMANT AA
|
| |