Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1296 |
Symbol | cheA |
ID | 6143455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1283422 |
End bp | 1285380 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616174 |
Product | chemotaxis protein CheA |
Protein accession | YP_001743354 |
Protein GI | 170683477 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.182817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000116445 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATATAA GCGATTTTTA TCAGACATTT TTTGATGAAG CGGACGAACT GTTGGCTGAC ATGGAGCAGC ATCTGCTGGT TTTGCAGCCG GAAGCGCCAG ATGCCGAACA ATTGAATGCC ATCTTTCGGG CTGCCCACTC GATCAAAGGA GGGGCAGGAA CTTTTGGCTT CAGCGTTTTG CAGGAAACCA CGCATCTGAT GGAAAACCTG CTCGATGAAG CCAGACGAGG TGAGATGCAA CTCAACACCG ACATTATCAA TCTGTTTTTG GAAACGAAGG ACATCATGCA AGAACAGCTC GACGCTTATA AACAGTCGCA AGAGCCGGAT GCCGCCAGCT TCGATTATAT CTGCCAGGCC TTGCGTCAAC TGGCATTAGA AGCGAAAGGC GAAACGCCAT CCGCAGTGAC CCGATTAAGT GTGGTTGCCA AAAGTGAACC GCAAGATGAG CAGAGTCGCA GTCAGTCGCC GCGACGAATT ATCCTTTCGC GCCTGAAGGC CGGGGAAGTC GACCTGCTGG AAGAAGAACT GGGACATCTG ACAACGTTAA CTGACGTGGT GAAAGGGGCG GATTCGCTCT CGGCAATATT ACCGGGCGAC ATTGCCGAAG ATGACATCAC AGCGGTACTC TGTTTTGTGA TTGAAGCCGA TCAGATTACT TTTGAAACAG TAGAAGTCTC GCCAAAAATA TCCACCCCAC CAGTGCTTAA ACTGGCAGCC GAACAAGCGC CAACCGGCCG CGTGGAGCGG GAAAAAACGA CGCGCAGCAA TGAATCCACC AGCATCCGTG TAGCGGTAGA AAAGGTTGAT CAATTAATTA ACCTCGTCGG CGAGCTGGTC ATCACCCAGT CCATGCTTGC CCAGCGCTCC AGTGAACTGG ACCCGGTTAA TCATGGTGAT TTGATTACCA GCATGGGGCA GTTACAACGT AACGCTCGTG ATTTGCAAGA ATCAGTGATG TCGATTCGTA TGATGCCGAT GGAATATGTC TTTAGTCGCT ATCCCCGGCT GGTGCGTGAT CTGGCGGGAA AACTCGGCAA GCAGGTAGAA CTGACGCTGG TGGGCAGTTC TACTGAACTC GACAAGAGCC TGATAGAACG CATTATCGAC CCGCTGACCC ACCTGGTACG CAATAGCCTC GATCACGGTA TTGAACTGCC AGAAAAACGG CTCGCCGCAG GTAAAAACAG CGTCGGAAAT TTAATTCTTT CTGCCGAACA TCAGGGCGGC AACATTTGCA TTGAAGTGAC CGACGATGGT GCAGGGCTAA ACCGTGAGCG AATTCTGGCA AAAGCGGCCT CGCAAGGTTT GACTGTAAGC GAAAACATGA GCGACGACGA AGTCGCGATG CTGATATTTG CACCTGGCTT CTCTACGGCA GAGCAGGTCA CCGACGTCTC CGGGCGCGGC GTCGGCATGG ACGTCGTTAA ACGTAATATC CAGGAGATGG GCGGTCATGT CGAAATCCAG TCGAAGCAGG GTACTGGCAC GACGATCCGC ATTTTACTGC CGCTGACGCT GGCCATCCTC GACGGCATGT CCGTACGCGT TGCGGATGAA GTTTTCATTC TGCCGCTGAA TGCTGTCATG GAATCACTGC AACCCCGTGA AGCCGATCTG CATCCACTGG CAGGCGGCGA GCGGGTGCTG GAAGTGCGGG GTGAATATCT GCCCATCGTC GAACTGTGGA AAGTGTTCAA CGTCGCGGGC GCGAAAACCG AAGCTACCCA GGGAATTGTG GTGATCTTAC AAAGTGGCGG TCGCCGCTAC GCCTTGCTGG TGGATCAATT AATTGGTCAA CACCAGGTTG TGGTTAAAAA CCTGGAAAGT AACTATCGCA AAGTCCCCGG CATTTCTGCT GCGACCATTC TTGGCGACGG CAGCGTGGCA CTGATTGTTG ATGTCTCCGC CTTGCAGGCG ATAAACCGCG AACAACGTAT GGCGAACACC GCCGCCTGA
|
Protein sequence | MDISDFYQTF FDEADELLAD MEQHLLVLQP EAPDAEQLNA IFRAAHSIKG GAGTFGFSVL QETTHLMENL LDEARRGEMQ LNTDIINLFL ETKDIMQEQL DAYKQSQEPD AASFDYICQA LRQLALEAKG ETPSAVTRLS VVAKSEPQDE QSRSQSPRRI ILSRLKAGEV DLLEEELGHL TTLTDVVKGA DSLSAILPGD IAEDDITAVL CFVIEADQIT FETVEVSPKI STPPVLKLAA EQAPTGRVER EKTTRSNEST SIRVAVEKVD QLINLVGELV ITQSMLAQRS SELDPVNHGD LITSMGQLQR NARDLQESVM SIRMMPMEYV FSRYPRLVRD LAGKLGKQVE LTLVGSSTEL DKSLIERIID PLTHLVRNSL DHGIELPEKR LAAGKNSVGN LILSAEHQGG NICIEVTDDG AGLNRERILA KAASQGLTVS ENMSDDEVAM LIFAPGFSTA EQVTDVSGRG VGMDVVKRNI QEMGGHVEIQ SKQGTGTTIR ILLPLTLAIL DGMSVRVADE VFILPLNAVM ESLQPREADL HPLAGGERVL EVRGEYLPIV ELWKVFNVAG AKTEATQGIV VILQSGGRRY ALLVDQLIGQ HQVVVKNLES NYRKVPGISA ATILGDGSVA LIVDVSALQA INREQRMANT AA
|
| |