Gene EcSMS35_1296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1296 
SymbolcheA 
ID6143455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1283422 
End bp1285380 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content53% 
IMG OID641616174 
Productchemotaxis protein CheA 
Protein accessionYP_001743354 
Protein GI170683477 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.182817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0000116445 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATATAA GCGATTTTTA TCAGACATTT TTTGATGAAG CGGACGAACT GTTGGCTGAC 
ATGGAGCAGC ATCTGCTGGT TTTGCAGCCG GAAGCGCCAG ATGCCGAACA ATTGAATGCC
ATCTTTCGGG CTGCCCACTC GATCAAAGGA GGGGCAGGAA CTTTTGGCTT CAGCGTTTTG
CAGGAAACCA CGCATCTGAT GGAAAACCTG CTCGATGAAG CCAGACGAGG TGAGATGCAA
CTCAACACCG ACATTATCAA TCTGTTTTTG GAAACGAAGG ACATCATGCA AGAACAGCTC
GACGCTTATA AACAGTCGCA AGAGCCGGAT GCCGCCAGCT TCGATTATAT CTGCCAGGCC
TTGCGTCAAC TGGCATTAGA AGCGAAAGGC GAAACGCCAT CCGCAGTGAC CCGATTAAGT
GTGGTTGCCA AAAGTGAACC GCAAGATGAG CAGAGTCGCA GTCAGTCGCC GCGACGAATT
ATCCTTTCGC GCCTGAAGGC CGGGGAAGTC GACCTGCTGG AAGAAGAACT GGGACATCTG
ACAACGTTAA CTGACGTGGT GAAAGGGGCG GATTCGCTCT CGGCAATATT ACCGGGCGAC
ATTGCCGAAG ATGACATCAC AGCGGTACTC TGTTTTGTGA TTGAAGCCGA TCAGATTACT
TTTGAAACAG TAGAAGTCTC GCCAAAAATA TCCACCCCAC CAGTGCTTAA ACTGGCAGCC
GAACAAGCGC CAACCGGCCG CGTGGAGCGG GAAAAAACGA CGCGCAGCAA TGAATCCACC
AGCATCCGTG TAGCGGTAGA AAAGGTTGAT CAATTAATTA ACCTCGTCGG CGAGCTGGTC
ATCACCCAGT CCATGCTTGC CCAGCGCTCC AGTGAACTGG ACCCGGTTAA TCATGGTGAT
TTGATTACCA GCATGGGGCA GTTACAACGT AACGCTCGTG ATTTGCAAGA ATCAGTGATG
TCGATTCGTA TGATGCCGAT GGAATATGTC TTTAGTCGCT ATCCCCGGCT GGTGCGTGAT
CTGGCGGGAA AACTCGGCAA GCAGGTAGAA CTGACGCTGG TGGGCAGTTC TACTGAACTC
GACAAGAGCC TGATAGAACG CATTATCGAC CCGCTGACCC ACCTGGTACG CAATAGCCTC
GATCACGGTA TTGAACTGCC AGAAAAACGG CTCGCCGCAG GTAAAAACAG CGTCGGAAAT
TTAATTCTTT CTGCCGAACA TCAGGGCGGC AACATTTGCA TTGAAGTGAC CGACGATGGT
GCAGGGCTAA ACCGTGAGCG AATTCTGGCA AAAGCGGCCT CGCAAGGTTT GACTGTAAGC
GAAAACATGA GCGACGACGA AGTCGCGATG CTGATATTTG CACCTGGCTT CTCTACGGCA
GAGCAGGTCA CCGACGTCTC CGGGCGCGGC GTCGGCATGG ACGTCGTTAA ACGTAATATC
CAGGAGATGG GCGGTCATGT CGAAATCCAG TCGAAGCAGG GTACTGGCAC GACGATCCGC
ATTTTACTGC CGCTGACGCT GGCCATCCTC GACGGCATGT CCGTACGCGT TGCGGATGAA
GTTTTCATTC TGCCGCTGAA TGCTGTCATG GAATCACTGC AACCCCGTGA AGCCGATCTG
CATCCACTGG CAGGCGGCGA GCGGGTGCTG GAAGTGCGGG GTGAATATCT GCCCATCGTC
GAACTGTGGA AAGTGTTCAA CGTCGCGGGC GCGAAAACCG AAGCTACCCA GGGAATTGTG
GTGATCTTAC AAAGTGGCGG TCGCCGCTAC GCCTTGCTGG TGGATCAATT AATTGGTCAA
CACCAGGTTG TGGTTAAAAA CCTGGAAAGT AACTATCGCA AAGTCCCCGG CATTTCTGCT
GCGACCATTC TTGGCGACGG CAGCGTGGCA CTGATTGTTG ATGTCTCCGC CTTGCAGGCG
ATAAACCGCG AACAACGTAT GGCGAACACC GCCGCCTGA
 
Protein sequence
MDISDFYQTF FDEADELLAD MEQHLLVLQP EAPDAEQLNA IFRAAHSIKG GAGTFGFSVL 
QETTHLMENL LDEARRGEMQ LNTDIINLFL ETKDIMQEQL DAYKQSQEPD AASFDYICQA
LRQLALEAKG ETPSAVTRLS VVAKSEPQDE QSRSQSPRRI ILSRLKAGEV DLLEEELGHL
TTLTDVVKGA DSLSAILPGD IAEDDITAVL CFVIEADQIT FETVEVSPKI STPPVLKLAA
EQAPTGRVER EKTTRSNEST SIRVAVEKVD QLINLVGELV ITQSMLAQRS SELDPVNHGD
LITSMGQLQR NARDLQESVM SIRMMPMEYV FSRYPRLVRD LAGKLGKQVE LTLVGSSTEL
DKSLIERIID PLTHLVRNSL DHGIELPEKR LAAGKNSVGN LILSAEHQGG NICIEVTDDG
AGLNRERILA KAASQGLTVS ENMSDDEVAM LIFAPGFSTA EQVTDVSGRG VGMDVVKRNI
QEMGGHVEIQ SKQGTGTTIR ILLPLTLAIL DGMSVRVADE VFILPLNAVM ESLQPREADL
HPLAGGERVL EVRGEYLPIV ELWKVFNVAG AKTEATQGIV VILQSGGRRY ALLVDQLIGQ
HQVVVKNLES NYRKVPGISA ATILGDGSVA LIVDVSALQA INREQRMANT AA