Gene ECH74115_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2026 
Symboltrg 
ID6970117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1924797 
End bp1926209 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content52% 
IMG OID643385942 
Productmethyl-accepting chemotaxis protein III 
Protein accessionYP_002270431 
Protein GI209400342 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000000214401 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTCAGG CCGGGGCTGC GAGTCGTATT GCGGAAATGG AAGCAATGAA GCGAAATATT 
GCGCAAGCCG AATCGGAGAT TAAACAGTCG CAGCAAGGTT ATCGTGCTTA TCAGAATCGG
CCGGTGAAAA CACCTGCTGA TGAAGCCCTC GACACTGAAT TAAATCAACG CTTTCAGGCT
TATATCACGG GTATGCAACC TATGTTGAAA TATGCCAAAA ATGGCATGTT TGAAGCGATT
ATCAATCATG AAAGTGAGCA GATCCGAACG CTGGATAATG CTTATACCGA TATTTTGAAC
AAAGCCGTTA AGATACGTAG CACCAGAGCC AACCAACTGG CGGAACTGGC CCATCAGCGC
ACCCGCCTGG GCGGGATGTT CATGATTGGT GCGTTTGTGC TTGCCCTGGT CATGACGCTG
ATAACATTTA TGGTGCTACG TCGGATCGTC ATTCGTCCAC TGCAACATGC CGCACAACGG
ATTGAAAAAA TCGCTAGTGG CGATCTGACG ATGAAGGATG AACCGGCGGG TCGTAATGAA
ATCGGTCGCT TAAGTCGTCA TTTACAGCAA ATGCAGCATT CACTGGGGAT GACCGTAGGG
ACTGTTCGAC AGGGTGCGGA AGAGATTTAT CGTGGCACCA GCGAAATTTC AGCTGGCAAT
GCGGACCTGT CATCTCGCAC CGAAGAACAA GCGGCGGCTA TCGAACAAAC TGCCGCCAGC
ATGGAGCAAC TCACTGCGAC GGTGAAACAG AATGCGGATA ACGCGCATTA TGCCAGCAAA
CTGGCGCAAG AGGCTTCTAT TAAAGCCAGC GATGGCGGGC AGACGGTTTC CGGTGTAGTA
AAAACGATGG GCGCTATCTC CACGAGTTCG AAGAAAATTT CTGAGATCAC CGCCGTCATC
AACAGTATTG CTTTCCAGAC GAATATTCTG GCACTGAATG CTGCCGTTGA AGCTGCGCGA
GCGGGTGAGC AAGGGCGTGG ATTTGCCGTT GTCGCCAGCG AAGTACGGAC ACTCGCAAGC
CGCAGCGCCC AGGCGGCAAA AGAAATTGAA GGCTTGATCA GTGAATCAGT CAGGTTAATT
GACCTGGGAT CGGATGAGGT GGCAACGGCC GGGAAAACCA TGAGCACTAT TGTTGATGCC
GTCGCGAGTG TCACACATAT CATGCAGGAA ATTGCCGCCG CCTCGGATGA GCAAAGTAGG
GGCATAACGC AGGTTAGCCA GGCGATTTCT GAAATGGATA AGGTGACGCA ACAGAATGCT
TCTCTGGTAG AAGAGGCCTC AGCGGCGGCG GTGTCCCTTG AAGAACAGGC GGCACGATTA
ACTGAGGCGG TGGACGTATT CCGTCTGAAC AAACAATCTG TGTTGGCAGA ACCTCGCGGA
GCGGGTGAAC TAGTTAGTTT CGCTCCGGTG TGA
 
Protein sequence
MIQAGAASRI AEMEAMKRNI AQAESEIKQS QQGYRAYQNR PVKTPADEAL DTELNQRFQA 
YITGMQPMLK YAKNGMFEAI INHESEQIRT LDNAYTDILN KAVKIRSTRA NQLAELAHQR
TRLGGMFMIG AFVLALVMTL ITFMVLRRIV IRPLQHAAQR IEKIASGDLT MKDEPAGRNE
IGRLSRHLQQ MQHSLGMTVG TVRQGAEEIY RGTSEISAGN ADLSSRTEEQ AAAIEQTAAS
MEQLTATVKQ NADNAHYASK LAQEASIKAS DGGQTVSGVV KTMGAISTSS KKISEITAVI
NSIAFQTNIL ALNAAVEAAR AGEQGRGFAV VASEVRTLAS RSAQAAKEIE GLISESVRLI
DLGSDEVATA GKTMSTIVDA VASVTHIMQE IAAASDEQSR GITQVSQAIS EMDKVTQQNA
SLVEEASAAA VSLEEQAARL TEAVDVFRLN KQSVLAEPRG AGELVSFAPV