Gene EcolC_2238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2238 
Symbol 
ID6067316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2457232 
End bp2458872 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content51% 
IMG OID641601643 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001725202 
Protein GI170020248 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAA CTCCCTCACA GCGATTAGGT TTTTTGCATC ACATCAGGCT GGTTCCGTTA 
TTTGCCTGCA TTCTTGGCGG TATCTTAGTT CTATTCGCAT TAAGTTCTGC TCTGGCTGGC
TATTTCCTCT GGCAGGCCGA TCGCGATCAG CGTGATGTTA CTGCGGAGAT TGAGATTCGA
ACCGGGTTAG CGAACAGTTC AGATTTTTTG CGTTCAGCCC GGATCAATAT GATTCAGGCC
GGGGCTGCGA GTCGTATTGC GGAAATGGAA GCAATGAAGC GAAATATTGC GCAAGCCGAA
TCGGAGATTA AACAGTCGCA GCAAGGTTAT CGTGCTTATC AGAATCGACC GGTGAAAACA
CCTGCTGATG AAGCCCTCGA CACTGAATTA AATCAACGCT TTCAGGCTTA TATCACGGGT
ATGCAACCTA TGTTGAAATA TGCCAAAAAT GGCATGTTTG AAGCGATTAT CAATCATGAA
AGTGAGCAGA TCCGACCGCT GGATAATGCT TATACCGATA TTTTGAACAA AGCCGTTAAG
ATACGTAGCA CCAGAGCCAA CCAACTGGCG GAACTGGCCC ATCAGCGCAC CCGCCTGGGT
GGGATGTTCA TGATTGGCGC GTTTGTGCTT GCCCTGGTCA TGACGCTGAT AACATTTATG
GTGCTACGTC GGATCGTCAT TCGTCCACTG CAACATGCCG CACAACGGAT TGAAAAAATC
GCTAGTGGCG ATCTGACGAT GAAGGATGAA CCGGCGGGTC GTAATGAAAT CGGTCGCTTA
AGTCGTCATT TACAGCAAAT GCAGCATTCA CTGGGGATGA CAGTAGGGAC TGTTCGACAG
GGTGCGGAAG AGATTTATCG TGGCACCAGC GAAATTTCAG CTGGCAATGC GGACCTGTCA
TCTCGCACCG AAGAACAAGC GGCGGCTATC GAACAAACTG CCGCTAGCAT GGAGCAACTC
ACTGCGACGG TGAAACAGAA TGCGGATAAC GCGCATCATG CCAGCAAACT GGCGCAAGAG
GCTTCTATTA AAGCCAGCGA TGGCGGGCAG ACGGTTTCCG GTGTAGTAAA AACGATGGGC
GCTATCTCTA CAAGTTCGAA GAAAATTTCC GAGATCACCG CCGTCATCAA CAGTATTGCT
TTCCAGACGA ATATTCTGGC ACTGAATGCT GCCGTTGAAG CCGCGCGAGC GGGTGAGCAA
GGCCGTGGAT TTGCCGTTGT CGCCAGCGAA GTACGGACAC TCGCAAGCCG CAGCGCCCAA
GCGGCGAAAG AGATTGAAGG CTTGATCAGT GAATCAGTCA GGTTAATTGA CCTGGGGTCG
GATGAGGTGG CAACGGCAGG GAAAACCATG AGCACTATTG TTGATGCCGT CGCGAGTGTC
ACACATATCA TGCAGGAAAT CGCCGCCGCC TCGGATGAAC AAAGTAGAGG CATAACGCAG
GTTAGCCAGG CGATTTCTGA AATAGATAAG GTGACGCAAC AGAATGCTTC TCTGGTAGAA
GAGGCCTCAG CGGCGGCGTT GTCCCTTGAA GAACAGGCGG CACGATTAAC TGAGGCGGTG
GATGTATTCC GTCTGCACAA ACATTCTGTG TCGGCAAAAC CTCGCGGAGC GGGTGAACCA
GTTAGTTTCG CTACGGTGTG A
 
Protein sequence
MNTTPSQRLG FLHHIRLVPL FACILGGILV LFALSSALAG YFLWQADRDQ RDVTAEIEIR 
TGLANSSDFL RSARINMIQA GAASRIAEME AMKRNIAQAE SEIKQSQQGY RAYQNRPVKT
PADEALDTEL NQRFQAYITG MQPMLKYAKN GMFEAIINHE SEQIRPLDNA YTDILNKAVK
IRSTRANQLA ELAHQRTRLG GMFMIGAFVL ALVMTLITFM VLRRIVIRPL QHAAQRIEKI
ASGDLTMKDE PAGRNEIGRL SRHLQQMQHS LGMTVGTVRQ GAEEIYRGTS EISAGNADLS
SRTEEQAAAI EQTAASMEQL TATVKQNADN AHHASKLAQE ASIKASDGGQ TVSGVVKTMG
AISTSSKKIS EITAVINSIA FQTNILALNA AVEAARAGEQ GRGFAVVASE VRTLASRSAQ
AAKEIEGLIS ESVRLIDLGS DEVATAGKTM STIVDAVASV THIMQEIAAA SDEQSRGITQ
VSQAISEIDK VTQQNASLVE EASAAALSLE EQAARLTEAV DVFRLHKHSV SAKPRGAGEP
VSFATV