Gene EcE24377A_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1600 
Symbol 
ID5587324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1592442 
End bp1594082 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content51% 
IMG OID640925289 
Productmethyl-accepting chemotaxis protein III 
Protein accessionYP_001462694 
Protein GI157157689 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA CTCCCTCACA TCGATTAGGT TTTTTGCATC ACATCAGGCT GGTTCCGTTA 
TTTGCCTGCA TTCTTGGCGG TATCTTAGTT CTATTCGCAT TAAGTTCTGC TCTGGCTGGC
TATTTCCTCT GGCAGGCCGA TCGCGATCAG CGTGATGTTA CTGCGGAGAT TGAGATTCGG
ACCGGGTTAG CGAACAGTTC AGATTTTTTG CGTTCAGCCC GGATCAATAT GATTCAGGCC
GGGGCTGCGA GTCGTATTGC GGAAATGGAA GCAATGAAGC GCAATATTGC GCAAGCCGAA
TCGGAGATTA AACAGTCGCA GCAAGGTTAT CGTGCTTATC AGAATCGGCC GGTGAAAACA
CCTGCTGATG AAGCCCTCGA CACTGAATTA AATCAACGCT TTCAGGCTTA TATCACGGGT
ATGCAACCTA TGTTGAAATA TGCCAAAAAT GGCATGTTTG AAGCGATTAT CAATCATGAA
AGTGAGCAGA TCCGACCGCT GGATAATGCT TATACCGATA TTTTGAACAA AGCCGTTAAG
ATACGTAGCA CCAGAGCCAA CCAACTGGCG GAACTGGCCC ATCAGCGCAC CCGCCTGGGT
GGGATGTTCA TGATTGGCGC GTTTGTGCTT GCCCTGGTCA TGACGCTGAT AACATTTATG
GTGCTACGTC GGATCGTCAT TCGTCCACTG CAACATGCCG CACAACGGAT TGAAAAAATC
GCCAGTGGCG ATCTGACGAT GAATGATGAA CCGGCGGGTC GTAATGAAAT CGGTCGCTTA
AGTCGTCATT TACAGCAAAT GCAGCATTCA CTGGGGATGA CAGTAGGGAC TGTTCGACAG
GGTGCGGAAG AGATTTATCG TGGCACCAGC GAAATTTCAG CTGGCAATGC GGACCTGTCA
TCTCGCACCG AAGAACAAGC GGCGGCTATC GAACAAACTG CCGCTAGCAT GGAGCAACTC
ACTGCGACGG TGAAACAGAA TGCGGATAAC GCGCATCATG CCAGCAAACT GGCGCAAGAG
GCTTCTATTA AAGCCAGCGA TGGCGGGCAG ACGGTTTCCG GTGTAGTAAA AACGATGGGC
GCTATCTCTA CAAGTTCGAA GAAAATTTCC GAGATCACCG CCGTCATCAA CAGTATTGCT
TTCCAGACGA ATATTCTGGC ACTGAATGCT GCCGTTGAAG CCGCGCGAGC GGGTGAGCAA
GGCCGTGGAT TTGCCGTTGT CGCCAGCGAA GTACGGACAC TCGCAAGCCG CAGCGCCCAA
GCGGCGAAAG AGATTGAAGG CTTGATCAGT GAATCAGTCA GGTTAATTGA CCTGGGGTCG
GATGAGGTGG CAACGGCAGG GAAAACCATG AGCACTATTG TTGATGCCGT CGCGAGTGTC
ACACATATCA TGCAGGAAAT CGCCGCCGCC TCGGATGAAC AAAGTAGAGG CATAACGCAG
GTTAGCCAGG CGATTTCTGA AATGGATAAG GTGACGCAAC AGAATGCTTC TCTGGTAGAA
GAGGCCTCAG CGGCGGCGGT GTCCCTTGAA GAACAGGCGG CACGATTAAC TGAGGCGGTG
GATGTATTCC GTCTGCACAA ACATTCTGTG TCGGCAGAAC CTCGCGGAGC GGGTGAACCA
GTTAGTTTCG CTACGGTGTG A
 
Protein sequence
MNTTPSHRLG FLHHIRLVPL FACILGGILV LFALSSALAG YFLWQADRDQ RDVTAEIEIR 
TGLANSSDFL RSARINMIQA GAASRIAEME AMKRNIAQAE SEIKQSQQGY RAYQNRPVKT
PADEALDTEL NQRFQAYITG MQPMLKYAKN GMFEAIINHE SEQIRPLDNA YTDILNKAVK
IRSTRANQLA ELAHQRTRLG GMFMIGAFVL ALVMTLITFM VLRRIVIRPL QHAAQRIEKI
ASGDLTMNDE PAGRNEIGRL SRHLQQMQHS LGMTVGTVRQ GAEEIYRGTS EISAGNADLS
SRTEEQAAAI EQTAASMEQL TATVKQNADN AHHASKLAQE ASIKASDGGQ TVSGVVKTMG
AISTSSKKIS EITAVINSIA FQTNILALNA AVEAARAGEQ GRGFAVVASE VRTLASRSAQ
AAKEIEGLIS ESVRLIDLGS DEVATAGKTM STIVDAVASV THIMQEIAAA SDEQSRGITQ
VSQAISEMDK VTQQNASLVE EASAAAVSLE EQAARLTEAV DVFRLHKHSV SAEPRGAGEP
VSFATV