Gene EcHS_A1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1503 
Symboltrg 
ID5595393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1512899 
End bp1514539 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content51% 
IMG OID640920660 
Productmethyl-accepting chemotaxis protein III 
Protein accessionYP_001458216 
Protein GI157160898 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA CTCCCTCACA GCGATTAGGT TTTTTGCATC ACATCAGGCT GGTTCCGTTA 
TTTGCCTGCA TTCTTGGCGG TATCTTAGTT CTATTCGCAT TAAGTTCTGC TCTGGCTGGC
TATTTCCTCT GGCAGGCCGA TCGCGATCAG CGTGATGTTA CTGCGGAGAT TGAGATTCGA
ACCGGGTTAG CGAACAGTTC AGATTTTTTG CGTTCAGCCC GGATCAATAT GATTCAGGCC
GGGGCTGCGA GTCGTATTGC GGAAATGGAA GCAATGAAGC GAAATATTGC GCAAGCCGAA
TCGGAGATTA AACAGTCGCA GCAAGGTTAT CGTGCTTATC AGAATCGACC GGTGAAAACA
CCTGCTGATG AAGCCCTCGA CACTGAATTA AATCAACGCT TTCAGGCTTA TATCACGGGT
ATGCAACCTA TGTTGAAATA TGCCAAAAAT GGCATGTTTG AAGCGATTAT CAATCATGAA
AGTGAGCAGA TCCGACCGCT GGATAATGCT TATACCGATA TTTTGAACAA AGCCGTTAAG
ATACGTAGCA CCAGAGCCAA CCAACTGGCG GAACTGGCCC ATCAGCGCAC CCGCCTGGGT
GGGATGTTCA TGATTGGCGC GTTTGTGCTT GCCCTGGTCA TGACGCTGAT AACATTTATG
GTGCTACGTC GGATCGTCAT TCGTCCACTG CAACATGCCG CACAACGGAT TGAAAAAATC
GCTAGTGGCG ATCTGACGAT GAAGAATGAA CCGGCGGGTC GTAATGAAAT CGGTCGCTTA
AGTCGTCATT TACAGCAAAT GCAGCATTCA CTGGGGATGA CAGTAGGGAC TGTTCGACAG
GGTGCGGAAG AGATTTATCG TGGCACCAGC GAAATTTCAG CTGGCAATGC GGACCTGTCA
TCTCGCACCG AAGAACAAGC GGCGGCTATC GAACAAACTG CCGCTAGCAT GGAGCAACTC
ACTGCGACGG TGAAACAGAA TGCGGATAAC GCGCATCATG CCAGCAAACT GGCGCAAGAG
GCTTCTATTA AAGCCAGCGA TGGCGGGCAG ACGGTTTCCG GTGTAGTAAA AACGATGGGC
GCTATCTCTA CAAGTTCGAA GAAAATTTCC GAGATCACCG CCGTCATCAA CAGTATTGCT
TTCCAGACGA ATATTCTGGC ACTGAATGCT GCCGTTGAAG CCGCGCGAGC GGGTGAGCAA
GGCCGTGGAT TTGCCGTTGT CGCCAGCGAA GTACGGACAC TCGCAAGCCG CAGCGCCCAA
GCGGCGAAAG AGATTGAAGG CTTGATCAGT GAATCAGTCA GGTTAATTGA CCTGGGGTCG
GATGAGGTGG CAACGGCAGG GAAAACCATG AGCACTATTG TTGATGCCGT CGCGAGTGTC
ACACATATCA TGCAGGAAAT CGCCGCCGCC TCGGATGAAC AAAGTAGAGG CATAACGCAG
GTTAGCCAGG CGATTTCTGA AATAGATAAG GTGACGCAAC AGAATGCTTC TCTGGTAGAA
GAGGCCTCAG CGGCGGCGTT GTCCCTTGAA GAACAGGCGG CACGATTAAC TGAGGCGGTG
GATGTATTCC GTCTGCACAA ACATTCTGTG TCGGCAAAAC CTCGCGGAGC GGGTGAACCA
GTTAGTTTCG CTACGGTGTG A
 
Protein sequence
MNTTPSQRLG FLHHIRLVPL FACILGGILV LFALSSALAG YFLWQADRDQ RDVTAEIEIR 
TGLANSSDFL RSARINMIQA GAASRIAEME AMKRNIAQAE SEIKQSQQGY RAYQNRPVKT
PADEALDTEL NQRFQAYITG MQPMLKYAKN GMFEAIINHE SEQIRPLDNA YTDILNKAVK
IRSTRANQLA ELAHQRTRLG GMFMIGAFVL ALVMTLITFM VLRRIVIRPL QHAAQRIEKI
ASGDLTMKNE PAGRNEIGRL SRHLQQMQHS LGMTVGTVRQ GAEEIYRGTS EISAGNADLS
SRTEEQAAAI EQTAASMEQL TATVKQNADN AHHASKLAQE ASIKASDGGQ TVSGVVKTMG
AISTSSKKIS EITAVINSIA FQTNILALNA AVEAARAGEQ GRGFAVVASE VRTLASRSAQ
AAKEIEGLIS ESVRLIDLGS DEVATAGKTM STIVDAVASV THIMQEIAAA SDEQSRGITQ
VSQAISEIDK VTQQNASLVE EASAAALSLE EQAARLTEAV DVFRLHKHSV SAKPRGAGEP
VSFATV