Gene EcHS_A4237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4237 
SymbolzraS 
ID5591583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4231508 
End bp4232884 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content54% 
IMG OID640923341 
Productsensor protein ZraS 
Protein accessionYP_001460790 
Protein GI157163472 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value0.355168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC 
GTGGTCATTG TTGGGCTGGT AGGGTTGTTT GCGGTGACGG TGATTCGCGA TTATGGGCGC
GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT TCGCGCTCTT
GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCCTTA
CTGGAAGAAA TGGCCGGGCA GCCTGGAGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA
ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGCTTTATTC CCCGCAGGAA
ATGCAGCAGT TACATCCTGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGC
GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGCTGCTGG AATGCACCGG
ATGCGCCATA TGCAGCAGTA TGCCGCAACA CCACAAGCAA TTTTCATCGC TTTCGACGCC
AGTAATATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC
CTGGCGACGG TCTTGCTGGC AAGCGTATTG TCATTCTTCT GGTATCGCCG CTATCTGCGC
TCGCGCCAGC TTCTACAAGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC GCTGGGGCAT
CTTGCGGCAG GCGTTGCCCA CGAAATCCGT AACCCACTTT CCTCGATTAA AGGACTGGCG
AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAATTGGC GCAGGTGATG
GCGAAAGAAG CCGACCGTTT AAACCGCGTG GTAAGCGAGT TGCTGGAACT GGTTAAGCCA
ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG
GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA
CCGGAAATTC AGGCCGATCC GGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT
GCTATTCAGG CGATTGGTCA GCATGGCGTG ATTAGCGTGA CGGTCAGCGA AAGCGGCGCG
GGCGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGATCA GCTTGAAGCC
ATCTTCACTC CGTACTTCAC CACCAAAGCC GAAGGCACCG GATTGGGGCT GGCGGTCGTG
CATAATATTG TTGAACAACA CGGTGGTACA ATTCAGGTCG CAAGCCAGGA GGGAAAAGGC
TCAACGTTCA CCCTCTGGCT TCCGGTCAAT ATTACGCGTA AGGACCCACA AGGATGA
 
Protein sequence
MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL 
ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQLYSPQE
MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFAAGMHR MRHMQQYAAT PQAIFIAFDA
SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH
LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVKP
THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PEIQADPDRL TQVLLNLYLN
AIQAIGQHGV ISVTVSESGA GVKISVTDSG KGIAADQLEA IFTPYFTTKA EGTGLGLAVV
HNIVEQHGGT IQVASQEGKG STFTLWLPVN ITRKDPQG