Gene EcolC_4022 details


Gene Information

Locus tag: EcolC_4022
Symbol:
ID: 6064595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4429952
End bp: 4431328
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 54%
IMG OID: 641603437
Product: sensor protein ZraS
Protein accession: YP_001726948
Protein GI: 170021994
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
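
The coordinates and lengths listed above can be cross-checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 4429952
END_BP = 4431328


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1377 458
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.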


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.425845
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0216302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC 
GTGGTCATTG TTGGGCTGGT GGGGCTGTTT GCGGTGACGG TGATTCGCGA TTATGGGCGC
GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT TCGCGCTCTT
GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCCTTA
CTGGAAGAAA TGGCCGGGCA GCCTGGAGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA
ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGCTTTATTC CCCGCAGGAA
ATGCAGCAGT TACATCCGGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGC
GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGCTGCTGG AATGCACCGG
ATGCGCCATA TGCAGCAGTA TGCCGCGACA CCACAAGCAA TTTTCATCGC TTTCGACGCC
AGTAATATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC
CTGGCGACGG TCTTGCTGGC AAGCGTGTTG TCATTCTTCT GGTATCGCCG CTATCTGCGC
TCGCGCCAGC TGTTGCAGGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC ACTGGGGCAT
CTTGCAGCAG GCGTTGCCCA CGAAATCCGT AATCCACTTT CCTCAATTAA AGGGCTGGCG
AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAATTGGC GCAGGTGATG
GCGAAAGAAG CCGACCGTTT AAACCGTGTG GTAAGCGAGT TGCTGGAACT GGTTAAGCCA
ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG
GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA
CCGGTAATTC AGGCCGACCC AGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT
GCTATTCAGG CGATTGGTCA GCATGGTGTG ATTAGCGTGA CGGCCAGCGA AAGCGGCGCG
GGCGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGGTCA GCTTGAAGCC
ATCTTCACTC CGTACTTCAC CACCAAAGCC GAAGGCACCG GATTGGGGCT GGCGGTCGTG
CATAATATTG TTGAACAACA CGGTGGTACA ATTCAGGTCG CAAGCCAGGA GGGAAAAGGC
GCAACGTTCA CCCTCTGTCT TCCGGTCAAT ATTACGCGTA AGGACCCACA AGGATGA
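
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C bases over all bases; the database's exact rounding rule is not stated in this record. The function name is illustrative.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks used for layout
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Small illustrative input; paste the full gene sequence to check the record.
    print(gc_content("GGCC AATT\nGC"))  # prints: 60.0
```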
 
Protein sequence
MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL 
ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQLYSPQE
MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFAAGMHR MRHMQQYAAT PQAIFIAFDA
SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH
LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVKP
THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PVIQADPDRL TQVLLNLYLN
AIQAIGQHGV ISVTASESGA GVKISVTDSG KGIAAGQLEA IFTPYFTTKA EGTGLGLAVV
HNIVEQHGGT IQVASQEGKG ATFTLCLPVN ITRKDPQG
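
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code; table 11 differs in its permitted start codons, which this sketch does not handle (not needed here, since the gene starts with ATG). It also assumes the input contains only A, C, G and T. The names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCGTTTTA TGCAACGTTC TAAAGACTCC"))  # prints: MRFMQRSKDS
```

The output for the first 30 bases matches the first ten residues of the protein sequence listed above.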