Gene EcSMS35_4452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4452 
SymbolzraS 
ID6143174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4546451 
End bp4547827 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content55% 
IMG OID641619272 
Productsensor protein ZraS 
Protein accessionYP_001746388 
Protein GI170683574 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.260374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000970682 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC 
GTGGTCATTG TTGGGCTGGT GGGGCTGTTT GCGGTGACAG TGATTCGCGA TTATGGGCGC
GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT CCGTGCTCTC
GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCATTA
CTGGAAGAAA TGGCCGGGCA GCCTGGTGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA
ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGCTTTATTC CCCGCAGGAA
ATGCAGCAGT TACATCCGGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGC
GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGCTGCTGG AATGCACCGG
ATGCGCCATA TGCAGCAATA TGCCGCGACA CCACAAGCAA TTTTCATTGC TTTCGACGCC
AGTAACATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC
CTGGCGACGG TCTTGCTGGC AAGCGTGTTG TCATTCTTCT GGTATCGTCG CTATCTGCGC
TCGCGCCAGT TGTTGCAGGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC GCTGGGGCAC
CTTGCGGCAG GTGTTGCCCA CGAAATCCGT AACCCACTTT CCTCGATTAA AGGGCTGGCG
AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAACTGGC GCAGGTGATG
GCGAAAGAAG CCGACCGTTT AAACCGCGTG GTAAGCGAGT TGCTGGAACT GGTTAAGCCA
ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG
GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA
CCGGAAATTC AGGCCGATCC GGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT
GCTATTCAGG CGATTGGTCA GCATGGCGTG ATTAGCGTGA CGGCCAGCGA AAGCGGCACG
GGTGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGATCA GCTTGAAGCC
ATCTTCACCC CGTACTTCAC CACCAAAGCC GAAGGCACCG GACTGGGGCT GGCGGTCGTG
CATAATATTG TTGAACAACA CGGTGGTACA ATTCAGGTCG CAAGCCAGGA GGGAAAAGGC
GCAACGTTCA CCCTCTGGCT TCCGGTCAAT ATTACGCGTA AGGACCCACA AGGATGA
 
Protein sequence
MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL 
ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQLYSPQE
MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFAAGMHR MRHMQQYAAT PQAIFIAFDA
SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH
LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVKP
THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PEIQADPDRL TQVLLNLYLN
AIQAIGQHGV ISVTASESGT GVKISVTDSG KGIAADQLEA IFTPYFTTKA EGTGLGLAVV
HNIVEQHGGT IQVASQEGKG ATFTLWLPVN ITRKDPQG