Gene EcSMS35_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4454 
SymbolzraR 
ID6142865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4547824 
End bp4549149 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content57% 
IMG OID641619273 
Producttranscriptional regulatory protein ZraR 
Protein accessionYP_001746389 
Protein GI170680150 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.209341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000519331 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCACG ATAATATCGA TATTCTGGTG GTGGATGATG ACATTAGCCA CTGCACTATT 
TTGCAGGCTT TACTGCGCGG CTGGGGCTAT AACGTCGCGC TGGCGAACAG CGGGCGACAG
GCGCTGGAGC AGGTGCGGGA ACAGGTTTTT GATCTTGTGC TTTGCGATGT GCGAATGGCA
GAGATGGACG GCATCGCCAC GCTCAAAGAG ATCAAAGCGT TAAATCCGGC AATTCCGGTA
CTGATTATGA CCGCGTACTC CAGCGTCGAG ACGGCGGTAG AGGCGTTGAA AACCGGGGCG
CTGGATTATC TCATCAAGCC GCTGGATTTC GATAACCTGC AGGCGACGCT GGAAAAGGCC
CTCGCGCATA CGCACAGTAT TGACGCTGAA ACGCCTGCGG TGTCTGCCAG CCAGTTCGGT
ATGGTCGGTA AAAGCCCGGC GATGCAACAC CTGCTCAGTG AAATCGCCCT CGTCGCGCCA
TCGGAAGCCA CGGTGCTGAT CCATGGCGAT TCCGGCACCG GTAAAGAGCT GGTCGCCAGG
GCGATTCACG CCAGTAGCGC ACGGAGTGAA AAACCACTGG TAACGCTCAA CTGTGCGGCA
CTCAACGAAT CCTTGCTGGA ATCTGAATTG TTCGGTCACG AAAAAGGGGC GTTTACCGGG
GCCGACAAAC GGCGGGAGGG GCGCTTTGTT GAGGCGGACG GCGGCACGTT GTTTCTCGAT
GAAATTGGCG ATATCTCGCC GATGATGCAG GTGCGTCTGC TACGTGCGAT TCAGGAGCGC
GAAGTTCAGC GTGTCGGCAG CAACCAGACC ATCTCGGTTG ATGTCCGGCT GATTGCGGCG
ACTCATCGCG ATCTTGCCGC AGAGGTGAAT GCCGGGCGTT TTCGCCAGGA TCTCTACTAT
CGCCTGAACG TGGTGGCGAT TGAAGTTCCG TCGCTGCGTC AGCGGCGGGA AGATATTCCT
CTGCTGACAA ACCATTTTCT TCAGCGCTTT GCCGAGCGTA ATCGCAAGGC GGTAAAAGGT
TTTACGCCCC AGGCGATGGA TCTGCTGATT CACTACGACT GGCCGGGAAA TATTCGTGAA
CTGGAAAACG CGGTGGAACG GGCGGTGGTG CTGCTGACCG GGGAATATAT TTCCGAACGC
GAGCTGCCGC TGGCGATTGC GAGTACGCCG ATCCCGCTGG TACAAAGTCA GGATATTCAG
CCGCTGGTGG AAGTCGAAAA AGAGGTGATT CTTGCGGCAC TGGAGAAAAC GGGCGGCAAC
AAAACCGAAG CCGCCCGTCA GTTAGGGATC ACGCGCAAAA CGCTCTTGGC AAAACTGTCG
CGTTAG
 
Protein sequence
MTHDNIDILV VDDDISHCTI LQALLRGWGY NVALANSGRQ ALEQVREQVF DLVLCDVRMA 
EMDGIATLKE IKALNPAIPV LIMTAYSSVE TAVEALKTGA LDYLIKPLDF DNLQATLEKA
LAHTHSIDAE TPAVSASQFG MVGKSPAMQH LLSEIALVAP SEATVLIHGD SGTGKELVAR
AIHASSARSE KPLVTLNCAA LNESLLESEL FGHEKGAFTG ADKRREGRFV EADGGTLFLD
EIGDISPMMQ VRLLRAIQER EVQRVGSNQT ISVDVRLIAA THRDLAAEVN AGRFRQDLYY
RLNVVAIEVP SLRQRREDIP LLTNHFLQRF AERNRKAVKG FTPQAMDLLI HYDWPGNIRE
LENAVERAVV LLTGEYISER ELPLAIASTP IPLVQSQDIQ PLVEVEKEVI LAALEKTGGN
KTEAARQLGI TRKTLLAKLS R