Gene ECH74115_5555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5555 
Symbolalr 
ID6969044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5194137 
End bp5195216 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content56% 
IMG OID643389196 
Productalanine racemase 
Protein accessionYP_002273593 
Protein GI209397944 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0787] Alanine racemase 
TIGRFAM ID[TIGR00492] alanine racemase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGG CAACTGTTGT GATTAACCGC CGCGCTCTGC GACACAACCT GCAACGTCTT 
CGTGAACTGG CACCTGCCAG TAAAATGGTT GCGGTGGTGA AAGCGAACGC TTATGGTCAC
GGTCTTCTTG AGACCGCGCG AACGCTCCCC GATGCTGACG CCTTTGGCGT AGCCCGTCTC
GAAGAAGCTC TGCGACTGCG TGCGGGGGGA ATCACCAAAC CTGTACTGTT ACTCGAAGGC
TTTTTTGATG CCAGAGATCT GCCGACGATT TCTGCGCAAC ATTTTCATAC CGCCGTGCAT
AACGAAGAAC AGCTGGCTGC GCTGGAAGAG GCTAGCCTGG ACGAGCCGGT TACCGTCTGG
ATGAAACTCG ATACCGGTAT GCACCGTCTG GGCGTAAGGC CGGAACAGGC TGAGGCGTTT
TATCATCGCC TGACCCAGTG CAAAAACGTT CGGCAGCCGG TGAATATCGT CAGCCATTTT
GCGCGCGCGG ATGAACCAAA ATGTGGCGCA ACCGAGAAAC AACTCGCTAT CTTTAATACC
TTTTGCGAAG GCAAACCAGG TCAACGTTCC ATTGCCGCGT CGGGTGGCAT TCTGCTGTGG
CCACAGTCGC ATTTTGACTG GGTGCGCCCG GGCATCATTC TTTATGGCGT CTCGCCGCTG
GAAGATCGCT CCACCGGTGC CGATTTTGGC TGTCAGCCAG TGATGTCACT AACCTCCAGC
CTGATTGCCG TGCGTGAGCA TAAAGTCGGA GAGCCTGTCG GTTATGGTGG AACCTGGATA
AGCGAACGTG ATACTCGTCT TGGTGTAGTT GCGATGGGCT ATGGCGATGG TTATCCGCGC
GCCGCGCCGT CCGGTACGCC AGTGCTGGTG AACGGTCGCG AAGTGCCGAT TGTCGGGCGC
GTGGCGATGG ATATGATCTG CGTAGACTTA GGTCCACAGG CACAGGATAA AGCCGGGGAC
CCGGTCATTT TGTGGGGTGA AGGTTTGCCC GTAGAACGTA TCGCTGAAAT GACGAAAGTA
AGCGCTTACG AACTTATTAC GCGCCTGACT TCAAGGGTCG CGATGAAATA CGTGGATTAA
 
Protein sequence
MQAATVVINR RALRHNLQRL RELAPASKMV AVVKANAYGH GLLETARTLP DADAFGVARL 
EEALRLRAGG ITKPVLLLEG FFDARDLPTI SAQHFHTAVH NEEQLAALEE ASLDEPVTVW
MKLDTGMHRL GVRPEQAEAF YHRLTQCKNV RQPVNIVSHF ARADEPKCGA TEKQLAIFNT
FCEGKPGQRS IAASGGILLW PQSHFDWVRP GIILYGVSPL EDRSTGADFG CQPVMSLTSS
LIAVREHKVG EPVGYGGTWI SERDTRLGVV AMGYGDGYPR AAPSGTPVLV NGREVPIVGR
VAMDMICVDL GPQAQDKAGD PVILWGEGLP VERIAEMTKV SAYELITRLT SRVAMKYVD