Gene EcSMS35_4514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4514 
Symbolalr 
ID6147339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4613349 
End bp4614428 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content56% 
IMG OID641619330 
Productalanine racemase 
Protein accessionYP_001746442 
Protein GI170680683 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0787] Alanine racemase 
TIGRFAM ID[TIGR00492] alanine racemase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGG CAACTGTTGT GATTAACCGC CGCGCTCTGC GACACAACCT GCAACGTCTT 
CGTGAACTGG CACCTGCCAG TAAAATGGTT GCGGTGGTGA AAGCGAACGC TTATGGTCAC
GGTCTTCTTG AGACCGCGCG AACGCTCCCC GATGCTGACG CCTTTGGCGT AGCCCGTCTC
GAAGAAGCTC TGCGACTGCG TGCGGGGGGA ATCACCAAAC CTGTACTGTT ACTCGAAGGC
TTTTTTGATG CCAGAGATCT GCCGACGATT TCTGCGCAAC ATTTTCATAC CGCCGTGCAT
AACGAAGAAC AGCTGGCTGC GCTGGAAGAG GCCAGCCTGG ACGAGCCGGT TACCGTCTGG
ATGAAACTCG ATACCGGTAT GCACCGTCTG GGCGTAAGGC CGGAACAGGC TGAGGCGTTT
TATCATCGCC TGACCCAGTG TAAAAACGTT CGTCAGCCGG TGAATATCGT CAGCCATTTT
GCGCGCGCGG ATGAACCAAA ATGCGGCGCA ACCGAGAAAC AACTCGCTAT CTTTAATACC
TTTTGCGAAG GCAAACCAGG TCAACGTTCC ATTGCCGCAT CGGGTGGCAT TCTGCTGTGG
CCACAGTCGC ATTTTGACTG GGTGCGTCCG GGCATCATTC TTTACGGCGT CTCGCCGCTG
GAAGATCGTT CCACCGGTGC CGATTTTGGC TGTCAGCCAG TGATGTCACT AACCTCCAGC
CTGATTGCCG TGCGTGAGCA CAAAGCCGGA GAGCCTGTCG GTTATGGTGG AACCTGGGTA
AGCGAACGTG ATACCCGCCT GGGCGTAGTC GCGATGGGTT ATGGCGATGG TTATCCGCGC
GCCGCGCCGT CCGGTACGCC AGTGCTGGTG AACGGTCGCG AAGTGCCGAT TGTCGGGCGA
GTCGCGATGG ATATGATCTG CGTAGACTTA GGTCCACAGG CGCAGGATAA AGCCGGGGAC
CCGGTCATTT TATGGGGCGA AGGTTTGCCC GTAGAACGTA TCGCTGAAAT GACGAAAGTA
AGCGCTTACG AACTTATCAC GCGCCTGACT TCAAGGGTCG CGATGAAATA CGTGGATTAA
 
Protein sequence
MQAATVVINR RALRHNLQRL RELAPASKMV AVVKANAYGH GLLETARTLP DADAFGVARL 
EEALRLRAGG ITKPVLLLEG FFDARDLPTI SAQHFHTAVH NEEQLAALEE ASLDEPVTVW
MKLDTGMHRL GVRPEQAEAF YHRLTQCKNV RQPVNIVSHF ARADEPKCGA TEKQLAIFNT
FCEGKPGQRS IAASGGILLW PQSHFDWVRP GIILYGVSPL EDRSTGADFG CQPVMSLTSS
LIAVREHKAG EPVGYGGTWV SERDTRLGVV AMGYGDGYPR AAPSGTPVLV NGREVPIVGR
VAMDMICVDL GPQAQDKAGD PVILWGEGLP VERIAEMTKV SAYELITRLT SRVAMKYVD