Gene EcSMS35_1959 details


Gene Information

Locus tag: EcSMS35_1959
Symbol: dadX
ID: 6146230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1981763
End bp: 1982833
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 58%
IMG OID: 641616835
Product: alanine racemase
Protein accession: YP_001744011
Protein GI: 170681793
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0787] Alanine racemase
TIGRFAM ID: [TIGR00492] alanine racemase
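
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 1981763
end_bp = 1982833

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1071
print(protein_length)  # 356
```

Under these assumptions the results match the listed Gene Length (1071 bp) and Protein Length (356 aa).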


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.148876
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.720893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGTC CGATACAGGC CAGCCTCGAT CTGCAGGCAT TAAAACAGAA TCTGTCCATT 
GTCCGCCAGG CCGCGCCGCG CGCGCGCGTC TGGTCGGTGG TAAAAGCGAA CGCTTACGGG
CACGGTATTG AGCGTATCTG GAGCGCGCTC GGGGCCACCG ATGGCTTTGC ATTACTTAAC
CTGGAAGAGG CAATAACGTT ACGTGAGCGC GGCTGGAAAG GACCGATCCT GATGCTGGAA
GGATTTTTCC ATGCTCAGGA TCTGGAGATT TATGACCAGC ACCGCCTGAC CACCTGCGTA
CACAGCAACT GGCAGCTCAA AGCACTGCAA AATGCACGGC TAAAAGCACC GTTGGATATT
TATCTTAAAG TGAACAGTGG GATGAATCGG TTGGGCTTCC AGCCCGATCG CGTGCTTACC
GTCTGGCAGC AGTTGCGGGC GATGGCGAAT GTTGGCGAAA TGACCCTGAT GTCGCATTTT
GCCGAGGCGG AACATCCTGA TGGAATTTCC AGCGCGATGG CGCGTATTGA GCAGGCGGCG
GAAGGGCTGG AGTGTCGGCG TTCGTTGTCC AATTCGGCGG CGACTCTGTG GCACCCGGAA
GCGCATTTTG ACTGGGTTCG GCCTGGCATT ATTTTGTATG GCGCTTCGCC GTCCGGTCAG
TGGCGTGATA TCGCCAATAC CGGATTACGC CCGGTGATGA CGCTAAGCAG TGAGATTATT
GGTGTCCAGA CGCTAAAAGC GGGCGAGCGT GTGGGCTACG GCGGTCGCTA TACTGCGCGC
GATGAACAGC GAATCGGCAT TGTCGCCGCA GGATACGCCG ACGGTTATCC GCGCCATGCG
CCTACCGGTA CTCCTGTTTT AGTGGACGGC GTGCGCACCA TGACGGTGGG GACCGTCTCG
ATGGATATGC TGGCGGTTGA TTTAACGCCT TGCCCGCAGG CCGGTATTGG TACGCCGGTT
GAGCTGTGGG GCAAGGAGAT CAAGATTGAT GATGTCGCCG CCGCTGCCGG AACGGTGGGC
TATGAGTTGA TGTGCGCGCT GGCGTTACGC GTCCCGGTTG TGACGGTGTA A
 
Protein sequence
MTRPIQASLD LQALKQNLSI VRQAAPRARV WSVVKANAYG HGIERIWSAL GATDGFALLN 
LEEAITLRER GWKGPILMLE GFFHAQDLEI YDQHRLTTCV HSNWQLKALQ NARLKAPLDI
YLKVNSGMNR LGFQPDRVLT VWQQLRAMAN VGEMTLMSHF AEAEHPDGIS SAMARIEQAA
EGLECRRSLS NSAATLWHPE AHFDWVRPGI ILYGASPSGQ WRDIANTGLR PVMTLSSEII
GVQTLKAGER VGYGGRYTAR DEQRIGIVAA GYADGYPRHA PTGTPVLVDG VRTMTVGTVS
MDMLAVDLTP CPQAGIGTPV ELWGKEIKID DVAAAAGTVG YELMCALALR VPVVTV
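
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates DNA codon by codon and stops at the first stop codon. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may initiate translation). The `translate` function and the constant names are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the 10-base blocks used above.
print(translate("ATGACCCGTC CGATACAGGC CAGCCTCGAT"))  # MTRPIQASLD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function is a quick way to compare against the listed protein.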