Gene EcSMS35_2227 details

Gene Information

Locus tag: EcSMS35_2227
Symbol: serS
ID: 6144327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2246505
End bp: 2247797
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID: 641617103
Product: seryl-tRNA synthetase
Protein accession: YP_001744277
Protein GI: 170683015
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0172] Seryl-tRNA synthetase
TIGRFAM ID: [TIGR00414] seryl-tRNA synthetase
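
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the record states neither, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as given in this record.
length = gene_length(2246505, 2247797)
print(length, protein_length(length))
```

With the values in this record, the two functions give 1293 bp and 430 aa, matching the reported gene and protein lengths.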


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.495423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGATC CCAATCTGCT GCGTAATGAG CCAGACGCAG TCGCTGAAAA ACTGGCACGC 
CGGGGCTTTA AGCTGGATGT AGATAAGCTG GGCGCTCTTG AAGAGCGTCG TAAAGTATTG
CAGGTCAAAA CGGAAAACCT GCAAGCGGAG CGTAACTCCC GATCGAAATC CATTGGCCAG
GCGAAAGCGC GCGGGGAAGA TATCGAGCCT TTACGTCTGG AAGTGAACAA ACTGGGCGAA
GAGCTGGATG CAGCAAAAGC CGAGCTGGAT GCTTTACAGG CTGAAATTCG CGATATCGCG
CTAACCATCC CTAACCTGCC TGCAGATGAA GTGCCGGTAG GTAAAGACGA AAATGACAAC
GTTGAAGTCA GCCGCTGGGG CACCCCGCGT GAGTTTGACT TTGAAGTTCG TGACCATGTG
ACGCTGGGTG AAATGCACTC TGGCCTCGAC TTTGCAGCCG CAGTTAAGTT GACTGGTTCC
CGCTTTGTGG TAATGAAAGG GCAGATTGCT CGCATGCACC GCGCACTGTC GCAGTTTATG
CTGGATCTGC ATACCGAACA GCATGGCTAC AGTGAGAACT ATGTTCCGTA CCTGGTTAAC
CAGGACACGC TGTACGGTAC GGGGCAACTG CCGAAATTTG CTGGCGATCT GTTCCATACT
CGTCCGCTGG AAGAAGAAGC AGACACCAGT AACTATGCGC TGATCCCAAC GGCAGAAGTT
CCGCTGACCA ACCTGGTACG CGGTGAAATC ATCGATGAAG ATGATCTGCC AATTAAGATG
ACCGCCCACA CCCCATGTTT CCGTTCTGAA GCTGGTTCAT ATGGTCGTGA CACTCGTGGT
CTGATCCGTA TGCACCAGTT CGACAAAGTT GAAATGGTGC AGATCGTGCG CCCGGAAGAC
TCAATGGCGG CGCTGGAAGA GATGACCGGT CATGCGGAAA AAGTCCTGCA GCTGCTGGGC
CTGCCGTACC GCAAAATCAT CCTTTGCACC GGCGACATGG GCTTTGGTGC TTGCAAAACT
TACGACCTGG AAGTATGGAT CCCGGCGCAA AACACCTACC GCGAGATCTC TTCATGCTCT
AACGTCTGGG ATTTCCAGGC ACGTCGTATG CAGGCACGCT GCCGCAGCAA GTCTGACAAG
AAAACCCGTC TGGTTCATAC CCTGAACGGT TCTGGTCTGG CTGTTGGCCG TACGCTGGTT
GCGGTAATGG AAAACTATCA GCAGGCTGAT GGTCGTATTG AAGTACCAGA AGTTCTACGT
CCGTATATGA ACGGACTGGA ATATATTGGC TAA
 
Protein sequence
MLDPNLLRNE PDAVAEKLAR RGFKLDVDKL GALEERRKVL QVKTENLQAE RNSRSKSIGQ 
AKARGEDIEP LRLEVNKLGE ELDAAKAELD ALQAEIRDIA LTIPNLPADE VPVGKDENDN
VEVSRWGTPR EFDFEVRDHV TLGEMHSGLD FAAAVKLTGS RFVVMKGQIA RMHRALSQFM
LDLHTEQHGY SENYVPYLVN QDTLYGTGQL PKFAGDLFHT RPLEEEADTS NYALIPTAEV
PLTNLVRGEI IDEDDLPIKM TAHTPCFRSE AGSYGRDTRG LIRMHQFDKV EMVQIVRPED
SMAALEEMTG HAEKVLQLLG LPYRKIILCT GDMGFGACKT YDLEVWIPAQ NTYREISSCS
NVWDFQARRM QARCRSKSDK KTRLVHTLNG SGLAVGRTLV AVMENYQQAD GRIEVPEVLR
PYMNGLEYIG
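
The sequences above are printed in space-separated blocks of ten characters, so the blocks need to be joined before lengths or base composition can be computed. A minimal sketch follows; the helper names are illustrative, and the example uses only the final line of the gene sequence rather than the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as printed in this record.
last_line = "CCGTATATGA ACGGACTGGA ATATATTGGC TAA"
seq = join_blocks(last_line)
print(len(seq), seq[-3:], round(gc_fraction(seq), 3))
```

Applied to the whole gene sequence, the same two helpers would yield the full length and a GC fraction that can be compared with the reported gene length and GC content.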