Gene EcSMS35_0206 details


Gene Information

Locus tag: EcSMS35_0206
Symbol: proS
ID: 6142892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 226194
End bp: 227912
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 54%
IMG OID: 641615107
Product: prolyl-tRNA synthetase
Protein accession: YP_001742323
Protein GI: 170680577
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
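
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(226194, 227912)
print(length)                  # 1719
print(protein_length(length))  # 572
```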


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.835118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG 
ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT
ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG
ATGAACAACG CCGGTGCGAT CGAGGTGTTA ATGCCGGTGG TTCAGCCATC TGAACTGTGG
CAAGAGAGTG GTCGTTGGGA ACAGTATGGC CCGGAATTGC TGCGTATTGC TGACCGTGGT
GACCGTCCGT TCGTACTTGG CCCAACTCAT GAAGAAGTGA TTACCGACCT GATTCGTAAC
GAGCTGAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC
GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT
TACTCTTTCC ATACTTCTCA GGAATCCTTA CAGGAAACCT ACGATGCAAT GTATGCGGCC
TACAGCAAAA TCTTCAGCCG CATGGGGCTG GATTTCCGCG CCGTACAGGC CGACACCGGT
TCTATCGGCG GTAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT
GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AGCTGGCAGA AGCTATCGCG
CCGAAAGAAC CGCGCGCTGC TGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG
AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTGAGAAAAC GGTTAAGACT
CTGCTGGTTA AAGCCGTTGA AGGCAGTAGC TTCCCGCTGG TTGCGCTGCT GGTGCGCGGT
GACCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCCAG CCCGCTGACT
TTTGCGACCG AAGAAGAAAT TCGTGCCGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG
GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCGCT
GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGTATCA ACTGGGATCG CGATGTCGCT
ACCCCGGAAA TTGCTGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGTCAG
GGTACGCTGC TGATCAAACG TGGTATCGAA GTCGGTCACA TCTTCCAGCT GGGTACCAAG
TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACG
ATGGGTTGCT ACGGTATCGG TGTAACGCGT GTGGTAGCAG CGGCGATTGA GCAGAACTAC
GACGAACGCG GCATCGTATG GCCTGACGCT ATCGCGCCGT TCCAGGTGGC GATTCTGCCA
ATGAACATGC ACAAATCCTT CCGCGTACAG GAACTTGCTG AGAAACTGTA CAGCGAACTG
CGCGCACAAG GTATCGAAGT GCTGCTGGAT GACCGCAAAG AGCGTCCGGG CGTGATGTTT
GCTGATATGG AACTGATCGG TATTCCGCAC ACTATCGTGC TGGGCGACCG TAACCTCGAC
AACGACGATA TCGAATATAA ATATCGTCGT AACGGCGAGA AACAGTTAAT TAAGACTGGT
GACATCGTCG ATTATCTGGT GAAACAGATT AAAGGCTGA
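
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch of that cleanup, with a simple GC fraction function. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_content(seq))  # GC fraction of this 60-base sample only
```

Running the same two functions on the full sequence text would give the length and GC fraction to compare against the Gene Length and GC content fields.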
 
Protein sequence
MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE 
MNNAGAIEVL MPVVQPSELW QESGRWEQYG PELLRIADRG DRPFVLGPTH EEVITDLIRN
ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA
YSKIFSRMGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA
PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIEKTVKT LLVKAVEGSS FPLVALLVRG
DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFA
AGANIDGKHY FGINWDRDVA TPEIADIRNV VAGDPSPDGQ GTLLIKRGIE VGHIFQLGTK
YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP
MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD
NDDIEYKYRR NGEKQLIKTG DIVDYLVKQI KG
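
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is one way to do that without external libraries. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons, which this sketch does not model). The `translate` function is illustrative only.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# assumed here to match translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGCGTACTAGCCAATACCTGCTCTCCACT"))  # MRTSQYLLST
print(translate("AAAGGCTGA"))                       # KG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.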