Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0206 |
Symbol | proS |
ID | 6142892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 226194 |
End bp | 227912 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615107 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001742323 |
Protein GI | 170680577 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.835118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG ATGAACAACG CCGGTGCGAT CGAGGTGTTA ATGCCGGTGG TTCAGCCATC TGAACTGTGG CAAGAGAGTG GTCGTTGGGA ACAGTATGGC CCGGAATTGC TGCGTATTGC TGACCGTGGT GACCGTCCGT TCGTACTTGG CCCAACTCAT GAAGAAGTGA TTACCGACCT GATTCGTAAC GAGCTGAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT TACTCTTTCC ATACTTCTCA GGAATCCTTA CAGGAAACCT ACGATGCAAT GTATGCGGCC TACAGCAAAA TCTTCAGCCG CATGGGGCTG GATTTCCGCG CCGTACAGGC CGACACCGGT TCTATCGGCG GTAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AGCTGGCAGA AGCTATCGCG CCGAAAGAAC CGCGCGCTGC TGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTGAGAAAAC GGTTAAGACT CTGCTGGTTA AAGCCGTTGA AGGCAGTAGC TTCCCGCTGG TTGCGCTGCT GGTGCGCGGT GACCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCCAG CCCGCTGACT TTTGCGACCG AAGAAGAAAT TCGTGCCGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCGCT GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGTATCA ACTGGGATCG CGATGTCGCT ACCCCGGAAA TTGCTGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGTCAG GGTACGCTGC TGATCAAACG TGGTATCGAA GTCGGTCACA TCTTCCAGCT GGGTACCAAG TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACG ATGGGTTGCT ACGGTATCGG TGTAACGCGT GTGGTAGCAG CGGCGATTGA GCAGAACTAC GACGAACGCG GCATCGTATG GCCTGACGCT ATCGCGCCGT TCCAGGTGGC GATTCTGCCA ATGAACATGC ACAAATCCTT CCGCGTACAG GAACTTGCTG AGAAACTGTA CAGCGAACTG CGCGCACAAG GTATCGAAGT GCTGCTGGAT GACCGCAAAG AGCGTCCGGG CGTGATGTTT GCTGATATGG AACTGATCGG TATTCCGCAC ACTATCGTGC TGGGCGACCG TAACCTCGAC AACGACGATA TCGAATATAA ATATCGTCGT AACGGCGAGA AACAGTTAAT TAAGACTGGT GACATCGTCG ATTATCTGGT GAAACAGATT AAAGGCTGA
|
Protein sequence | MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE MNNAGAIEVL MPVVQPSELW QESGRWEQYG PELLRIADRG DRPFVLGPTH EEVITDLIRN ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA YSKIFSRMGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIEKTVKT LLVKAVEGSS FPLVALLVRG DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFA AGANIDGKHY FGINWDRDVA TPEIADIRNV VAGDPSPDGQ GTLLIKRGIE VGHIFQLGTK YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD NDDIEYKYRR NGEKQLIKTG DIVDYLVKQI KG
|
| |