Gene Information |
Locus tag | EcE24377A_0201 |
Symbol | proS |
ID | 5587814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 218919 |
End bp | 220637 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923928 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001461365 |
Protein GI | 157156008 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
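The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids: codon count minus one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(218919, 220637)
print(length)                           # 1719
print(expected_protein_length(length))  # 572
```

Both results agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields of the record.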
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00399222 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG ATGAACAACG CCGGTGCGAT CGAGGTGTCG ATGCCGGTGG TTCAGCCAGC TGATTTGTGG CAAGAGAGTG GTCGTTGGGA ACAGTACGGT CCGGAACTGC TGCGTTTTGT TGACCGTGGC GAGCGTCCGT TCGTACTCGG CCCAACTCAT GAAGAAGTTA TCACTGACCT GATTCGTAAC GAGCTTAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT TACTCTTTCC ATACTTCTCA GGAATCCCTG CAGGAAACCT ACGATGCAAT GTATGCGGCC TACAGCAAAA TCTTCAGCCG CATAGGGCTG GATTTCCGCG CCGTACAAGC CGACACCGGT TCTATCGGCG GCAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AACTGGCAGA AGCTATCGCG CCGAAAGAAC CGCGCGCTGC TGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTGAGAAAAC GGTTAAGACT CTGCTGGTTA AAGCGGTTGA AGGCAGTAGC TTCCCGCTGG TTGCGCTGCT GGTGCGCGGT GATCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCAAG CCCGCTGACT TTCGCGACCG AAGAAGAAAT TCGTGCCGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCGCT GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGCATCA ACTGGGATCG CGATGTCGCT ACCCCGGAAG TTGCAGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGCCAG GGTACGCTGC TGATCAAACG TGGTATCGAA GTTGGTCACA TCTTCCAGCT GGGTACCAAG TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACG ATGGGTTGCT ACGGTATCGG GGTAACGCGT GTGGTAGCTG CGGCGATTGA GCAGAACTAC GACGAACGAG GCATCGTATG GCCTGACGCT ATCGCGCCGT TCCAGGTGGC GATTCTGCCG ATGAACATGC ACAAATCCTT CCGCGTACAA GAGCTTGCTG AGAAACTGTA CAGCGAACTG CGTGCACAAG GTATCGAAGT GCTGCTGGAT GACCGCAAAG AGCGTCCGGG CGTGATGTTT GCTGATATGG AACTGATCGG TATTCCGCAC ACTATTGTGC TGGGCGACCG TAACCTCGAC AACGACGATA TCGAATATAA ATATCGTCGC AACGGCGAGA AACAGTTAAT TAAGACTGGT GACATCGTCG AATATCTGGT GAAACAGATT AAAGGCTGA
|
Protein sequence | MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE MNNAGAIEVS MPVVQPADLW QESGRWEQYG PELLRFVDRG ERPFVLGPTH EEVITDLIRN ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA YSKIFSRIGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIEKTVKT LLVKAVEGSS FPLVALLVRG DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFA AGANIDGKHY FGINWDRDVA TPEVADIRNV VAGDPSPDGQ GTLLIKRGIE VGHIFQLGTK YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD NDDIEYKYRR NGEKQLIKTG DIVEYLVKQI KG
|
| |
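The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence in the record above.
example = "ATGCGTACTA GCCAATACCT"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 45.0
```

Passing the full Gene sequence field to gc_percent gives a value that can be compared with the reported GC content after rounding.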