Gene Information |
Locus tag | B21_00192 |
Symbol | proS |
ID | 8114403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 219894 |
End bp | 221612 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644846484 |
Product | hypothetical protein |
Protein accession | YP_002998057 |
Protein GI | 251783753 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
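The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length_bp = gene_length(219894, 221612)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```

The strand value (-) does not change the arithmetic; it only indicates that the coding sequence is read from the reverse complement of the replicon.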
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.165421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG ATGAACAACG CCGGTGCAAT CGAGGTGTCG ATGCCGGTGG TTCAGCCAGC CGATTTGTGG CAAGAGAGTG GTCGTTGGGA ACAGTACGGT CCGGAACTGC TGCGTTTTGT TGACCGTGGC GAGCGTCCGT TCGTACTCGG CCCAACTCAT GAAGAAGTTA TCACTGACCT GATTCGTAAC GAGCTTAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT TACTCTTTCC ATACTTCCCA GGAATCCCTG CAGGAAACCT ACGATGCAAT GTATGCGGCC TACAGCAAAA TCTTCAGCCG CATGGGGCTG GATTTCCGCG CCGTACAGGC CGACACCGGT TCTATCGGCG GCAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AACTGGCAGA AGCTATCGCG CCGAAAGAAC CGCGCGCTGC GGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTGAGAAAAC GGTTAAGACT CTGCTGGTTA AAGCGGTTGA AGGCAGTAGC TTCCCGTTAG TTGCGCTGCT GGTGCGCGGT GACCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCAAG CCCGCTGACT TTCGCGACCG AAGAAGAAAT TCGTGCTGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCGCT GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGCATCA ACTGGGATCG CGATGTCGCT ACCCCGGAAG TTGCGGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGCCAG GGTACGCTGC TGATCAAACG TGGTATCGAA GTTGGTCACA TCTTCCAGCT GGGTACCAAG TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACT ATGGGTTGCT ACGGTATCGG GGTAACGCGC GTTGTGGCTG CGGCGATTGA GCAGAACTAC GACGAACGCG GCATCGTATG GCCTGACGCG ATCGCACCGT TCCAGGTGGC GATTCTGCCG ATGAACATGC ACAAATCCTT CCGCGTACAG GAGCTTGCTG AGAAACTGTA CAGCGAACTG CGTGCACAAG GTATCGAAGT GCTGCTGGAT GACCGCAAAG AGCGTCCGGG CGTGATGTTT GCTGATATGG AACTGATCGG TATTCCGCAC ACTATTGTGC TGGGCGACCG TAACCTCGAC AACGACGATA TCGAATATAA ATATCGTCGC AACGGCGAGA AACAGTTAAT TAAGACTGGT GACATCGTCG AATATCTGGT GAAACAGATT AAAGGCTGA
|
Protein sequence | MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE MNNAGAIEVS MPVVQPADLW QESGRWEQYG PELLRFVDRG ERPFVLGPTH EEVITDLIRN ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA YSKIFSRMGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIEKTVKT LLVKAVEGSS FPLVALLVRG DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFA AGANIDGKHY FGINWDRDVA TPEVADIRNV VAGDPSPDGQ GTLLIKRGIE VGHIFQLGTK YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD NDDIEYKYRR NGEKQLIKTG DIVEYLVKQI KG
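The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short sketch. It assumes the sequences are pasted in the 10-character blocks shown above (whitespace is stripped first) and that the start codon is ATG, as it is here; translation table 11 assigns the same amino acids to elongation codons as the standard code and differs only in its permitted start codons, which this sketch does not handle. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Elongation codons in translation table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGCGTACTA GCCAATACCT GCTCTCCACT"))  # MRTSQYLLST
# Last nine bases: two codons followed by the TGA stop.
print(translate("AAAGGCTGA"))                         # KG
```

Passing the full gene sequence to `translate` and `gc_fraction` is the way to compare against the Protein sequence and GC content fields of this record.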
|
| |