Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01677 |
Symbol | thrS |
ID | 8112870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1744372 |
End bp | 1746300 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644847900 |
Product | hypothetical protein |
Protein accession | YP_002999473 |
Protein GI | 251785169 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0538316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGATCACGC TGTAAGCCCC ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCGATCATT ACCGCCAAAG ACGAAGAAGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG CACGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC TGGCACGAAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT ACGGCGTGGG CAGACAAAAA AGCACTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCG CCGGGTATGG TATTCTGGCA CAACGACGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACA TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT TTCAACCAGG GGCTGAAGTC TTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC TGCCACCGTA ACGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC CAGGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC GGTACAGTAC AGCTGGACTT CTCTTTGCCG TCTCGTCTGA GCGCTTCTTA TGTAGGCGAA GACAATGAAC GTAAAGTACC GGTAATGATT CACCGCGCAA TTCTGGGGTC GATGGAACGT TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCGGTT CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGAAT ACGTTAACGA ATTGACGCAA AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT AAAATCCGCG AGCACACTTT GCGTCGCGTC CCATATATGC TGGTCTGTGG TGATAAAGAG GTGGAATCAG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC GTAAATGAAG TGATCGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG GAGGAATAA
|
Protein sequence | MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL EKRMHELAEK NYDVIKKKVS WHEARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV QVVIMNITDS QSEYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE
|
| |