Gene EcSMS35_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1472 
SymbolthrS 
ID6147349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1456318 
End bp1458246 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID641616350 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001743530 
Protein GI170680212 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000205456 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGACCACGC TGTAAGCCCC 
ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT
AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCGATCATT
ACCGCCAAAG ACGAAGAAGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG
CACGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC
GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC
GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT
GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT
ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA
ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT
ACGGCGTGGG CAGACAAAAA AGCGCTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG
AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCA
CCGGGTATGG TATTCTGGCA CAACGATGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT
GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC
CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACG
TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT
TTCAACCAGG GGCTGAAGTC GTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC
TGCCACCGTA ATGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC
CAGGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT
ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC
TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC
CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT
TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTGCCG TCTCGTCTGA GCGCCTCCTA TGTGGGCGAA
GACAACGAGC GTAAGGTACC GGTAATGATT CACCGCGCAA TTCTGGGGTC GATGGAACGT
TTCATCGGTA TCCTGACCGA AGAATTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCGGTT
CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGAAT ACGTTAACGA ATTGACGCAA
AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT GCGTCGCGTC CCGTATATGC TGGTCTGTGG TGATAAAGAG
GTGGAATCAG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC
GTAAATGAAG TGATCGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL
STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVIMNITDS QSEYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE