Gene ECD_01688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01688 
SymbolthrS 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1745057 
End bp1746985 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID 
Productthreonyl-tRNA synthetase 
Protein accessionACT43544 
Protein GI253977874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0363313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGATCACGC TGTAAGCCCC 
ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT
AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCGATCATT
ACCGCCAAAG ACGAAGAAGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG
CACGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC
GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC
GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT
GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT
ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA
ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT
ACGGCGTGGG CAGACAAAAA AGCACTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG
AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCG
CCGGGTATGG TATTCTGGCA CAACGACGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT
GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC
CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACA
TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT
TTCAACCAGG GGCTGAAGTC TTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC
TGCCACCGTA ACGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC
CAGGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT
ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC
TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC
CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT
TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCTTTGCCG TCTCGTCTGA GCGCTTCTTA TGTAGGCGAA
GACAATGAAC GTAAAGTACC GGTAATGATT CACCGCGCAA TTCTGGGGTC GATGGAACGT
TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCGGTT
CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGAAT ACGTTAACGA ATTGACGCAA
AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT GCGTCGCGTC CCATATATGC TGGTCTGTGG TGATAAAGAG
GTGGAATCAG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC
GTAAATGAAG TGATCGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL
STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVIMNITDS QSEYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE