Gene EcHS_A1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1800 
SymbolthrS 
ID5595060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1817443 
End bp1819371 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID640920947 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001458499 
Protein GI157161181 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000021107 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGACCACGC TGTAAGCCCC 
ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT
AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCCATCATT
ACCGCCAAAG ACGAAGACGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG
CACGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC
GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC
GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT
GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT
ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA
ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT
ACGGCGTGGG CAGACAAAAA AGCACTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG
AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCG
CCGGGTATGG TATTCTGGCA CAACGACGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT
GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC
CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACG
TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT
TTTAACCAGG GGCTGAAGTC TTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC
TGCCACCGTA ACGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC
CAGGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT
ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC
TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC
CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT
TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTGCCG TCTCGTCTGA GCGCCTCCTA TGTGGGCGAA
GACAACGAGC GTAAGGTACC GGTAATGATT CACCGCGCAA TTCTTGGGTC GATGGAACGT
TTCATCGGTA TCCTGACCGA AGAATTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCGGTT
CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGATT ACGTTAACGA ATTGACGCAA
AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT GCGTCGCGTC CCATATATGC TGGTCTGTGG TGATAAAGAG
GTGGAATCTG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC
GTAAATGAAG TGATCGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII 
TAKDEDGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL
STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVIMNITDS QSDYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE