Gene EcolC_1913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1913 
Symbol 
ID6066916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2114119 
End bp2116047 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID641601324 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001724886 
Protein GI170019932 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.299034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGACCACGC TGTAAGCCCC 
ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT
AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCCATCATT
ACCGCCAAAG ACGAAGACGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG
CACGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC
GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC
GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT
GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT
ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA
ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT
ACGGCGTGGG CAGACAAAAA AGCACTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG
AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCG
CCGGGTATGG TATTCTGGCA CAACGACGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT
GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC
CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACG
TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT
TTTAACCAGG GGCTGAAGTC TTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC
TGCCACCGTA ACGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC
CAGGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT
ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC
TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC
CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT
TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTGCCG TCTCGTCTGA GCGCCTCCTA TGTGGGCGAA
GACAACGAGC GTAAGGTACC GGTAATGATT CACCGCGCAA TTCTTGGGTC GATGGAACGT
TTCATCGGTA TCCTGACCGA AGAATTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCGGTT
CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGATT ACGTTAACGA ATTGACGCAA
AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT GCGTCGCGTC CCATATATGC TGGTCTGTGG TGATAAAGAG
GTGGAATCTG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC
GTAAATGAAG TGATCGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII 
TAKDEDGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL
STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVIMNITDS QSDYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE