Gene ECH74115_0205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0205 
SymbolproS 
ID6970459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp220369 
End bp222087 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content55% 
IMG OID643384280 
Productprolyl-tRNA synthetase 
Protein accessionYP_002268803 
Protein GI209398549 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0288028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACTA GCCAATACCT GCTCTCCACT CTCAAGGAGA CACCTGCCGA CGCCGAGGTG 
ATCAGCCATC AGCTGATGCT GCGCGCCGGG ATGATCCGCA AGCTGGCCTC CGGGTTATAT
ACCTGGCTGC CGACCGGCGT GCGCGTTCTG AAAAAAGTCG AAAACATCGT GCGTGAAGAG
ATGAACAACG CCGGTGCAAT CGAGGTGTCG ATGCCGGTGG TTCAGCCAGC CGATTTGTGG
CAAGAGAGTG GTCGTTGGGA ACAGTACGGT CCGGAACTGC TGCGTTTTGT TGACCGTGGC
GAGCGTCCGT TCGTACTCGG CCCAACTCAT GAAGAAGTTA TCACTGACCT GATTCGTAAC
GAGCTTAGCT CTTACAAACA GCTGCCGCTG AACTTCTATC AGATCCAGAC CAAGTTCCGC
GACGAAGTGC GTCCGCGTTT CGGCGTCATG CGTTCCCGCG AATTCCTGAT GAAAGATGCT
TACTCTTTCC ATACTTCTCA GGAATCCCTG CAGGAAACCT ACGATGCAAT GTATGCGGCC
TACAGCAAAA TCTTCAGCCG CATGGGGCTG GATTTCCGCG CCGTACAGGC CGACACCGGT
TCTATCGGCG GCAGCGCCTC TCACGAATTC CAGGTGCTGG CGCAGAGCGG TGAAGACGAT
GTGGTCTTCT CCGACACCTC TGACTATGCA GCGAACATTG AGCTGGCAGA AGCTATCGCG
CCGAAAGAAC CGCGCGCTGC TGCTACCCAG GAAATGACGC TGGTTGATAC GCCGAACGCG
AAAACCATCG CGGAACTGGT TGAACAGTTC AATCTGCCGA TTAAGAAAAC GGTTAAGACT
CTGCTGGTTA AAGCGGTTGA AGGCAGTAGC TTCCCGCTGG TTGCGCTGCT GGTGCGCGGT
GATCACGAGC TGAACGAAGT TAAAGCAGAA AAACTGCCGC AGGTTGCCAG CCCGCTGACT
TTCGCGACCG AAGAAGAAAT TCGTGCCGTG GTTAAAGCCG GTCCGGGTTC ACTGGGTCCG
GTAAACATGC CGATTCCGGT GGTGATTGAC CGTACCGTTG CGGCGATGAG TGATTTCACT
GCTGGTGCTA ACATCGATGG TAAACACTAC TTCGGCATCA ACTGGGATCG CGATGTCGCT
ACCCCGGAAG TTGCGGATAT CCGTAACGTG GTGGCTGGCG ATCCAAGCCC GGATGGCCAG
GGTACGCTGC TGATCAAACG TGGTATCGAA GTTGGTCACA TCTTCCAGCT GGGTACCAAG
TACTCCGAAG CACTGAAAGC CTCCGTACAG GGTGAAGATG GCCGTAACCA AATCCTGACT
ATGGGTTGCT ACGGTATCGG GGTAACGCGC GTAGTGGCTG CGGCGATTGA GCAGAACTAC
GACGAGCGCG GCATCGTATG GCCTGACGCT ATCGCGCCGT TCCAGGTGGC AATTCTGCCG
ATGAACATGC ACAAATCCTT CCGCGTACAA GAGCTTGCTG AGAAACTGTA CAGCGAACTG
CGTGCACAAG GTATCGAAGT GCTGCTGGAT GACCGAAAAG AACGTCCGGG CGTGATGTTT
GCCGATATGG AACTGATCGG TATTCCGCAC ACTATTGTGC TGGGCGACCG TAACCTCGAC
AACGACGATA TCGAATATAA ATATCGTCGC AACGGCGAGA AACAGTTAAT TAAGACTGGT
GACATCGTCG AATATCTGGT GAAACAGATT AAAGGCTGA
 
Protein sequence
MRTSQYLLST LKETPADAEV ISHQLMLRAG MIRKLASGLY TWLPTGVRVL KKVENIVREE 
MNNAGAIEVS MPVVQPADLW QESGRWEQYG PELLRFVDRG ERPFVLGPTH EEVITDLIRN
ELSSYKQLPL NFYQIQTKFR DEVRPRFGVM RSREFLMKDA YSFHTSQESL QETYDAMYAA
YSKIFSRMGL DFRAVQADTG SIGGSASHEF QVLAQSGEDD VVFSDTSDYA ANIELAEAIA
PKEPRAAATQ EMTLVDTPNA KTIAELVEQF NLPIKKTVKT LLVKAVEGSS FPLVALLVRG
DHELNEVKAE KLPQVASPLT FATEEEIRAV VKAGPGSLGP VNMPIPVVID RTVAAMSDFT
AGANIDGKHY FGINWDRDVA TPEVADIRNV VAGDPSPDGQ GTLLIKRGIE VGHIFQLGTK
YSEALKASVQ GEDGRNQILT MGCYGIGVTR VVAAAIEQNY DERGIVWPDA IAPFQVAILP
MNMHKSFRVQ ELAEKLYSEL RAQGIEVLLD DRKERPGVMF ADMELIGIPH TIVLGDRNLD
NDDIEYKYRR NGEKQLIKTG DIVEYLVKQI KG