Gene ECH74115_4688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4688 
SymboltrpS 
ID6970200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4327910 
End bp4328914 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content53% 
IMG OID643388390 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_002272818 
Protein GI209397315 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.79843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAGC CCATCGTTTT TAGTGGCGCA CAGCCCTCAG GTGAATTGAC TATTGGTAAC 
TACATGGGTG CGCTGCGTCA GTGGGTAAAC ATGCAGGATG ACTACCATTG CATTTACTGT
ATCGTTGACC AACACGCGAT CACCGTGCGC CAGGATGCAC AGAAGCTGCG TAAAGCGACG
CTGGATACGC TGGCCTTGTA TCTGGCTTGT GGTATCGATC CTGAGAAAAG CACCATTTTT
GTTCAGTCCC ACGTACCGGA ACATGCGCAG TTAGGCTGGG CACTGAACTG CTATACCTAC
TTCGGCGAAC TGAGCCGCAT GACGCAGTTT AAAGATAAAT CTGCGCGTTA TGCCGAGAAC
ATCAACGCTG GTCTGTTTGA CTATCCGGTG CTGATGGCAG CGGACATCCT GCTGTATCAA
ACTAATCTGG TACCGGTGGG TGAAGACCAG AAACAGCACC TGGAACTGAG CCGTGATATC
GCCCAGCGTT TCAACGCGCT GTACGGCGAT ATCTTTAAGG TGCCGGAGCC GTTTATTCCG
AAATCCGGCG CGCGCGTAAT GTCGCTGCTG GAGCCGACCA AGAAGATGTC CAAGTCTGAC
GATAACCGCA ATAACGTTAT TGGCCTGCTG GAAGATCCGA AATCGGTAGT GAAGAAAATC
AAACGCGCGG TCACCGACTC CGACGAGCCG CCGGTAGTTC GCTACGATGT GCAGAACAAA
GCGGGCGTTT CCAACCTGCT GGATATCCTT TCTGCGGTAA CGGGCCAGAG CATCCCGGAA
CTGGAAAAAC AGTTCGAAGG CAAGATGTAT GGTCATCTGA AAGGCGAAGT GGCTGATGCC
GTTTCCGGTA TGCTGACTGA ATTGCAGGAA CGTTACCACC GTTTCCGCAA CGATGAAGCC
TTCCTGCAAC AGGTGATGAA AGACGGCGCA GAAAAAGCCA GCGCGCACGC TTCCCGTACG
CTAAAAGCGG TGTACGAAGC GATTGGTTTT GTGGCGAAGC CGTAA
 
Protein sequence
MTKPIVFSGA QPSGELTIGN YMGALRQWVN MQDDYHCIYC IVDQHAITVR QDAQKLRKAT 
LDTLALYLAC GIDPEKSTIF VQSHVPEHAQ LGWALNCYTY FGELSRMTQF KDKSARYAEN
INAGLFDYPV LMAADILLYQ TNLVPVGEDQ KQHLELSRDI AQRFNALYGD IFKVPEPFIP
KSGARVMSLL EPTKKMSKSD DNRNNVIGLL EDPKSVVKKI KRAVTDSDEP PVVRYDVQNK
AGVSNLLDIL SAVTGQSIPE LEKQFEGKMY GHLKGEVADA VSGMLTELQE RYHRFRNDEA
FLQQVMKDGA EKASAHASRT LKAVYEAIGF VAKP