Gene ECH74115_2602 details

Gene Information

Locus tag: ECH74115_2602
Symbol: aspS
ID: 6971094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2456974
End bp: 2458746
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 54%
IMG OID: 643386467
Product: aspartyl-tRNA synthetase
Protein accession: YP_002270949
Protein GI: 209396974
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0173] Aspartyl-tRNA synthetase
TIGRFAM ID: [TIGR00459] aspartyl-tRNA synthetase, bacterial type
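
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2456974
end_bp = 2458746

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1773 0 590
```

Under these assumptions the result agrees with the listed Gene Length (1773 bp) and Protein Length (590 aa).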


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.445839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.911145
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTACAG AATATTGTGG ACAGCTCCGT TTGTCCCACG TGGGGCAGCA GGTGACTCTG 
TGTGGTTGGG TCAACCGTCG TCGTGATCTT GGTAGCCTGA TCTTCATCGA TATGCGCGAC
CGCGAAGGTA TCGTGCAGGT ATTTTTCGAT CCGGATCGTG CGGACGCGTT AAAGCTGGCC
TCTGAACTGC GTAATGAGTT CTGCATTCAG GTCACGGGCA CCGTACGTGC GCGTGACGAA
AAAAATATTA ACCGCGATAT GGCGACCGGC GAAATCGAAG TGCTGGCGTC CTCGCTGACT
ATCATCAACC GCGCAGATGT TCTGCCGCTT GACTCTAACC ACGTCAACAC CGAAGAAGCG
CGTCTGAAAT ACCGCTACCT CGACCTGCGT CGTCCGGAAA TGGCTCAGCG CCTGAAAACC
CGCGCTAAAA TCACCAGCCT GGTGCGCCGT TTTATGGATG ACCACGGTTT CCTCGACATC
GAAACTCCGA TGCTGACCAA AGCCACGCCG GAAGGCGCGC GTGACTACCT GGTGCCTTCT
CGTGTGCACA AAGGTAAATT CTACGCACTG CCGCAATCCC CACAGTTGTT CAAACAGCTG
CTGATGATGT CCGGCTTTGA CCGCTACTAT CAGATCGTTA AATGCTTCCG TGACGAAGAC
CTGCGCGCTG ATCGTCAGCC TGAATTTACT CAGATCGATG TGGAAACTTC TTTCATGACC
GCGCCGCAAG TGCGTGAAGT GATGGAAGCG CTGGTGCGCC ATCTGTGGCT GGAAGTGAAG
GGCGTGGATC TGGGCGATTT CCCAGTAATG ACCTTTGCGG AAGCCGAACG CCGTTATGGT
TCTGATAAAC CGGATCTGCG TAACCCGATG GAACTGACCG ACGTTGCTGA TCTGCTGAGA
TCTGTTGAGT TTGCAGTATT TGCAGGTCCG GCGAACGATC CGAAAGGTCG CGTTGCCGCT
CTGCGTGTTC CGGGTGGCGC ATCGCTGACC CGTAAGCAGA TCGACGAATA CGGTAACTTC
GTTAAAATCT ACGGCGCGAA AGGTCTGGCT TACATCAAAG TTAACGAACG CGCGAAAGGT
CTGGAAGGTA TCAACAGCCC GGTAGCGAAG TTCCTTAATG CAGAAATCAT CGAAGCCATC
CTGGATCGTA CTGCCGCGCA AGATGGCGAT ATGATTTTCT TCGGTGCCGA CAACAAGAAA
ATTGTTGCCG ACGCGATGGG CGCACTGCGC CTGAAAGTGG GTAAAGACCT TGGTCTGACC
GACGAAAGCA AATGGGCACC GCTGTGGGTT ATCGACTTCC CGATGTTTGA AGACGACGGT
GAAGGCGGCC TGACGGCAAT GCACCATCCG TTCACCTCAC CGAAAGACAT GACGGCTGCA
GAGCTGAAAG CTGCACCGGA AAATGCGGTG GCGAACGCTT ACGATATGGT CATCAATGGT
TACGAAGTGG GCGGTGGTTC AGTACGTATC CATAATGGTG ATATGCAGCA GACGGTGTTT
GGTATTCTGG GTATCAACGA AGAGGAACAG CGCGAGAAAT TCGGCTTCCT GCTCGACGCT
CTGAAATACG GTACTCCGCC GCACGCAGGT CTGGCATTCG GTCTTGACCG TCTGACCATG
CTGCTGACCG GCACCGACAA TATCCGTGAC GTTATCGCCT TCCCGAAAAC CACGGCGGCA
GCGTGTCTGA TGACTGAAGC ACCGAGCTTT GCTAACCCGG CTGCACTGGC TGAGCTGAGC
ATTCAGGTTG TGAAGAAGGC TGAGAATAAC TGA
 
Protein sequence
MRTEYCGQLR LSHVGQQVTL CGWVNRRRDL GSLIFIDMRD REGIVQVFFD PDRADALKLA 
SELRNEFCIQ VTGTVRARDE KNINRDMATG EIEVLASSLT IINRADVLPL DSNHVNTEEA
RLKYRYLDLR RPEMAQRLKT RAKITSLVRR FMDDHGFLDI ETPMLTKATP EGARDYLVPS
RVHKGKFYAL PQSPQLFKQL LMMSGFDRYY QIVKCFRDED LRADRQPEFT QIDVETSFMT
APQVREVMEA LVRHLWLEVK GVDLGDFPVM TFAEAERRYG SDKPDLRNPM ELTDVADLLR
SVEFAVFAGP ANDPKGRVAA LRVPGGASLT RKQIDEYGNF VKIYGAKGLA YIKVNERAKG
LEGINSPVAK FLNAEIIEAI LDRTAAQDGD MIFFGADNKK IVADAMGALR LKVGKDLGLT
DESKWAPLWV IDFPMFEDDG EGGLTAMHHP FTSPKDMTAA ELKAAPENAV ANAYDMVING
YEVGGGSVRI HNGDMQQTVF GILGINEEEQ REKFGFLLDA LKYGTPPHAG LAFGLDRLTM
LLTGTDNIRD VIAFPKTTAA ACLMTEAPSF ANPAALAELS IQVVKKAENN
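
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the method used to produce the record: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it allows), strips the spaces that separate the 10-base groups, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# of translation table 11 match the standard code; alternative start codons
# (read as M at position 1) are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGTACAG AATATTGTGG ACAGCTCCGT"))  # MRTEYCGQLR
```

Passing the full gene sequence to `translate` should give a protein that can be compared with the listed protein sequence after removing its spaces, and `gc_content` can be compared with the GC content field.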