Gene ECH74115_3944 details


Gene Information

Locus tag: ECH74115_3944
Symbol: alaS
ID: 6970116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3647545
End bp: 3650175
Gene Length: 2631 bp
Protein Length: 876 aa
Translation table: 11
GC content: 54%
IMG OID: 643387717
Product: alanyl-tRNA synthetase
Protein accession: YP_002272160
Protein GI: 209398791
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0013] Alanyl-tRNA synthetase
TIGRFAM ID: [TIGR00344] alanine--tRNA ligase
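
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3647545
end_bp = 3650175

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2631
print(protein_length)  # 876
```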


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000803556
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.620679
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA GCACCGCTGA GATCCGTCAG GCGTTTCTCG ACTTTTTCCA TAGTAAGGGA 
CATCAGGTAG TTGCCAGCAG CTCCCTGGTA CCCCATAACG ACCCAACTTT GTTGTTTACC
AACGCCGGGA TGAACCAGTT CAAGGATGTG TTCCTTGGGC TCGACAAGCG TAATTATTCC
CGCGCTACCA CTTCCCAACG CTGCGTGCGT GCGGGTGGTA AACACAACGA CCTGGAAAAC
GTCGGTTACA CCGCGCGTCA CCATACCTTC TTCGAAATGC TGGGCAACTT CAGCTTCGGC
GACTATTTCA AACACGATGC CATTCAGTTT GCATGGGAAC TGCTGACCAG CGAAAAATGG
TTTGCCCTGC CGAAAGAGCG TCTGTGGGTT ACCGTCTATG AAAGCGACGA CGAAGCCTAC
GAAATCTGGG AAAAAGAAGT AGGGATCCCG CGCGAACGTA TTATTCGCAT CGGCGATAAC
AAAGGTGCGC CATACGCATC TGACAACTTC TGGCAGATGG GTGACACTGG TCCGTGCGGC
CCGTGCACCG AAATCTTCTA CGATCACGGC GACCACATTT GGGGTGGCCC TCCGGGAAGT
CCGGAAGAAG ACGGCGACCG CTACATTGAG ATCTGGAACA TCGTCTTCAT GCAGTTCAAC
CGCCAGGCCG ATGGCACGAT GGAACCGCTG CCGAAGCCGT CTGTAGATAC CGGTATGGGT
CTGGAGCGTA TTGCTGCGGT GCTGCAACAC GTTAACTCTA ACTATGACAT CGACCTGTTC
CGCACGTTGA TCCAGGCGGT AGCGAAAGTC ACTGGCGCAA CCGATCTGAG CAATAAATCG
CTGCGCGTAA TCGCTGACCA CATTCGTTCT TGTGCGTTCC TGATCGCGGA TGGCGTAATG
CCGTCCAATG AAAACCGTGG TTATGTACTG CGTCGTATCA TTCGTCGCGC AGTGCGTCAC
GGTAATATGC TCGGCGCGAA AGAAACCTTC TTCTACAAAC TGGTTGGTCC GCTGATCGAC
GTTATGGGCT CTGCGGGTGA AGACCTGAAA CGCCAGCAGG CGCAGGTTGA GCAGGTGCTG
AAGACTGAAG AAGAGCAGTT TGCTCGTACT CTGGAGCGCG GTCTGGCGTT GCTGGATGAA
GAGCTGGCAA AACTTTCTGG TGATACGCTG GATGGTGAAA CTGCTTTCCG TCTGTACGAC
ACCTATGGCT TCCCGGTTGA CCTGACGGCT GATGTTTGTC GTGAGCGCAA CATCAAAGTT
GACGAAGCTG GTTTTGAAGC AGCAATGGAA GAGCAGCGTC GTCGGGCGCG CGAAGCCAGC
GGCTTTGGTG CCGATTACAA CGCAATGATC CGTGTTGACA GTGCATCTGA ATTTAAAGGC
TATGACCATC TGGAACTGAA CGGCAAAGTG ACCGCGCTGT TTGTTGATGG TAAAGCGGTT
GATGCCATCA ATGCAGGCCA GGAAGCTGTG GTCGTGCTGG ATCAAACGCC ATTCTATGCG
GAATCCGGCG GTCAGGTTGG TGATAAAGGC GAACTGAAAG GCGCTAACTT CTCCTTCGTG
GTGGAAGATA CGCAGAAATA CGGCCAGGCG ATTGGTCACA TCGGTAAACT TGCTGCGGGT
TCTCTGAAAG TGGGCGACGC TGTGCAGGCT GATGTTGATG AGGCTCGTCG CGCCCGTATT
CGTTTGAATC ACTCCGCAAC GCACCTGATG CACGCTGCGC TGCGCCAGGT TCTGGGGACT
CATGTATCGC AGAAAGGTTC ACTGGTTAAC GACAAGGTGC TGCGCTTCGA CTTCTCACAC
AACGAAGCGA TGAAACCGGA AGAGATTCGT GCGGTCGAAG ACCTGGTGAA CGCACAGATT
CGTCGCAATT TGCCGATCGA AACCAACATC ATGGATCTCG AAGCGGCGAA AGCGAAAGGT
GCGATGGCGC TGTTCGGCGA GAAGTATGAT GAGCGTGTAC GCGTGCTGAG CATGGGCGAT
TTCTCCACCG AGCTGTGTGG CGGTACTCAC GCCAGCCGCA CTGGTGATAT TGGTCTGTTC
CGCATCATCT CTGAATCGGG TACTGCTGCA GGCGTTCGTC GTATCGAAGC GGTAACCGGA
GAAGGCGCTA TCACCACCGT TCATGCAGAC AGCGATCGCT TAAGCGAAGT CGCGCATCTG
CTGAAAGGCG ATAGCAATAA TCTGGCGGAT AAAGTGCGCT CAGTACTGGA ACGTACGCGT
CAGTTAGAAA AAGAACTACA ACAGCTTAAA GAACAAGCTG CCGCACAGGA GAGCGCAAAT
CTTTCCAGTA AGGCAATTGA TGTTAATGGT GTTAAGCTGT TGGTTAGCGA GCTTAGCGGT
GTTGAGCCGA AAATGTTGCG TACCATGGTT GACGATTTAA AAAATCAGCT GGGGTCGACA
ATTATCGTGC TGGCAACGGT AGCCGAAGGT AAGGTTTCTC TGATTGCAGG CGTATCTAAG
GACGTCACAG ATCGTGTGAA AGCAGGGGAA CTGATTGGTA TGGTCGCTCA GCAGGTGGGC
GGCAAGGGTG GTGGACGTCC TGACATGGCG CAAGCCGGTG GTACGGATGC TGCGGCCTTA
CCTGCAGCGT TAGCCAGTGT GAAAGGCTGG GTCAGCGCGA AATTGCAATA A
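
The GC content reported in the gene record can be recomputed from a sequence laid out this way. This is a minimal sketch, assuming the sequence is pasted as plain text with the same 10-base spacing; `clean_sequence` and `gc_content` are hypothetical helper names, and the example input is only the first 20 bases of the gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, spacing preserved.
fragment = "ATGAGCAAGA GCACCGCTGA"
print(clean_sequence(fragment))
print(gc_content(fragment))
```

Passing the whole gene sequence instead of the fragment gives a value that can be compared with the GC content listed in the record.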
 
Protein sequence
MSKSTAEIRQ AFLDFFHSKG HQVVASSSLV PHNDPTLLFT NAGMNQFKDV FLGLDKRNYS 
RATTSQRCVR AGGKHNDLEN VGYTARHHTF FEMLGNFSFG DYFKHDAIQF AWELLTSEKW
FALPKERLWV TVYESDDEAY EIWEKEVGIP RERIIRIGDN KGAPYASDNF WQMGDTGPCG
PCTEIFYDHG DHIWGGPPGS PEEDGDRYIE IWNIVFMQFN RQADGTMEPL PKPSVDTGMG
LERIAAVLQH VNSNYDIDLF RTLIQAVAKV TGATDLSNKS LRVIADHIRS CAFLIADGVM
PSNENRGYVL RRIIRRAVRH GNMLGAKETF FYKLVGPLID VMGSAGEDLK RQQAQVEQVL
KTEEEQFART LERGLALLDE ELAKLSGDTL DGETAFRLYD TYGFPVDLTA DVCRERNIKV
DEAGFEAAME EQRRRAREAS GFGADYNAMI RVDSASEFKG YDHLELNGKV TALFVDGKAV
DAINAGQEAV VVLDQTPFYA ESGGQVGDKG ELKGANFSFV VEDTQKYGQA IGHIGKLAAG
SLKVGDAVQA DVDEARRARI RLNHSATHLM HAALRQVLGT HVSQKGSLVN DKVLRFDFSH
NEAMKPEEIR AVEDLVNAQI RRNLPIETNI MDLEAAKAKG AMALFGEKYD ERVRVLSMGD
FSTELCGGTH ASRTGDIGLF RIISESGTAA GVRRIEAVTG EGAITTVHAD SDRLSEVAHL
LKGDSNNLAD KVRSVLERTR QLEKELQQLK EQAAAQESAN LSSKAIDVNG VKLLVSELSG
VEPKMLRTMV DDLKNQLGST IIVLATVAEG KVSLIAGVSK DVTDRVKAGE LIGMVAQQVG
GKGGGRPDMA QAGGTDAAAL PAALASVKGW VSAKLQ
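
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch, not the database's own procedure: it assumes the standard NCBI amino-acid string in TCAG codon order (which table 11 shares with the standard code for elongation) and does not handle alternative start codons. `translate` is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCAAGA GCACCGCTGA GATCCGTCAG"))  # MSKSTAEIRQ
```

The result matches the first ten residues of the protein sequence above; the last four codons of the gene, `AAA TTG CAA TAA`, translate to `KLQ` followed by a stop.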