Gene EcSMS35_2820 details


Gene Information

Locus tag: EcSMS35_2820
Symbol: alaS
ID: 6145390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2894605
End bp: 2897235
Gene Length: 2631 bp
Protein Length: 876 aa
Translation table: 11
GC content: 53%
IMG OID: 641617689
Product: alanyl-tRNA synthetase
Protein accession: YP_001744844
Protein GI: 170682603
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0013] Alanyl-tRNA synthetase
TIGRFAM ID: [TIGR00344] alanine--tRNA ligase
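
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information record.
start_bp = 2894605
end_bp = 2897235
reported_gene_length = 2631    # bp
reported_protein_length = 876  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length matches record:", gene_length == reported_gene_length)
print("protein length matches record:", protein_length == reported_protein_length)
print("length is a whole number of codons:", gene_length % 3 == 0)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.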


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000729274
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00002729
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAGCAAGA GCACCGCTGA GATCCGTCAG GCGTTTCTCG ACTTTTTCCA TAGTAAGGGA 
CATCAGGTAG TTGCCAGCAG CTCCCTGGTA CCCCATAACG ACCCAACTTT GTTGTTTACC
AACGCCGGGA TGAACCAGTT CAAGGATGTG TTCCTTGGGC TCGACAAGCG TAATTATTCC
CGCGCTACCA CTTCCCAACG CTGCGTGCGT GCGGGTGGTA AACACAACGA CCTGGAAAAC
GTCGGTTACA CCGCGCGTCA CCATACCTTC TTCGAAATGC TGGGCAACTT CAGCTTCGGC
GACTATTTCA AACACGATGC CATTCAGTTT GCATGGGAAC TGCTGACCAG CGAAAAATGG
TTTGCCCTGC CGAAAGAGCG TCTGTGGGTT ACCGTCTATG AAAGCGACGA CGAAGCCTAC
GAAATCTGGG AAAAAGAAGT AGGGATCCCG CGCGAACGTA TTATTCGCAT CGGCGATAAC
AAAGGTGCGC CATACGCATC TGACAACTTC TGGCAGATGG GTGACACTGG TCCGTGCGGC
CCGTGCACCG AAATCTTCTA CGATCACGGC GACCACATTT GGGGTGGCCC TCCGGGAAGT
CCGGAAGAAG ACGGCGACCG CTACATTGAG ATCTGGAACA TCGTCTTCAT GCAGTTCAAC
CGCCAGGCCG ATGGCACGAT GGAACCGCTG CCGAAGCCGT CTGTAGATAC CGGTATGGGT
CTGGAGCGTA TTGCTGCGGT GCTGCAACAC GTTAACTCTA ACTATGACAT CGACTTGTTC
CGCACGCTGA TCCAGGCGGT AGCGAAAGTC ACTGGCGCAA CCGATCTGAG CAATAAATCG
CTGCGCGTAA TCGCTGACCA CATTCGTTCT TGTGCGTTCC TGATCGCGGA TGGCGTAATG
CCGTCCAACG AAAACCGTGG TTATGTACTG CGTCGTATCA TTCGTCGCGC AGTGCGTCAC
GGTAATATGC TCGGCGCGAA AGAAACCTTT TTCTACAAAC TGGTTGGTCC GCTGATCGAC
GTTATGGGCT CTGCGGGTGA AGACCTGAAA CGCCAGCAGG CGCAGGTTGA GCAGGTGCTG
AAGACTGAAG AAGAGCAGTT TGCTCGTACT CTGGAGCGCG GTCTGGCGTT GCTGGATGAA
GAGCTGGCAA AACTTTCTGG TGATACGCTG GATGGTGAAA CTGCTTTCCG TCTGTACGAC
ACCTATGGCT TCCCGGTTGA CCTGACGGCT GATGTTTGTC GTGAGCGCAA CATCAAAGTT
GACGAAGCTG GATTTGAAGC AGCAATGGAA GAGCAGCGTC GTCGTGCGCG CGAAGCCAGC
GGCTTTGGTG CCGATTACAA CGCAATGATC CGTGTTGACA GTGCATCTGA ATTTAAAGGC
TATGACCATC TGGAACTGAA CGGCAAAGTG ACCGCGCTGT TTGTTGATGG TAAAGCGGTT
GATGCCATCA ATGCAGGCCA GGAAGCTGTG GTCGTGCTGG ATCAAACGCC ATTCTATGCG
GAATCCGGCG GTCAGGTTGG TGATAAAGGC GAACTGAAAG GCGCTAACTT CTCCTTTGCG
GTGGAAGATA CTCAGAAATA CGGCCAGGCG ATTGGTCACA TCGGTAAACT TGCTACGGGT
TCTCTGAAAG TGGGCGACGC GGTGCAGGCT GATGTTGATG AGGCTCGTCG CGCCCGTATT
CGTCTGAATC ACTCCGCAAC GCACCTGATG CACGCTGCGC TGCGCCAGGT TCTGGGTACT
CATGTATCGC AGAAAGGTTC ACTGGTTAAC GACAAAGTGC TGCGCTTCGA CTTCTCACAC
AACGAAGCGA TGAAACCAGA AGAGATTCGT GCGGTCGAAG ACCTGGTGAA CGCACAGATT
CGCCGTAACT TGCCGATCGA AACCAACATC ATGGATCTCG AAGCGGCGAA AGCGAAAGGT
GCGATGGCGC TGTTTGGCGA GAAGTATGAT GAGCGCGTAC GCGTGCTGAG CATGGGTGAT
TTCTCCACCG AGTTGTGTGG CGGTACTCAC GCCAGCCGCA CTGGTGATAT TGGTCTGTTC
CGCATCATCT CTGAATCGGG TACTGCTGCA GGCGTTCGTC GTATCGAAGC GGTAACCGGA
GAAGGCGCTA TCGCCACCGT TCATGCAGAC AGTGATCGCT TAAGCGAAGT CGCGCATCTG
CTGAAAGGCG ATAGCAATAA TCTGGCTGAT AAAGTGCGTT CAGTACTGGA ACGTACGCGT
CAGTTGGAAA AAGAGTTACA ACAGCTTAAA GAACAAGCTG CCGCACAGGA GAGCGCAAAT
CTTTCCAGTA AGGCAATTGA TGTTAATGGT GTTAAGCTGT TGGTTAGCGA GCTTAGCGGT
GTTGAGCCGA AAATGTTGCG TACCATGGTT GACGATTTAA AAAATCAGCT GGGGTCGACA
ATTATCGTGC TGGCAACGGT AGCCGAAGGT AAGGTTTCTC TGATTGCAGG CGTATCTAAG
GACGTCACAG ATCGTGTGAA AGCAGGGGAG CTGATTGGTA TGGTCGCTCA GCAGGTGGGC
GGCAAGGGTG GTGGACGTCC TGACATGGCG CAAGCCGGTG GTACGGATGC TGCGGCCTTA
CCTGCAGCGT TAGCCAGTGT GAAAGGCTGG GTCAGCGCGA AATTGCAATA A
 
Protein sequence
MSKSTAEIRQ AFLDFFHSKG HQVVASSSLV PHNDPTLLFT NAGMNQFKDV FLGLDKRNYS 
RATTSQRCVR AGGKHNDLEN VGYTARHHTF FEMLGNFSFG DYFKHDAIQF AWELLTSEKW
FALPKERLWV TVYESDDEAY EIWEKEVGIP RERIIRIGDN KGAPYASDNF WQMGDTGPCG
PCTEIFYDHG DHIWGGPPGS PEEDGDRYIE IWNIVFMQFN RQADGTMEPL PKPSVDTGMG
LERIAAVLQH VNSNYDIDLF RTLIQAVAKV TGATDLSNKS LRVIADHIRS CAFLIADGVM
PSNENRGYVL RRIIRRAVRH GNMLGAKETF FYKLVGPLID VMGSAGEDLK RQQAQVEQVL
KTEEEQFART LERGLALLDE ELAKLSGDTL DGETAFRLYD TYGFPVDLTA DVCRERNIKV
DEAGFEAAME EQRRRAREAS GFGADYNAMI RVDSASEFKG YDHLELNGKV TALFVDGKAV
DAINAGQEAV VVLDQTPFYA ESGGQVGDKG ELKGANFSFA VEDTQKYGQA IGHIGKLATG
SLKVGDAVQA DVDEARRARI RLNHSATHLM HAALRQVLGT HVSQKGSLVN DKVLRFDFSH
NEAMKPEEIR AVEDLVNAQI RRNLPIETNI MDLEAAKAKG AMALFGEKYD ERVRVLSMGD
FSTELCGGTH ASRTGDIGLF RIISESGTAA GVRRIEAVTG EGAIATVHAD SDRLSEVAHL
LKGDSNNLAD KVRSVLERTR QLEKELQQLK EQAAAQESAN LSSKAIDVNG VKLLVSELSG
VEPKMLRTMV DDLKNQLGST IIVLATVAEG KVSLIAGVSK DVTDRVKAGE LIGMVAQQVG
GKGGGRPDMA QAGGTDAAAL PAALASVKGW VSAKLQ
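
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration only, not the method the database used. It translates the first row of the gene sequence (60 bases) and compares the result with the first 20 residues of the protein sequence. It relies on the fact that translation table 11 uses the standard codon assignments for elongation; the special handling of alternative start codons in table 11 is not implemented, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
# Translation table 11 shares these assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence and first 20 residues of the protein sequence,
# copied from the record.
gene_first_row = (
    "ATGAGCAAGA GCACCGCTGA GATCCGTCAG GCGTTTCTCG ACTTTTTCCA TAGTAAGGGA"
)
protein_first_20 = "MSKSTAEIRQ AFLDFFHSKG".replace(" ", "")

translated = translate(gene_first_row)
print("translation matches protein start:", translated == protein_first_20)
```

The same function can be applied to the full gene sequence; only the first row is used here to keep the example short.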