Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2820 |
Symbol | alaS |
ID | 6145390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2894605 |
End bp | 2897235 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617689 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_001744844 |
Protein GI | 170682603 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000729274 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00002729 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAAGA GCACCGCTGA GATCCGTCAG GCGTTTCTCG ACTTTTTCCA TAGTAAGGGA CATCAGGTAG TTGCCAGCAG CTCCCTGGTA CCCCATAACG ACCCAACTTT GTTGTTTACC AACGCCGGGA TGAACCAGTT CAAGGATGTG TTCCTTGGGC TCGACAAGCG TAATTATTCC CGCGCTACCA CTTCCCAACG CTGCGTGCGT GCGGGTGGTA AACACAACGA CCTGGAAAAC GTCGGTTACA CCGCGCGTCA CCATACCTTC TTCGAAATGC TGGGCAACTT CAGCTTCGGC GACTATTTCA AACACGATGC CATTCAGTTT GCATGGGAAC TGCTGACCAG CGAAAAATGG TTTGCCCTGC CGAAAGAGCG TCTGTGGGTT ACCGTCTATG AAAGCGACGA CGAAGCCTAC GAAATCTGGG AAAAAGAAGT AGGGATCCCG CGCGAACGTA TTATTCGCAT CGGCGATAAC AAAGGTGCGC CATACGCATC TGACAACTTC TGGCAGATGG GTGACACTGG TCCGTGCGGC CCGTGCACCG AAATCTTCTA CGATCACGGC GACCACATTT GGGGTGGCCC TCCGGGAAGT CCGGAAGAAG ACGGCGACCG CTACATTGAG ATCTGGAACA TCGTCTTCAT GCAGTTCAAC CGCCAGGCCG ATGGCACGAT GGAACCGCTG CCGAAGCCGT CTGTAGATAC CGGTATGGGT CTGGAGCGTA TTGCTGCGGT GCTGCAACAC GTTAACTCTA ACTATGACAT CGACTTGTTC CGCACGCTGA TCCAGGCGGT AGCGAAAGTC ACTGGCGCAA CCGATCTGAG CAATAAATCG CTGCGCGTAA TCGCTGACCA CATTCGTTCT TGTGCGTTCC TGATCGCGGA TGGCGTAATG CCGTCCAACG AAAACCGTGG TTATGTACTG CGTCGTATCA TTCGTCGCGC AGTGCGTCAC GGTAATATGC TCGGCGCGAA AGAAACCTTT TTCTACAAAC TGGTTGGTCC GCTGATCGAC GTTATGGGCT CTGCGGGTGA AGACCTGAAA CGCCAGCAGG CGCAGGTTGA GCAGGTGCTG AAGACTGAAG AAGAGCAGTT TGCTCGTACT CTGGAGCGCG GTCTGGCGTT GCTGGATGAA GAGCTGGCAA AACTTTCTGG TGATACGCTG GATGGTGAAA CTGCTTTCCG TCTGTACGAC ACCTATGGCT TCCCGGTTGA CCTGACGGCT GATGTTTGTC GTGAGCGCAA CATCAAAGTT GACGAAGCTG GATTTGAAGC AGCAATGGAA GAGCAGCGTC GTCGTGCGCG CGAAGCCAGC GGCTTTGGTG CCGATTACAA CGCAATGATC CGTGTTGACA GTGCATCTGA ATTTAAAGGC TATGACCATC TGGAACTGAA CGGCAAAGTG ACCGCGCTGT TTGTTGATGG TAAAGCGGTT GATGCCATCA ATGCAGGCCA GGAAGCTGTG GTCGTGCTGG ATCAAACGCC ATTCTATGCG GAATCCGGCG GTCAGGTTGG TGATAAAGGC GAACTGAAAG GCGCTAACTT CTCCTTTGCG GTGGAAGATA CTCAGAAATA CGGCCAGGCG ATTGGTCACA TCGGTAAACT TGCTACGGGT TCTCTGAAAG TGGGCGACGC GGTGCAGGCT GATGTTGATG AGGCTCGTCG CGCCCGTATT CGTCTGAATC ACTCCGCAAC GCACCTGATG CACGCTGCGC TGCGCCAGGT TCTGGGTACT CATGTATCGC AGAAAGGTTC ACTGGTTAAC GACAAAGTGC TGCGCTTCGA CTTCTCACAC AACGAAGCGA TGAAACCAGA AGAGATTCGT GCGGTCGAAG ACCTGGTGAA CGCACAGATT CGCCGTAACT TGCCGATCGA AACCAACATC ATGGATCTCG AAGCGGCGAA AGCGAAAGGT GCGATGGCGC TGTTTGGCGA GAAGTATGAT GAGCGCGTAC GCGTGCTGAG CATGGGTGAT TTCTCCACCG AGTTGTGTGG CGGTACTCAC GCCAGCCGCA CTGGTGATAT TGGTCTGTTC CGCATCATCT CTGAATCGGG TACTGCTGCA GGCGTTCGTC GTATCGAAGC GGTAACCGGA GAAGGCGCTA TCGCCACCGT TCATGCAGAC AGTGATCGCT TAAGCGAAGT CGCGCATCTG CTGAAAGGCG ATAGCAATAA TCTGGCTGAT AAAGTGCGTT CAGTACTGGA ACGTACGCGT CAGTTGGAAA AAGAGTTACA ACAGCTTAAA GAACAAGCTG CCGCACAGGA GAGCGCAAAT CTTTCCAGTA AGGCAATTGA TGTTAATGGT GTTAAGCTGT TGGTTAGCGA GCTTAGCGGT GTTGAGCCGA AAATGTTGCG TACCATGGTT GACGATTTAA AAAATCAGCT GGGGTCGACA ATTATCGTGC TGGCAACGGT AGCCGAAGGT AAGGTTTCTC TGATTGCAGG CGTATCTAAG GACGTCACAG ATCGTGTGAA AGCAGGGGAG CTGATTGGTA TGGTCGCTCA GCAGGTGGGC GGCAAGGGTG GTGGACGTCC TGACATGGCG CAAGCCGGTG GTACGGATGC TGCGGCCTTA CCTGCAGCGT TAGCCAGTGT GAAAGGCTGG GTCAGCGCGA AATTGCAATA A
|
Protein sequence | MSKSTAEIRQ AFLDFFHSKG HQVVASSSLV PHNDPTLLFT NAGMNQFKDV FLGLDKRNYS RATTSQRCVR AGGKHNDLEN VGYTARHHTF FEMLGNFSFG DYFKHDAIQF AWELLTSEKW FALPKERLWV TVYESDDEAY EIWEKEVGIP RERIIRIGDN KGAPYASDNF WQMGDTGPCG PCTEIFYDHG DHIWGGPPGS PEEDGDRYIE IWNIVFMQFN RQADGTMEPL PKPSVDTGMG LERIAAVLQH VNSNYDIDLF RTLIQAVAKV TGATDLSNKS LRVIADHIRS CAFLIADGVM PSNENRGYVL RRIIRRAVRH GNMLGAKETF FYKLVGPLID VMGSAGEDLK RQQAQVEQVL KTEEEQFART LERGLALLDE ELAKLSGDTL DGETAFRLYD TYGFPVDLTA DVCRERNIKV DEAGFEAAME EQRRRAREAS GFGADYNAMI RVDSASEFKG YDHLELNGKV TALFVDGKAV DAINAGQEAV VVLDQTPFYA ESGGQVGDKG ELKGANFSFA VEDTQKYGQA IGHIGKLATG SLKVGDAVQA DVDEARRARI RLNHSATHLM HAALRQVLGT HVSQKGSLVN DKVLRFDFSH NEAMKPEEIR AVEDLVNAQI RRNLPIETNI MDLEAAKAKG AMALFGEKYD ERVRVLSMGD FSTELCGGTH ASRTGDIGLF RIISESGTAA GVRRIEAVTG EGAIATVHAD SDRLSEVAHL LKGDSNNLAD KVRSVLERTR QLEKELQQLK EQAAAQESAN LSSKAIDVNG VKLLVSELSG VEPKMLRTMV DDLKNQLGST IIVLATVAEG KVSLIAGVSK DVTDRVKAGE LIGMVAQQVG GKGGGRPDMA QAGGTDAAAL PAALASVKGW VSAKLQ
|
| |