Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3523 |
Symbol | alaS |
ID | 7092380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3868028 |
End bp | 3870715 |
Gene Length | 2688 bp |
Protein Length | 895 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643466814 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_002363774 |
Protein GI | 217979627 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0021732 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACGGCG TTAACGAAAT ACGGGCGGGT TTCCTCGATT ACTTCAAGAA GAACGGCCAT GAGATCGTGG CGTCTTCGCC GCTCGTGCCG CGCAATGACC CCACCCTAAT GTTCACCAAC GCCGGGATGG TCCAGTTCAA GAATGTCTTC ACCGGCCTCG AAAAGCGCCC CTATACGCGC GCCGCGAGCG CGCAGAAATG CGTCCGCGCC GGCGGCAAGC ACAATGATCT CGACAATGTC GGCTATACTG CAAGGCACCA CACCTTCTTC GAGATGCTCG GCAATTTCTC CTTCGGCGAT TACTTCAAGC CGCTGGCGAT CGAGCTGGCC TGGAAGCTGA TCACGCAGGA ATTCGGTCTG CCGAAAGATC GTCTCCTCGT CACCGTCTAT CATACCGACG ACGAAGCCTT TGATTTATGG AAGAAAATCG CCGGCTTTCC CGATTCGAAG ATCATCCGCA TCGCCTCTTC CGATAATTTC TGGGCGATGG GCGACACCGG CCCCTGCGGC CCCTGCTCCG AAATTTTTTA TGACCAGGGC GAAAGGCTGA AGGGCGGCCC GCCCGGCAGC CCCGACGAGG ATGGCGACCG GTTCCTCGAA TTCTGGAACC TCGTCTTCAT GCAGTTCGAA CAGCTCGGGC CAGGCGAACG GATCGATCTG CCGCGCCCCT CGATCGATAC CGGCATGGGG CTCGAGCGCA TCGCCGCGCT GCTGCAGGGC GTCACGTCGA ATTATGACAT CGACCTCATG CGCGCGCTGA TCGTGGCGAT CGCCGCGGCG ACCGGCGTCG ATCCCGACGG GCCGCAGAAG GCCAGCCATC GCGTCATCGC CGACCATTTG CGCGCCTGCG CCTTCCTCAT TGCCGATGGA GTGACCCCCT CCAACGAAGG CCGCGGTTAT GTGCTGCGCC GCATCATGCG CCGCGCCATG CGCCACGCGC AGCTGCTTAG CGCGCGCGAG CCATTGATGT GGCGGCTCGT CCCCGTGCTC TCGCGCGAGA TGGGGCAGGC CTATCCCGAG CTGATCCGCG CCGAGTCGCT GATCGTCGAG ACCCTGCGGC TCGAAGAGGC CCGCTTCATC GACACGCTGG CGCGCGGCCT CTCGATCCTC GACGAAGCCG TGCGCGACTT ACCCGAGGGC GCCCCCCTGC CGGGGCAGGT GGCGTTCAAG CTCTATGATA CCTATGGTTT TCCGCTCGAC CTGACGCAGG ACGCGCTGCG GGCGCGTTCG CTCGCCGTAG ACGTGGACGG CTTCAATACC GCGATGGAGC GTCAGCGCGC CGACGCGCGC AGGGCTTGGG CCGGCTCCGG CGAGGCTGCG ACCGAAACCC TCTGGTTCGC GATCAAGGAC GAGGCGGGCG CGACAGATTT CCTCGGCTAT GAAGCCGAAC AAGCGGAAGG CGTCGTCGCC GCGCTCGTCA AGGACGGCAA GCCGGTCGAA CGCCTCAACA AGGGCGAGAC CGGAGCCATT ATCCTCAACC AAACGCCGTT CTATGCCGAA TCAGGGGGAC AGGTCGGCGA CACGGGCGTG ATGAGCGCCG CCGGGGTTCG CTTCCGCGTC ACAGACACCA AGAAGAAGCT TGGCGATCTG TTCGTGCATG AGGGAATCGT CGAGGAAGGC GAGATCACGC CCGGCCTCGC GCTCGAACTG TCGGTCGATC ACGCGCGCCG CTCGGCGATC CGGGCCAATC ATTCGGCGAC CCATCTCCTG CATGAATCGC TGCGGCTCGT GCTCGGAGAT CACGTCGCGC AAAAAGGCTC GCTCGTCGCC GACGACCGTC TGCGGTTTGA CTTCACCCAT CTGAAGCCGA TCTCGCCGGA AGAGCTCACG CGCGTCGAGG ACATCGCCAA TCGCGCGGTC ACCGACAATG CGCCCGTCGT CACCCGGCTG ATGGCGGTCG ACGAGGCGAT CGCCTCCGGC GCGCGCGCTT TGTTCGGCGA GAAATATGGC GACGAGGTCC GCGTCGTAAC GATGGGCTAT GCGCAGGACC GCGAGGACAG CGGAGACGGC GGCGCCAACA GGGCCTTTTC CGTCGAGCTC TGCGGCGGCA CCCATGTTGC GCGCACGGGC GACATCGGCC TTATTGCGAT CACGTCGGAA TCGGCCGTCG CCGCCGGCGT GCGGCGCATC GAGGCGAAGA CGGCGGCCGC GGCGCGGCGC CATCTCAACG CGCGCTCCGA CAAGCTTGAG CACATCGCCG TCCTGCTCAA ATCCTCCGAG GACGACGCCG AAAAACGATT GGCCGCTCTC CTGGAGGAGC GGCGCAAGCT CGATCGCGAG CTGGCCGATG CGCGCAAGAA ACTGGCCATG GGCGGCGGCG AAAAAGCCGC GGCCGAGGCG CAAGAAATCG GGGGCGTTAA ATTTTTTGGC CGCGCCGTTT CGGGCGTCGA TCTGAAGGAT CTGAAATCGC TCGCCGACGA GGCCAAACAG ACGGTTGGTT CGGGCGTCGT CGCCATCGCC GGCGTCGATA GCGACGGCAA GGCGGGTGTC GTCGTCGGCG TGACCGCGGA CCTCGTCGAG CGGTTCGATT CTGTCGCGCT GGTTCGCCTC GCCGCCACGG AGCTCGGCGG CAAGGGCGGC GGCGGCCGGC GCGACATGGC GCAGGCGGGC GGCCCGAACG GCGCTGGCGT CGACGCGGCC CTGGCTGCGA TCGCCAACGG ACTGCGCGGC GCTCAAAACG CCGCCTGA
|
Protein sequence | MNGVNEIRAG FLDYFKKNGH EIVASSPLVP RNDPTLMFTN AGMVQFKNVF TGLEKRPYTR AASAQKCVRA GGKHNDLDNV GYTARHHTFF EMLGNFSFGD YFKPLAIELA WKLITQEFGL PKDRLLVTVY HTDDEAFDLW KKIAGFPDSK IIRIASSDNF WAMGDTGPCG PCSEIFYDQG ERLKGGPPGS PDEDGDRFLE FWNLVFMQFE QLGPGERIDL PRPSIDTGMG LERIAALLQG VTSNYDIDLM RALIVAIAAA TGVDPDGPQK ASHRVIADHL RACAFLIADG VTPSNEGRGY VLRRIMRRAM RHAQLLSARE PLMWRLVPVL SREMGQAYPE LIRAESLIVE TLRLEEARFI DTLARGLSIL DEAVRDLPEG APLPGQVAFK LYDTYGFPLD LTQDALRARS LAVDVDGFNT AMERQRADAR RAWAGSGEAA TETLWFAIKD EAGATDFLGY EAEQAEGVVA ALVKDGKPVE RLNKGETGAI ILNQTPFYAE SGGQVGDTGV MSAAGVRFRV TDTKKKLGDL FVHEGIVEEG EITPGLALEL SVDHARRSAI RANHSATHLL HESLRLVLGD HVAQKGSLVA DDRLRFDFTH LKPISPEELT RVEDIANRAV TDNAPVVTRL MAVDEAIASG ARALFGEKYG DEVRVVTMGY AQDREDSGDG GANRAFSVEL CGGTHVARTG DIGLIAITSE SAVAAGVRRI EAKTAAAARR HLNARSDKLE HIAVLLKSSE DDAEKRLAAL LEERRKLDRE LADARKKLAM GGGEKAAAEA QEIGGVKFFG RAVSGVDLKD LKSLADEAKQ TVGSGVVAIA GVDSDGKAGV VVGVTADLVE RFDSVALVRL AATELGGKGG GGRRDMAQAG GPNGAGVDAA LAAIANGLRG AQNAA
|
| |