Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0719 |
Symbol | alaS |
ID | 7400192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 732162 |
End bp | 734942 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643707785 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_002565391 |
Protein GI | 222479154 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.507157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.398718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC TCGAAGCGGA GTACCGCCTC GATTACTTCG AGGAGGAGGG GTTTGAGCGG AAGGAGTGTC CCTCCTGTGG CGCGCACTTC TGGACCCGCG ACGCCGACCG CGAACTGTGC GGCGAGCCAC CCTGTGAGGA CTACAGTTTC ATCGACGACC CGGGGTTCCC GGAGGCTTAC TCACTGTCCG AGATGCGAGA GGCGTTCCTC TCATTCTTCG AGGAGCATGG CCACGAGCGA ATCGACCCGT ATCCGGTCGC CGCGAACCGC TGGCGCGACG ACGTACTCCT GACGCAGGCG TCGATTTACG ACTTCCAGCC GCTCGTCACG TCGGGACAGA CCCCGCCGCC CGCGAACCCC CTCACCATCT CACAGCCCTG TATCCGGATG CAGGACATCG ACAACGTGGG GAAGACGGGC CGCCACACGA TGGCCTTCGA GATGATGGCC CACCACGCGT TCAACACGCG CGAGGAGGTC GCCGAGGACG AGTACGCCTA CCACGGCGAG GTGTACTGGA AAGACGAGAC TGTTGAGTAT TGCGACAAGC TGTTCGAGAG CTTGGGCGCG GATCTAGAGG ATATCACCTA CATCGAGGAC CCGTGGGTCG GCGGCGGGAA CGCCGGCCCC GCCATCGAGG TCATTTATAA GGGAGCCGAG CTGGCGACGC TCGTCTTCAT GTGCATGGAG CGGGACCCCG ACGGCGACTA CGAGATGAAG GACGGGCACA CGTACTCCTT TATGGACACG TACATCGTCG ACACCGGGTA CGGGCTCGAA CGGTGGACGT GGATGAGTCA GGGGACGGCG ACGGTGTACG AGGCCGTCTA CCCCGACGCG ATCGACTTCC TCAAGGAGAA CGCGGGGATT GAACACACGG AGGCAGAACA GAAGCTCGTC CACCGGGCGG CGAAGCTCTC CGGCCGACTC GACATCGACG ATGTCGACGA TGTGGAGGCC GCCCGCGGCG AGATCGCCGA TCGGCTCGAC GTCGATGTCA ACCGGCTCCG TGAGCTGGTC GAACCGCTCG AATCGATCTA CGCGATTGCC GACCACTCGC GGACGCTCGC GTACATGTTC GGCGACGGGA TTGTCCCCTC GAACGTCGGG ACGGGCTACC TCGCCCGGAT GGTCCTGCGG CGCACGAAGC GGCTCGTCGA CGAGATCGGG ATCGACGCCC CGCTCGACGA GCTGGTCGAC ATGCAGGCCG AGCGGCTCGG CTACGAGAAC CGCGACACGA TCCGCGAGAT CGTCCGCACC GAGGAGCGGA AGTACCGCAA GACGCTCGAA CGCGGCTCCC GAAAGGTCGA ATCGCTCGCG GACGAGTACG CCGGCACGGA CGAACCGATC CCAACGGAGG TGCTCTTGGA GCTGTACGAC TCCCACGGGA TTCAGCCGGA CATGGTTGCC GACATCGCCG CCGAGCGAGG CGCGACCGTC GACGTGCCGG ACGACTTCTA CGCGCTCGTC GCCGACCGGC ACGAGGAGGC GGACGGCGAC GAGGCGGCAG CCGAGCGCGA CGACCGCTTC GACGATCTCC CCGAGACGGA GAAGCTGTTC TACGACGACC AAGGACGCAC CGAGTTCGAG GCGGTCGTCC TCGACGTGTT TGAGCTCGAG GAGGGATACG ACGTGGTCTT AGACCAGACG ATGTTCTATC CCGAGGGCGG CGGCCAGCCG GCCGACAGGG GCCAGCTCAC CGCCGGCGAG ACGACCGTCG ACGTGGTCGA CGTACAGGAG CGCAACGGCG TCGTGCTCCA TCGCACCGAT GCCGACCCCG GGAAGGGAGA GTTCGTCCGC GGGCAGGTCG ACGGCGACCG CCGTGACCGG CTCCGCGCGC ACCACACCGC GACCCACCTG ATCGGCCACG CGGCTCGCGA GGTGCTCGGC AATCACGTTC GACAGGCGGG CGCACAGAAA GGAATCGACT CCTCCCGGCT CGACATCCGC CACTTCGAGC GGATCACTCG CGAACAAGTC AAAGAGATCG AGCGCGTCGC AAACGCGCTC GTCCGCGACG ACGTGCCGGT GCGACAGGAG TGGCCCGACC GCAACGAGGC GGAGTCCGAA CACGGCTTCG ACCTCTATCA GGGCGGCGTC CCGCCGGGAA CGAACATCCG CCTGATCCAC GTCGGCGACG AGGACGTGCA GGCCTGTGCC GGCACCCACG TCGAGCGCAC CGGCGAGATC GGCGCGGTGA AAGTGCTGAA GACGGAGCCC GTCCAAGACG GCGTCGAACG GATCGTGTTC GCGGCGGGCG GTGCCGCCGT CGAGGCGACC CAGCGCACGG AAGACGCGCT GTACGACGCG GCCAGAGCCC TCGACGTCGA CCCGCTCGAC GTGCCCGAAA CGGCCGAGCG GTTCTTCGAG GAGTGGAAGG GGCGCGGCAA AGAGATCGAG TCGCTCAAAG AGGAGCTTGC GGCGGCGCGT GCGTCCGGCG GCGCCGACGC AGAGGAGGTC GAGATCGGCG GCGTGACCGC GGTGATACAG CGGCTCGACG GCGACGCGGA CGAGCTGCGC GCGACCGCGA ACGCTCACGT CGACGACGGG AAGGTGGCGG TCGTCGGCAG CGGGGCAGAC GGCTCCGCGA GCTTCGTCGT CGGTGTCCCC GATGGTGTCG ACGTGAACGC CGGACAGGTG GTCTCGGAGC TCGCCGCCCG CGTCGGCGGC GGCGGCGGTG GCCCGCCGGA CTTCGCGCAG GGGGGCGGCC CGGACGCGGA CGCGCTCGAC GACGCGCTCG ACGCGGCACC GGACGTTCTT CGCAGCCTTC AGGAGGCCTG A
|
Protein sequence | MSDLEAEYRL DYFEEEGFER KECPSCGAHF WTRDADRELC GEPPCEDYSF IDDPGFPEAY SLSEMREAFL SFFEEHGHER IDPYPVAANR WRDDVLLTQA SIYDFQPLVT SGQTPPPANP LTISQPCIRM QDIDNVGKTG RHTMAFEMMA HHAFNTREEV AEDEYAYHGE VYWKDETVEY CDKLFESLGA DLEDITYIED PWVGGGNAGP AIEVIYKGAE LATLVFMCME RDPDGDYEMK DGHTYSFMDT YIVDTGYGLE RWTWMSQGTA TVYEAVYPDA IDFLKENAGI EHTEAEQKLV HRAAKLSGRL DIDDVDDVEA ARGEIADRLD VDVNRLRELV EPLESIYAIA DHSRTLAYMF GDGIVPSNVG TGYLARMVLR RTKRLVDEIG IDAPLDELVD MQAERLGYEN RDTIREIVRT EERKYRKTLE RGSRKVESLA DEYAGTDEPI PTEVLLELYD SHGIQPDMVA DIAAERGATV DVPDDFYALV ADRHEEADGD EAAAERDDRF DDLPETEKLF YDDQGRTEFE AVVLDVFELE EGYDVVLDQT MFYPEGGGQP ADRGQLTAGE TTVDVVDVQE RNGVVLHRTD ADPGKGEFVR GQVDGDRRDR LRAHHTATHL IGHAAREVLG NHVRQAGAQK GIDSSRLDIR HFERITREQV KEIERVANAL VRDDVPVRQE WPDRNEAESE HGFDLYQGGV PPGTNIRLIH VGDEDVQACA GTHVERTGEI GAVKVLKTEP VQDGVERIVF AAGGAAVEAT QRTEDALYDA ARALDVDPLD VPETAERFFE EWKGRGKEIE SLKEELAAAR ASGGADAEEV EIGGVTAVIQ RLDGDADELR ATANAHVDDG KVAVVGSGAD GSASFVVGVP DGVDVNAGQV VSELAARVGG GGGGPPDFAQ GGGPDADALD DALDAAPDVL RSLQEA
|
| |