Gene Information |
Locus tag | Hmuk_1124 |
Symbol | alaS |
ID | 8410643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 1073596 |
End bp | 1076370 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645019459 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_003176957 |
Protein GI | 257387184 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
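The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon. Both are assumptions about how this record is encoded, not statements taken from the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length = gene_length_bp(1073596, 1076370)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values match the reported Gene Length (2775 bp) and Protein Length (924 aa).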
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.118182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACC TCGCCGACGA ATATCGACTC GACTACTTCG AGAAGGAGGG GTTCGAGCGG CTGACGTGTG GGACGTGTGG CGATCACTTC TGGACGCGCG ACGCCTCCCG CGAGACCTGC GGTGAGCCGC CCTGTGCCGA GTACGGCTTC ATCGACGACC CCGGCTTCGA GGAGACGCTC TCGCTCGAAG AGATGCGCGA GACGTTTCTC TCCTTCTTCG AAGAGCGCGA CCACGGGCGG GTCGACCCCT ATCCCGTCGC CGCCAACCGC TGGCGCGACG ACGTACTGTT GACGCAGGCG TCGATCTACG ACTTCCAGCC CCACGTCACG ACGGGCCAGT CGCCGCCGCC GTCGAACCCG CTCGTGGTCT CCCAGCCCTG CATCCGGATG CAGGACATCG ACAACGTCGG CAAGACCGGG CGTCACACGA TGGCCTTCGA GATGGGCGGC CACCACGCGT TCAACGCCGA CGAGGGGGCG GGCTACGCCT ACGAGGGCGA GGTGTACTGG AAGGAAGAGA CCGTCCAGTA CTGCGTCGAG CTGTTCGAGG AACTCGGCGT CGACAGCAGC GAGCTGACCC TCATCGAGGA CGTGTGGGTG GGCGGCGGCA ACGCCGGGCC GTGTTTCGAG GTGATCTACC AGGGCCTGGA GCTGGCGACG CTCGTGTTCA TGCAGTTCGA GCAGGACCCC GACGGCGACT ACGAGATGAA AGACGGCAAC AGCTACTCGG AGATGGACCG CCGGGTCGTC GACACGGGGT ACGGGATCGA ACGCTGGACC TGGATGAGCC AGGGGACCCC GACGGTCTAC GAGGCGATCT ACCCCGAGAT GATCGAGTTC CTGCGGGACA ACGCCGGGCT CGACTACACC GACGAGGAAG ACGAGATCGT CTTCCGGGCG GCCAAGCTGG CCGGCCACAT GGACATCGAC GAGGCCGAGG ACGTCGAGGC GGCCCGCGAC ACCATGGCCG AGCAGCTGGG CGTCGAGACG GCGCGGCTCG TCGAGCTGAT GGAGCCCCTC GAAGACATCT ACGCCATCGC CGACCACTCC CGGACGCTCG CGTACATGCT CGGCGACGGC ATCGTCCCCT CGAACGTCGG CACGGGCTAT CTCGCGCGGA TGGTGCTGCG TCGCACCAAG CGCCTCGTCG ACACGGTCGG CGTCGACGCG CCCCTGGACG AACTCGTCGA CATGCAGGCC GACCGGCTGG GCTACGACAA CCGCGACACC ATCCGCGACA TCGTCCGGAC CGAAGTCGAA AAGTACCGCG AGACGCTGGA CCGTGGCGGC CGGAAGGTCC GCCAGCTGGC CGACGACTAC GCCGAGAAAG ACGAGCCGAT TCCGGTGCAA GAGCTGATCG AGCTGTACGA CTCCCACGGC ATCCAGCCGG ACATGGTCGC GGAGATCGCC GCCGAGCGCG GCGTCGACGT GGACGTACCC GACAACTTCT ACGGACTCGT CGCCGACCGC CACGGCGGCG GCCAGGCCTT CGAGGAGGAC GCGACGGTCC CACACGAGAA ACGGCTCGAC GAACTCCCAG AGACCGACCG GCTCTACTAC GAGGACCAGC AGCGCATGGA GTTCGAGGCG GTCGTCCTCG AAGTGTTCGA CCGCGAGGAC GGCGACTACG ACGTGGTGCT CGACCAGACG ATGTTCTACC CCGAAGGCGG GGGCCAGCCC CCGGACCACG GGACCATCTC GACCGACGAC GTCACCGGCG AGGTGACCGA CGTACAGGTC CACGGCGGCG TCATCGTCCA CACCTGTGAC GAGGACCCCG GAACGGGAGA GTTCGTCCGC 
GGACAGGTCG ACGCCACCCG TCGCCGTCGA TTGATGCGCC ACCACACCGC GACACACGTC GTCATCCACA GCGCCCGGCA AGTGCTGGGC GAGCACGTCC GCCAGGCCGG CGCACAGAAG GGAACCGACT CCTCGCGGAT CGACATCCGC CACTACGAGT CGATCAGTCG CGAACAGGTC AAGGAGATCG AGCACCGCGC CAACGAGATC GTGATGGACA ACGCCGCCGT CACCCAGGAG TGGCCAGATC GCCACGAGGC CGAGAAAGCG CACGGATTCG ACCTCTACCA GGGCGGGATC CCGACCGGCG AGCAGATCCG CCTGATCCAC GTCGCCGACG ACGTGCAGGC CTGCGGTGGC ACCCACGTCG GCCGGACCGG CGACATCGGG GCGATCAAGC TGCTGAACAC CGAGCGCATC CAGGACGGCG TCATCCGACT CACCTTCGCC GCCGGCGATG CCGCTATCGA CGCCACCCAC GAAATCGAGG ACGCCCTCTA CGAGACCGCC GAGCTCTACG ACGTGGCCAC CGAGGACGTG CCCGACACCG CCGCACGCTT TTTCGACGAG TGGAAAGCGC GGGGCAAGGA GATCGAGGAC CTCAAGGAAC AGCTCGCGGA GGCCCGCGCC AGCGGCGGCG GGGACAGCGA AGAGATCGAG GTCGCCGACA CGACCGCCGT CGTCCAGCGC ATCGACGCCG ACATGGACGA ACTCCGGGCC CAGGCCAACG CCGTCGTCGA CCAGGGCTCG ATCGCCGTAC TCGGCTCCGG TGCCAGCGGT GCGCAGTTCG TCGTCGCCGT TCCCGACGGC GTCGCGGTCA ACGCCGGCGA GGTCGTCGGC GAACTCGCCG GCCGCGTCGG CGGCGGCGGC GGCGGTCCGT CGGACTTCGC ACAGGGTGGC GGTCCCGACG CCGACAAGCT CGACGACGCC CTCGCGGACG CGCCGGAGAT CCTCCGGCAG GTCGCGAACG CGTAA
|
Protein sequence | MSDLADEYRL DYFEKEGFER LTCGTCGDHF WTRDASRETC GEPPCAEYGF IDDPGFEETL SLEEMRETFL SFFEERDHGR VDPYPVAANR WRDDVLLTQA SIYDFQPHVT TGQSPPPSNP LVVSQPCIRM QDIDNVGKTG RHTMAFEMGG HHAFNADEGA GYAYEGEVYW KEETVQYCVE LFEELGVDSS ELTLIEDVWV GGGNAGPCFE VIYQGLELAT LVFMQFEQDP DGDYEMKDGN SYSEMDRRVV DTGYGIERWT WMSQGTPTVY EAIYPEMIEF LRDNAGLDYT DEEDEIVFRA AKLAGHMDID EAEDVEAARD TMAEQLGVET ARLVELMEPL EDIYAIADHS RTLAYMLGDG IVPSNVGTGY LARMVLRRTK RLVDTVGVDA PLDELVDMQA DRLGYDNRDT IRDIVRTEVE KYRETLDRGG RKVRQLADDY AEKDEPIPVQ ELIELYDSHG IQPDMVAEIA AERGVDVDVP DNFYGLVADR HGGGQAFEED ATVPHEKRLD ELPETDRLYY EDQQRMEFEA VVLEVFDRED GDYDVVLDQT MFYPEGGGQP PDHGTISTDD VTGEVTDVQV HGGVIVHTCD EDPGTGEFVR GQVDATRRRR LMRHHTATHV VIHSARQVLG EHVRQAGAQK GTDSSRIDIR HYESISREQV KEIEHRANEI VMDNAAVTQE WPDRHEAEKA HGFDLYQGGI PTGEQIRLIH VADDVQACGG THVGRTGDIG AIKLLNTERI QDGVIRLTFA AGDAAIDATH EIEDALYETA ELYDVATEDV PDTAARFFDE WKARGKEIED LKEQLAEARA SGGGDSEEIE VADTTAVVQR IDADMDELRA QANAVVDQGS IAVLGSGASG AQFVVAVPDG VAVNAGEVVG ELAGRVGGGG GGPSDFAQGG GPDADKLDDA LADAPEILRQ VANA
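The GC content field reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as plain text in the space-separated 10-base blocks used above. The function name is hypothetical, and the sample string is only the first two blocks of the gene sequence, not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence in this record.
sample = "ATGAGCGACC TCGCCGACGA"
print(f"{gc_fraction(sample):.0%}")
```

Passing the complete gene sequence instead of the sample would give a figure to compare against the reported GC content.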