Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1929 |
Symbol | alaS |
ID | 8384221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1953799 |
End bp | 1956579 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644972998 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_003130831 |
Protein GI | 257052998 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATC TCGAGGAGGA ATACCGGCTC GAATACTTCG AGGAGCACGG GTTCCATCGC AAGGAGTGTA TGGAGTGTGG GGATCACTTC TGGACGCTCG ACGGGGCGCG GGAAACCTGC GGTGAACCGC CCTGTGACGA GTACAGCTTT ATCGACGATC CCGGATTCGA CGAGGCATTC GAACTCGGCG AGATGCGCGA ACAGTTTCTC TCCTTTTTCG AGGAGCACGA TCACGAGCGC ATCGACCCCT ATCCCGTCGC GGCCAACCGC TGGCGGGAGG ACGTCCTCCT GACCCAGGCG TCGATCTACG ACTTTCAGCC ACACGTCACT AGCGGCGCGT CGCCGCCGCC GGCGAACCCA CTCGCCGTGA GCCAGCCCTG CATTCGGATG CAGGACATCG ACAACGTCGG CGTCACCGGC CGACACACCA TGGCCTTCGA GATGATGGGC CATCACGCGT TCAACGCTGA CGAGGGGACC GACTACGCCT ACGAGGGCGA GGTCTACTGG AAGGACGAGT GCGTCGAGTA CTGCATGGAC TTACTCGCCG AAGTCGGTGC CGACCGCGAG GCTGTGACGC TGATAGAGGA CCCCTGGGTC GGCGGCGGCA ACGCCGGACC CGCCTTCGAG GTGATCTACG AGGGCGCGGA ACTCGCGACG CTCGTGTTCA TGCAGTTCGA GCGCGACCCG GACGGCGATT TCGAGATGAA GGACGGCAAC ACCTACAGCC GGATGGATCG CCGGGTCATC GACACCGGCT ACGGGCTCGA ACGCTGGACC TGGATGTCCC AGGGCACGCC GACGGTCTAC GAGGCCGTTT ACCCCGACAT GATCGACTTC CTGACGGACA ATGCGGGGAT CGATCATACT GCCGACGAGG AGGAGATCGT CCACAAAGCG GCGAAGCTCT CCGGTCGGAT GGATATCGAC GAGGCCGAGG ACGTCGAGGC TGCCCGCGAC GACATCGCCG CGGAGATCGG CGTCGCGACC GACCGCCTCC GCGAACTCGT CGAACCCCTC GAATCCATCT ACGCGATCGC CGACCACAGC CGGACACTCG CGTACATGCT CGGCGACGGG ATCGTACCGA GTAACGTCGG CACGGGCTAT CTCGCGCGGA TGGTATTGCG GCGGACGAAA CGCCTGGTCG ACGAGGTCGG CGTCGACGCC CCGTTGGACG AACTCGTCGA CATGCAGGCC GAACGGCTTG GGTACGACAA CCGCGATACG ATCCGGGACA TCGTCCGCAC GGAGGTCGAG AAGTATCGCG AGACGCTCGA CCGCGGCCAT CGACGCGTCG AGCAGTTGGC CGAGGAGTAC GCCGACTCCG GCGACCCGAT TCCCGCCGAC GAACTTGTCG AACTCTATGA CAGCCACGGG ATGCAGCCCG AGACAGTCGA ACAGATCGCC GCCGATCGTG GCGTCGCTGT CGAGACGCCC GAGGACTTCT ATTCACTGGT TGCCGAGCGC CACGAGGACG CCGAACGCGC CGGGTCGACC GAGGAAAGCG ACACGGACGA GCGTCTCGCC GACCTCCCCG AGACCGAGAA ACTCTACTAC GACGACCAGA CTCGCACCGA GTTCGAGGCG GTCGTTCTCG ACGTCTTCGA GCGTGCGGAC GGGTACGACG TGGTGCTCGA TCAGACGATG TTCTACCCGG AAGGCGGGGG CCAGCCCCCG GACCAGGGGA CGCTCTCGAC GGACGACGAG ACGGTCGAGG TGACCGACGT CCAGATCCAG GACGGTGTCA TCTTACACCG GACAGATGCC GACCCGGGGA CGGGCGACTT CGTTCGCGGT CAGGTCGATG GACGGCGGCG ACGACGGCTG ATGGCCCATC ACACGGCCAC CCACATCGTC GTTCACGCCG CCCGGCAGGT TCTCGGCGAC CACATCCGCC AGGCCGGCGC ACAGAAGGGA ACTGACTCGG CACGCATCGA CGTCCGTCAC TACGAACCGG TCTCTCGCGC GGCGGTCAAG GAGATCGAGC GGGTCGCCAA CGATATCGTC ACCGACAACG CCCAGGTCAC CCAGGAGTGG CCAGATCGCC ACGAGGCCGA AGCCGAGTAC GGGTTCGATC TCTACCAGGG CGGGATCCCG GCCGGCGAGC AGATCCGCCT GATCCACGTC GAGGACGACG TCCAGGCCTG TGGCGGGACC CACGTCTCGC GAACGGGCGA AGTCGGGGCG ATCAAAATCC GCTCGACGGA ACGCATCCAG GACGGCGTCG TCAGGCTGAC GTTCGCCGCC GGCGAGGCCG CCATCGAAGC GACCCAGGCG ACCGAGGACG CCCTCTACGA GGCGGCCGAA GCGTTCGACG TGGATCCGCA GGACGTCCCG GAGACGGCCG AACGCTTCTT CGCGGAGTGG AAAGAACGCG GCAAGGAGAT CGAAGCGCTC CGGGCTGAAC TCGCCGAGGT TCGCGCATCG GGCGCAAGCG AGGGCGAGGA AGTCGATCTC GACGGGACGA CGGCGGTCGT CCAGCGCGTC GACGGCGAGA TGGACGAGTT ACGGGCGACG GCCAACGCGA TCGCCGATTC GGGGTCGGTG GCGGTCGTCG GTTCGGCGGC CGACGGCGCG CAGTTCGTCG TGGCCGCACC CGAGGGTAGC GGTGTCGACG CTGGTTCGGT CGTGAGTGAA CTCGCCGATC GCGTCGGCGG CGGTGGCGGC GGTCCCTCGG ACTTCGCCCA GGGTGGAGGG CCGGACGCCG ACGCCCTCGA TGCGGCACTC GACGCTGCGC CGGACGTCCT GCGGCAGGTC ATGCGTCCTT CGGACCGCTG A
|
Protein sequence | MSDLEEEYRL EYFEEHGFHR KECMECGDHF WTLDGARETC GEPPCDEYSF IDDPGFDEAF ELGEMREQFL SFFEEHDHER IDPYPVAANR WREDVLLTQA SIYDFQPHVT SGASPPPANP LAVSQPCIRM QDIDNVGVTG RHTMAFEMMG HHAFNADEGT DYAYEGEVYW KDECVEYCMD LLAEVGADRE AVTLIEDPWV GGGNAGPAFE VIYEGAELAT LVFMQFERDP DGDFEMKDGN TYSRMDRRVI DTGYGLERWT WMSQGTPTVY EAVYPDMIDF LTDNAGIDHT ADEEEIVHKA AKLSGRMDID EAEDVEAARD DIAAEIGVAT DRLRELVEPL ESIYAIADHS RTLAYMLGDG IVPSNVGTGY LARMVLRRTK RLVDEVGVDA PLDELVDMQA ERLGYDNRDT IRDIVRTEVE KYRETLDRGH RRVEQLAEEY ADSGDPIPAD ELVELYDSHG MQPETVEQIA ADRGVAVETP EDFYSLVAER HEDAERAGST EESDTDERLA DLPETEKLYY DDQTRTEFEA VVLDVFERAD GYDVVLDQTM FYPEGGGQPP DQGTLSTDDE TVEVTDVQIQ DGVILHRTDA DPGTGDFVRG QVDGRRRRRL MAHHTATHIV VHAARQVLGD HIRQAGAQKG TDSARIDVRH YEPVSRAAVK EIERVANDIV TDNAQVTQEW PDRHEAEAEY GFDLYQGGIP AGEQIRLIHV EDDVQACGGT HVSRTGEVGA IKIRSTERIQ DGVVRLTFAA GEAAIEATQA TEDALYEAAE AFDVDPQDVP ETAERFFAEW KERGKEIEAL RAELAEVRAS GASEGEEVDL DGTTAVVQRV DGEMDELRAT ANAIADSGSV AVVGSAADGA QFVVAAPEGS GVDAGSVVSE LADRVGGGGG GPSDFAQGGG PDADALDAAL DAAPDVLRQV MRPSDR
|
| |