Gene Information |
Locus tag | PICST_88206 |
Symbol | ALA1 |
ID | 4837683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 711471 |
End bp | 714506 |
Gene Length | 3036 bp |
Protein Length | 954 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388998 |
Product | Alanyl-tRNA synthetase, cytoplasmic (Alanine--tRNA ligase) (AlaRS) |
Protein accession | XP_001383420 |
Protein GI | 150864558 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
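The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the stated Gene Length of 3036 bp but is not stated in the record itself. The helper names `span_length` and `coding_length` are illustrative only. The result suggests that the 3036 bp span is longer than the 2865 bp needed to encode a 954 aa protein plus a stop codon, so the listed gene region presumably includes some flanking, non-coding sequence.

```python
# Values copied from the record above.
START_BP = 711471
END_BP = 714506
PROTEIN_LENGTH_AA = 954


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)

print(gene_length)               # 3036, matches "Gene Length"
print(cds_length)                # 2865
print(gene_length - cds_length)  # 171 bases outside the coding region
```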
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.898121 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AGAGAACCAT GTCCACCACT ACCACGTCCG AGTGGACGGC TTCCAAAGTC AGATCCACCT TCTTGGACTA CTTCACCAAT AAAAGAGACC ACAAGTTTGT TCCTTCTTCC TCGGTTGTTC CACACAATGA TCCAACTCTC TTATTCGCCA ATGCTGGGAT GAACCAGTAC AAGCCTATCT TCCTAGGCAC TGCAGACCCA GCCTCGGATC TCGCTTCATT GAAGAGAGCT GCCAACTCTC AGAAGTGTAT CAGAGCTGGT GGTAAACACA ACGATTTGGA AGACGTCGGC CGTGACTCGT ATCACCACAC ATTCTTTGAG ATGTTGGGTA ACTGGTCCTT TGGCGACTAC TTCAAGCCAG AAGCCATTAC TTGGGCCTGG GAGCTTTTGA CTGAGGTCTA CGGCTTGGAT AAAGACAGAT TGTATGTGAC GTATTTCGAA GGTGACGCCA AGCAGGGTTT GGAACCAGAT TTGGAGGCCA AGTCTTTCTG GCTTAAGGTT GGTGTCGCTG AAGACCACAT TTTGCCAGGA AACGCTAAGG ACAACTTTTG GGAAATGGGC GACCAGGGTC CTTGTGGTCC TTGTTCTGAG ATCCACTACG ACCGTATTGG AGGTAGAAAC GCTGCTTCTC TCGTCAACCA GGACGACCCC AACGTCTTGG AAGTTTGGAA CATCGTCTTC ATCCAGTACA ACAGAGAGGC CGACAGCTCG TTAAAGACAT TGCCAGCCAA GCACATCGAT ACCGGTATGG GGTTTGAGCG TTTGGTGTCT GTTTTGCAGG ACAAATCTTC CAACTACGAT ACCGACGTGT TTCTTCCCAT CTTTGACAAG ATCAGAGACA TCACGGGTAT CAGACCCTAC GGTGGTAAAT TCGGCGCTGA AGACACCGAC GGTATTGATA CGGCCTACCG TGTCATTGCT GATCATGTGA GAACATTGAC GTTTGCCATT TGCGACGGTG GTGTTCCTAA CAACGACGGT GCTGGATATG TTTTGAGACG TATCTTAAGA AGAGGCTCTC GTTATGTGCG CAAATACATG AACTACCCCA TTGGTTCCTT CTTCCAACAA TTGGTTGATG TTGTCATCGA GCAGAACAAA GAGATTTTCC CTGAAATCGT CCATGGTGCC CAAGACTTGA AGGAAATCTT GAACGAAGAA GAGTTGTCGT TTGCTAAGAC TTTGGATAGA GGTGAAAAGT TGTTTGAACA GTACGCCATC ATCGCGTCAA AGACTCCAGA ACAGACTTTG TCTGGAAAAG ATGTGTGGAG ATTGTACGAC ACATACGGTT TCCCATCGGA CTTGACTCGT TTGATGGCCG AAGAAGCCGG CTTAAAGATC GACGAGCCTG CTTTTGAAAA GGCCCGTCTT GAATCCAAAG AAGCTTCTAA GGCTATAGGC AACAAAGACG GTGTGGAATT GGTCAAGTTG GATGTTCATT CCTTGTCGGA ATTAGACTCC AATGCTAACG TCAGCAAGAC TGACGACTCT GCCAAGTATG GCAGAGAAAA CATCAAGGCT ACTATCCAGG CCATCTATTC TAGCTCTGGA TTCGTTGATT CTATTGATGA CTCGTCTGTA CAGTACGGAG TCTTATTGGA CAAAACTCCA TTCTACGCTG AACAGGGTGG ACAGATCTAC GACACTGGTA AGTTGGTAAT CGACGGTAAA GCTGAATTTA ACGTCACCAA TGTTCAAGTC TATGGTGGTT ATGTTCTTCA CACTGGTAAC ATTGTGGAAG GCTCGTTGAA TGTCAAGGAT GACGTCATTG CAACCTATGA TGAGTTGAGA AGATGGCCCA 
TCAGAAATAA CCATACTGGT ACTCACGTGT TGAACTATGC CTTAAGAGAA GTGTTGGGCG ACTCTGTTGA TCAAAGAGGT TCCTTGGTTG CTCCAGAAAA GTTGAGATTC GACTTTTCTC ATAAACAAGC ATTAACTCCT AAGGAGTTGG CTGCCGTTGA ATCTATTTCC AACGAGTACA TCAAGTCGAA CAAAGAGGTT TTCTACAAGG ATGTTTCGTT GAACGAAGCA AGGGAAATCA ACGGCTTGAG AGCTGTCTTT GGAGAAACGT ATCCAGATCC AGTCAGAGTT GTTTCCATCG GTGTTCCTGT AGAAGACTTG CTTGCTGAGC CTAAGAAGGC TGACTGGCAC AAAGTTTCGA TTGAGTTCTG TGGTGGTACC CACGTAGCCA AAACGGGAGA CATCAAGGAC TTGGTCGTGA TTGAAGAATC CGGTATCGCC AAGGGTATCA GAAGAATTGT GGCTGTCACA GGTCATGATG CTCATGCTGT TCAGAAGATC GCCCTGGAGT TCGAAGCTGA ATTGGACCAC GCTGCTGGCT TACCATTTGG TTCTGTCAAG GAAACCAAGG CCAAGGAGTT GGGTGTAGCC TTGAAGAAGT TGTCGATCTC CGTTTTGGAC AAGCAGAAGT TAACTGAAAA GTTCAACAAG ATCGACAAGT CCATCAAGGA CGATTTGAAG ACGAAACAGA AGGCTGAAAC CAAACAGACT TTGGATGTAG TCAATGAATG GTTGGAAAAG AAAGATGCTG GGGACTACTT GGTTGCTCAT GTTCCAATCA ACGCTAACGC CAAAGCCATC ACTGAAGCCT TTAACTTGCT CAAGAAGCTG CACAAGGACA AATCATTGTA CTTGATCACT GGTTTAACTG ACAAGGTTGC TCATGGTTGT TACATTGCTG ACGAAGCTAT TGCCAAAGGT GTCAATGCCA GCGACTTGGC TCAAGCTGTA TCTGCCAAGA TTGGTGGTAA GGCTGGTGGT AAAGGCAACA TCGTCCAAGG TATGGGTGAC AAACCTGAAG GCATTAAGGA GGCTGTTAAC GAAGTGACCC AATTGTTGGC TGAGAAGTTG TAATTGATTG TTGTAGACTA GTAGTAGAGT TAGAGTAAAC TATTCAAGCA AGAGTAGAAC TCTTCAGACA GTAGAACAAT TTAGACAATA GTGGAACAAC GTTATAGCAA TAGCTATATG CTTTGATATA GTATCTGTTA ATAGTATAAT AAATAATGTA AATGTA
|
Protein sequence | MSTTTTSEWT ASKVRSTFLD YFTNKRDHKF VPSSSVVPHN DPTLLFANAG MNQYKPIFLG TADPASDLAS LKRAANSQKC IRAGGKHNDL EDVGRDSYHH TFFEMLGNWS FGDYFKPEAI TWAWELLTEV YGLDKDRLYV TYFEGDAKQG LEPDLEAKSF WLKVGVAEDH ILPGNAKDNF WEMGDQGPCG PCSEIHYDRI GGRNAASLVN QDDPNVLEVW NIVFIQYNRE ADSSLKTLPA KHIDTGMGFE RLVSVLQDKS SNYDTDVFLP IFDKIRDITG IRPYGGKFGA EDTDGIDTAY RVIADHVRTL TFAICDGGVP NNDGAGYVLR RILRRGSRYV RKYMNYPIGS FFQQLVDVVI EQNKEIFPEI VHGAQDLKEI LNEEELSFAK TLDRGEKLFE QYAIIASKTP EQTLSGKDVW RLYDTYGFPS DLTRLMAEEA GLKIDEPAFE KARLESKEAS KAIGNKDGVE LVKLDVHSLS ELDSNANVSK TDDSAKYGRE NIKATIQAIY SSSGFVDSID DSSVQYGVLL DKTPFYAEQG GQIYDTGKLV IDGKAEFNVT NVQVYGGYVL HTGNIVEGSL NVKDDVIATY DELRRWPIRN NHTGTHVLNY ALREVLGDSV DQRGSLVAPE KLRFDFSHKQ ALTPKELAAV ESISNEYIKS NKEVFYKDVS LNEAREINGL RAVFGETYPD PVRVVSIGVP VEDLLAEPKK ADWHKVSIEF CGGTHVAKTG DIKDLVVIEE SGIAKGIRRI VAVTGHDAHA VQKIASEFEA ELDHAAGLPF GSVKETKAKE LGVALKKLSI SVLDKQKLTE KFNKIDKSIK DDLKTKQKAE TKQTLDVVNE WLEKKDAGDY LVAHVPINAN AKAITEAFNL LKKSHKDKSL YLITGLTDKV AHGCYIADEA IAKGVNASDL AQAVSAKIGG KAGGKGNIVQ GMGDKPEGIK EAVNEVTQLL AEKL
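The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps to the protein sequence under that table. The amino-acid string is the NCBI table 12 listing in TCAG codon order, and `translate` is a hypothetical helper written for this illustration, not part of any database tooling. It assumes the input starts in frame and contains only A, C, G, and T.

```python
BASES = "TCAG"

# NCBI translation table 12 in TCAG order (first base varies slowest).
# It differs from the standard code only at CTG, which encodes Ser (S).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TO_AA = {
    a + b + c: TABLE_12[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TO_AA[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the coding region, starting at the ATG in the gene sequence.
print(translate("ATGTCCACCA CTACCACGTC CGAGTGGACG"))  # MSTTTTSEWT

# A fragment near the 3' end that contains a CTG codon, read here as S.
print(translate("AAGAAGCTGCACAAG"))  # KKSHK
```

Under the standard code the second fragment would read KKLHK. The protein sequence in the record shows KKSHK at that position, which agrees with table 12.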