Gene PICST_88206 details


Gene Information

Locus tag: PICST_88206
Symbol: ALA1
ID: 4837683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 711471
End bp: 714506
Gene Length: 3036 bp
Protein Length: 954 aa
Translation table: 12
GC content: 46%
IMG OID: 640388998
Product: Alanyl-tRNA synthetase, cytoplasmic (Alanine--tRNA ligase) (AlaRS)
Protein accession: XP_001383420
Protein GI: 150864558
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0013] Alanyl-tRNA synthetase
TIGRFAM ID: [TIGR00344] alanine--tRNA ligase
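
The gene length follows from the start and end coordinates, assuming both are 1-based and inclusive (an assumption, but one consistent with the 3036 bp reported above). A minimal Python sketch of that arithmetic, where `span_length` and `coding_bases` are illustrative helper names and not part of any database API:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode a protein, including one stop codon."""
    return 3 * protein_length_aa + 3


gene_length = span_length(711471, 714506)   # coordinates from this record
cds_length = coding_bases(954)              # protein length from this record
flanking = gene_length - cds_length         # bases outside the coding region
print(gene_length, cds_length, flanking)
```

Under these assumptions the 954-residue protein accounts for 2865 of the 3036 bases, so the remaining 171 bases in the listed sequence lie outside the coding region.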


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.898121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AGAGAACCAT GTCCACCACT ACCACGTCCG AGTGGACGGC TTCCAAAGTC AGATCCACCT 
TCTTGGACTA CTTCACCAAT AAAAGAGACC ACAAGTTTGT TCCTTCTTCC TCGGTTGTTC
CACACAATGA TCCAACTCTC TTATTCGCCA ATGCTGGGAT GAACCAGTAC AAGCCTATCT
TCCTAGGCAC TGCAGACCCA GCCTCGGATC TCGCTTCATT GAAGAGAGCT GCCAACTCTC
AGAAGTGTAT CAGAGCTGGT GGTAAACACA ACGATTTGGA AGACGTCGGC CGTGACTCGT
ATCACCACAC ATTCTTTGAG ATGTTGGGTA ACTGGTCCTT TGGCGACTAC TTCAAGCCAG
AAGCCATTAC TTGGGCCTGG GAGCTTTTGA CTGAGGTCTA CGGCTTGGAT AAAGACAGAT
TGTATGTGAC GTATTTCGAA GGTGACGCCA AGCAGGGTTT GGAACCAGAT TTGGAGGCCA
AGTCTTTCTG GCTTAAGGTT GGTGTCGCTG AAGACCACAT TTTGCCAGGA AACGCTAAGG
ACAACTTTTG GGAAATGGGC GACCAGGGTC CTTGTGGTCC TTGTTCTGAG ATCCACTACG
ACCGTATTGG AGGTAGAAAC GCTGCTTCTC TCGTCAACCA GGACGACCCC AACGTCTTGG
AAGTTTGGAA CATCGTCTTC ATCCAGTACA ACAGAGAGGC CGACAGCTCG TTAAAGACAT
TGCCAGCCAA GCACATCGAT ACCGGTATGG GGTTTGAGCG TTTGGTGTCT GTTTTGCAGG
ACAAATCTTC CAACTACGAT ACCGACGTGT TTCTTCCCAT CTTTGACAAG ATCAGAGACA
TCACGGGTAT CAGACCCTAC GGTGGTAAAT TCGGCGCTGA AGACACCGAC GGTATTGATA
CGGCCTACCG TGTCATTGCT GATCATGTGA GAACATTGAC GTTTGCCATT TGCGACGGTG
GTGTTCCTAA CAACGACGGT GCTGGATATG TTTTGAGACG TATCTTAAGA AGAGGCTCTC
GTTATGTGCG CAAATACATG AACTACCCCA TTGGTTCCTT CTTCCAACAA TTGGTTGATG
TTGTCATCGA GCAGAACAAA GAGATTTTCC CTGAAATCGT CCATGGTGCC CAAGACTTGA
AGGAAATCTT GAACGAAGAA GAGTTGTCGT TTGCTAAGAC TTTGGATAGA GGTGAAAAGT
TGTTTGAACA GTACGCCATC ATCGCGTCAA AGACTCCAGA ACAGACTTTG TCTGGAAAAG
ATGTGTGGAG ATTGTACGAC ACATACGGTT TCCCATCGGA CTTGACTCGT TTGATGGCCG
AAGAAGCCGG CTTAAAGATC GACGAGCCTG CTTTTGAAAA GGCCCGTCTT GAATCCAAAG
AAGCTTCTAA GGCTATAGGC AACAAAGACG GTGTGGAATT GGTCAAGTTG GATGTTCATT
CCTTGTCGGA ATTAGACTCC AATGCTAACG TCAGCAAGAC TGACGACTCT GCCAAGTATG
GCAGAGAAAA CATCAAGGCT ACTATCCAGG CCATCTATTC TAGCTCTGGA TTCGTTGATT
CTATTGATGA CTCGTCTGTA CAGTACGGAG TCTTATTGGA CAAAACTCCA TTCTACGCTG
AACAGGGTGG ACAGATCTAC GACACTGGTA AGTTGGTAAT CGACGGTAAA GCTGAATTTA
ACGTCACCAA TGTTCAAGTC TATGGTGGTT ATGTTCTTCA CACTGGTAAC ATTGTGGAAG
GCTCGTTGAA TGTCAAGGAT GACGTCATTG CAACCTATGA TGAGTTGAGA AGATGGCCCA
TCAGAAATAA CCATACTGGT ACTCACGTGT TGAACTATGC CTTAAGAGAA GTGTTGGGCG
ACTCTGTTGA TCAAAGAGGT TCCTTGGTTG CTCCAGAAAA GTTGAGATTC GACTTTTCTC
ATAAACAAGC ATTAACTCCT AAGGAGTTGG CTGCCGTTGA ATCTATTTCC AACGAGTACA
TCAAGTCGAA CAAAGAGGTT TTCTACAAGG ATGTTTCGTT GAACGAAGCA AGGGAAATCA
ACGGCTTGAG AGCTGTCTTT GGAGAAACGT ATCCAGATCC AGTCAGAGTT GTTTCCATCG
GTGTTCCTGT AGAAGACTTG CTTGCTGAGC CTAAGAAGGC TGACTGGCAC AAAGTTTCGA
TTGAGTTCTG TGGTGGTACC CACGTAGCCA AAACGGGAGA CATCAAGGAC TTGGTCGTGA
TTGAAGAATC CGGTATCGCC AAGGGTATCA GAAGAATTGT GGCTGTCACA GGTCATGATG
CTCATGCTGT TCAGAAGATC GCCCTGGAGT TCGAAGCTGA ATTGGACCAC GCTGCTGGCT
TACCATTTGG TTCTGTCAAG GAAACCAAGG CCAAGGAGTT GGGTGTAGCC TTGAAGAAGT
TGTCGATCTC CGTTTTGGAC AAGCAGAAGT TAACTGAAAA GTTCAACAAG ATCGACAAGT
CCATCAAGGA CGATTTGAAG ACGAAACAGA AGGCTGAAAC CAAACAGACT TTGGATGTAG
TCAATGAATG GTTGGAAAAG AAAGATGCTG GGGACTACTT GGTTGCTCAT GTTCCAATCA
ACGCTAACGC CAAAGCCATC ACTGAAGCCT TTAACTTGCT CAAGAAGCTG CACAAGGACA
AATCATTGTA CTTGATCACT GGTTTAACTG ACAAGGTTGC TCATGGTTGT TACATTGCTG
ACGAAGCTAT TGCCAAAGGT GTCAATGCCA GCGACTTGGC TCAAGCTGTA TCTGCCAAGA
TTGGTGGTAA GGCTGGTGGT AAAGGCAACA TCGTCCAAGG TATGGGTGAC AAACCTGAAG
GCATTAAGGA GGCTGTTAAC GAAGTGACCC AATTGTTGGC TGAGAAGTTG TAATTGATTG
TTGTAGACTA GTAGTAGAGT TAGAGTAAAC TATTCAAGCA AGAGTAGAAC TCTTCAGACA
GTAGAACAAT TTAGACAATA GTGGAACAAC GTTATAGCAA TAGCTATATG CTTTGATATA
GTATCTGTTA ATAGTATAAT AAATAATGTA AATGTA
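
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any calculation. The Python sketch below strips the whitespace and computes a GC fraction. The helper names `clean_sequence` and `gc_fraction` are illustrative, and the fragment used is only the first two blocks of the sequence, so its value is not expected to match the 46% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("AGAGAACCAT GTCCACCACT")
print(len(fragment), gc_fraction(fragment))
```

To run it on the full gene, paste all sequence lines into one string and pass it through `clean_sequence` first.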
 
Protein sequence
MSTTTTSEWT ASKVRSTFLD YFTNKRDHKF VPSSSVVPHN DPTLLFANAG MNQYKPIFLG 
TADPASDLAS LKRAANSQKC IRAGGKHNDL EDVGRDSYHH TFFEMLGNWS FGDYFKPEAI
TWAWELLTEV YGLDKDRLYV TYFEGDAKQG LEPDLEAKSF WLKVGVAEDH ILPGNAKDNF
WEMGDQGPCG PCSEIHYDRI GGRNAASLVN QDDPNVLEVW NIVFIQYNRE ADSSLKTLPA
KHIDTGMGFE RLVSVLQDKS SNYDTDVFLP IFDKIRDITG IRPYGGKFGA EDTDGIDTAY
RVIADHVRTL TFAICDGGVP NNDGAGYVLR RILRRGSRYV RKYMNYPIGS FFQQLVDVVI
EQNKEIFPEI VHGAQDLKEI LNEEELSFAK TLDRGEKLFE QYAIIASKTP EQTLSGKDVW
RLYDTYGFPS DLTRLMAEEA GLKIDEPAFE KARLESKEAS KAIGNKDGVE LVKLDVHSLS
ELDSNANVSK TDDSAKYGRE NIKATIQAIY SSSGFVDSID DSSVQYGVLL DKTPFYAEQG
GQIYDTGKLV IDGKAEFNVT NVQVYGGYVL HTGNIVEGSL NVKDDVIATY DELRRWPIRN
NHTGTHVLNY ALREVLGDSV DQRGSLVAPE KLRFDFSHKQ ALTPKELAAV ESISNEYIKS
NKEVFYKDVS LNEAREINGL RAVFGETYPD PVRVVSIGVP VEDLLAEPKK ADWHKVSIEF
CGGTHVAKTG DIKDLVVIEE SGIAKGIRRI VAVTGHDAHA VQKIASEFEA ELDHAAGLPF
GSVKETKAKE LGVALKKLSI SVLDKQKLTE KFNKIDKSIK DDLKTKQKAE TKQTLDVVNE
WLEKKDAGDY LVAHVPINAN AKAITEAFNL LKKSHKDKSL YLITGLTDKV AHGCYIADEA
IAKGVNASDL AQAVSAKIGG KAGGKGNIVQ GMGDKPEGIK EAVNEVTQLL AEKL