Gene Information |
Locus tag | PICST_70094 |
Symbol | HTS1 |
ID | 4837193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2699286 |
End bp | 2700975 |
Gene Length | 1690 bp |
Protein Length | 536 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640388508 |
Product | histidine tRNA synthetase |
Protein accession | XP_001383270 |
Protein GI | 126133490 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
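
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the listed Gene Length agrees with that reading) and that the protein is followed by a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


span = gene_length(2699286, 2700975)   # Start bp, End bp from the record
cds = coding_length(536)               # Protein Length from the record

print(span)        # 1690, matching the listed Gene Length
print(cds)         # 1611
print(span - cds)  # 79 bp of the span that is not protein-coding
```

The 79 bp difference suggests the listed span includes some sequence outside the open reading frame; the record itself does not label that region.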
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.768048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.208592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGAAAAGAAG GATGTCGACT GAAACTGCTG CCAGTGCTGC TGAAAAAATA ACCGGAGCCA ATGCTCTTGC TGCTGCCAAA CCAACCAACC AGAAGTCTTC GAAGAAGTCC AAGAAGCAAG ATGCTCAGCA ATTTCTCTTG AAAACACCCA AAGGTACCAA GGACTGGTTT GACAAAGATA TGGTTATCAG AGATGCCATA TTTGGTTCGC TCACTTCGTT GTTCAAGAGA CACGGTGGTG TAACTATTGA TACTCCAGTA TTTGAACTTA GAGAAATCTT GACGGGAAAA TATGGTGAAG ACTCCAAATT GATCTACAAC TTGGAAGACC AAGGTGGTGA ATTGACATCA TTGAGATATG ACTTAACTGT TCCATTTGCC AGATTTGTAG CCATGAACTC TATCAGCTCT ATTAAGAGAT ACCATATTGC TAAAGTGTAC AGAAGAGACC AGCCAGCTAT GACCAAAGGT AGAATGAGAG AATTCTACCA ATGTGACATT GACATTGCAG GTAACTACGA TTCTATGGTT CCCGACTCCG AGATCTTGAG CATCGTTTGT GAAGGTTTGA CCAACTTGGG CATCAACGAG TTCAAAGTTA AGTTGAACCA CAGAAAGATC TTAGACGGGA TCTTTGAAGC CTGTGGAGTC AAAGAAGAAT ACGTGAGAAA GGTCTCGTCT GCTGTCGACA AGTTGGACAA ATTGCCCTGG GAAGCTGTCA AAAAAGAGAT GGTTCAAGAA AAGGACCAGC CTGAAGAAGT TGCCGACAAG ATCGGAGAGT TTGTCAAGGT CAAGGGTTCT ATTCGTGAAA CCTTGTCTTT TTTGAAGTCC AGCGAATTGT TAGCTAATAA CGCTTCTGCT CAGAAGGGTA TTGAAGAAAT GACTACCTTG GCTGACTATG TCGACGCCTT TGGCATTGGT GAGAAGATCA GCTTTGACTT GTCGTTGGCT AGAGGGTTGG ACTACTACAC TGGTTTGATC TATGAAGCTG TGACTGAAGG CTCTGCTCCT CCAGAAAATG CTGATGAGTT GAAGTCCAAA GCCCAGAAGA ATTCCAAGGA TAAGGAAGTC GATGATGCTT CTGAATACGT TGGTATCGGT TCCATTGCCG CTGGTGGTCG TTACGATAAC TTGGTAGGAA TGTTCTCCAA CGGTAAATCC ATTCCTTGTG TAGGTATCTC TTTTGGTGTG GAAAGAATCT TTTCAATCAT CAAGGCAAGA GCTGCTAAAC AGCTCGACAA GATCGGCTCG GCGCACACCC AGGTGTACGT GATGGCTTTT GGGGGTGGAG AAGGTTGGAA CGGATTCTTG AAGGAAAGAA TGGCTGTCAC CAACCAATTG TGGCAGGAAG GAGTAAATGC CGAATACTTA TACAAGGGCA AGGCCAACAT CCGTAAGCAA TTCGATGCTG CTGAAAAGAC TGGCGCTAAG GTTGCTGTCA TTCTTGGTAA AGAAGAGTAT CCAGCTGGCC AGATAAGACT CAAGGTTTTG GGACAAGGCG CCGAAAGTGA CGAAGGGGAA TTGATCAAGG CTGAAAATCT TGTTGAAGCC GTGAAGGCCA AGTTGGATTC GTTGAATCAA GACGGTTTGG ACGACATCAC ACGTTTGATT AGAGCCATTT AGATGTACAT AGTCTCTTCT TATCATGTTT ATGAAACAAC AAATAAAGCA ATATAAACTT CTTTCTGGTT
|
Protein sequence | MSTETAASAA EKITGANALA AAKPTNQKSS KKSKKQDAQQ FLLKTPKGTK DWFDKDMVIR DAIFGSLTSL FKRHGGVTID TPVFELREIL TGKYGEDSKL IYNLEDQGGE LTSLRYDLTV PFARFVAMNS ISSIKRYHIA KVYRRDQPAM TKGRMREFYQ CDIDIAGNYD SMVPDSEILS IVCEGLTNLG INEFKVKLNH RKILDGIFEA CGVKEEYVRK VSSAVDKLDK LPWEAVKKEM VQEKDQPEEV ADKIGEFVKV KGSIRETLSF LKSSELLANN ASAQKGIEEM TTLADYVDAF GIGEKISFDL SLARGLDYYT GLIYEAVTEG SAPPENADEL KSKAQKNSKD KEVDDASEYV GIGSIAAGGR YDNLVGMFSN GKSIPCVGIS FGVERIFSII KARAAKQLDK IGSAHTQVYV MAFGGGEGWN GFLKERMAVT NQLWQEGVNA EYLYKGKANI RKQFDAAEKT GAKVAVILGK EEYPAGQIRL KVLGQGAESD EGELIKAENL VEAVKAKLDS LNQDGLDDIT RLIRAI
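
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the start of the gene sequence maps to the start of the protein sequence under that table. It is an illustration under stated assumptions, not the portal's own method: the codon string is the standard code with the single CTG change, translation is assumed to begin at the first ATG, and `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Assumed to be NCBI table 12:
# identical to the standard code except CTG -> S (index 19).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate from the first ATG to the first stop codon (assumption)."""
    dna = "".join(dna.split()).upper()   # drop the 10-base grouping spaces
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence field, copied from the record.
prefix = "CGAAAAGAAG GATGTCGACT GAAACTGCTG CCAGTGCTGC TGAAAAAATA ACCGGAGCCA"
print(translate(prefix))        # MSTETAASAAEKITGA
print(translate("ATGCTGTAA"))   # MS (CTG read as serine under table 12)
```

The first call reproduces the opening residues of the protein sequence field, which indicates the gene sequence is shown in coding orientation even though the strand is listed as minus.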