Gene PICST_70094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_70094 
SymbolHTS1 
ID4837193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2699286 
End bp2700975 
Gene Length1690 bp 
Protein Length536 aa 
Translation table12 
GC content44% 
IMG OID640388508 
Producthistidine tRNA synthetase 
Protein accessionXP_001383270 
Protein GI126133490 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.768048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.208592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGAAAAGAAG GATGTCGACT GAAACTGCTG CCAGTGCTGC TGAAAAAATA ACCGGAGCCA 
ATGCTCTTGC TGCTGCCAAA CCAACCAACC AGAAGTCTTC GAAGAAGTCC AAGAAGCAAG
ATGCTCAGCA ATTTCTCTTG AAAACACCCA AAGGTACCAA GGACTGGTTT GACAAAGATA
TGGTTATCAG AGATGCCATA TTTGGTTCGC TCACTTCGTT GTTCAAGAGA CACGGTGGTG
TAACTATTGA TACTCCAGTA TTTGAACTTA GAGAAATCTT GACGGGAAAA TATGGTGAAG
ACTCCAAATT GATCTACAAC TTGGAAGACC AAGGTGGTGA ATTGACATCA TTGAGATATG
ACTTAACTGT TCCATTTGCC AGATTTGTAG CCATGAACTC TATCAGCTCT ATTAAGAGAT
ACCATATTGC TAAAGTGTAC AGAAGAGACC AGCCAGCTAT GACCAAAGGT AGAATGAGAG
AATTCTACCA ATGTGACATT GACATTGCAG GTAACTACGA TTCTATGGTT CCCGACTCCG
AGATCTTGAG CATCGTTTGT GAAGGTTTGA CCAACTTGGG CATCAACGAG TTCAAAGTTA
AGTTGAACCA CAGAAAGATC TTAGACGGGA TCTTTGAAGC CTGTGGAGTC AAAGAAGAAT
ACGTGAGAAA GGTCTCGTCT GCTGTCGACA AGTTGGACAA ATTGCCCTGG GAAGCTGTCA
AAAAAGAGAT GGTTCAAGAA AAGGACCAGC CTGAAGAAGT TGCCGACAAG ATCGGAGAGT
TTGTCAAGGT CAAGGGTTCT ATTCGTGAAA CCTTGTCTTT TTTGAAGTCC AGCGAATTGT
TAGCTAATAA CGCTTCTGCT CAGAAGGGTA TTGAAGAAAT GACTACCTTG GCTGACTATG
TCGACGCCTT TGGCATTGGT GAGAAGATCA GCTTTGACTT GTCGTTGGCT AGAGGGTTGG
ACTACTACAC TGGTTTGATC TATGAAGCTG TGACTGAAGG CTCTGCTCCT CCAGAAAATG
CTGATGAGTT GAAGTCCAAA GCCCAGAAGA ATTCCAAGGA TAAGGAAGTC GATGATGCTT
CTGAATACGT TGGTATCGGT TCCATTGCCG CTGGTGGTCG TTACGATAAC TTGGTAGGAA
TGTTCTCCAA CGGTAAATCC ATTCCTTGTG TAGGTATCTC TTTTGGTGTG GAAAGAATCT
TTTCAATCAT CAAGGCAAGA GCTGCTAAAC AGCTCGACAA GATCGGCTCG GCGCACACCC
AGGTGTACGT GATGGCTTTT GGGGGTGGAG AAGGTTGGAA CGGATTCTTG AAGGAAAGAA
TGGCTGTCAC CAACCAATTG TGGCAGGAAG GAGTAAATGC CGAATACTTA TACAAGGGCA
AGGCCAACAT CCGTAAGCAA TTCGATGCTG CTGAAAAGAC TGGCGCTAAG GTTGCTGTCA
TTCTTGGTAA AGAAGAGTAT CCAGCTGGCC AGATAAGACT CAAGGTTTTG GGACAAGGCG
CCGAAAGTGA CGAAGGGGAA TTGATCAAGG CTGAAAATCT TGTTGAAGCC GTGAAGGCCA
AGTTGGATTC GTTGAATCAA GACGGTTTGG ACGACATCAC ACGTTTGATT AGAGCCATTT
AGATGTACAT AGTCTCTTCT TATCATGTTT ATGAAACAAC AAATAAAGCA ATATAAACTT
CTTTCTGGTT
 
Protein sequence
MSTETAASAA EKITGANALA AAKPTNQKSS KKSKKQDAQQ FLLKTPKGTK DWFDKDMVIR 
DAIFGSLTSL FKRHGGVTID TPVFELREIL TGKYGEDSKL IYNLEDQGGE LTSLRYDLTV
PFARFVAMNS ISSIKRYHIA KVYRRDQPAM TKGRMREFYQ CDIDIAGNYD SMVPDSEILS
IVCEGLTNLG INEFKVKLNH RKILDGIFEA CGVKEEYVRK VSSAVDKLDK LPWEAVKKEM
VQEKDQPEEV ADKIGEFVKV KGSIRETLSF LKSSELLANN ASAQKGIEEM TTLADYVDAF
GIGEKISFDL SLARGLDYYT GLIYEAVTEG SAPPENADEL KSKAQKNSKD KEVDDASEYV
GIGSIAAGGR YDNLVGMFSN GKSIPCVGIS FGVERIFSII KARAAKQLDK IGSAHTQVYV
MAFGGGEGWN GFLKERMAVT NQLWQEGVNA EYLYKGKANI RKQFDAAEKT GAKVAVILGK
EEYPAGQIRL KVLGQGAESD EGELIKAENL VEAVKAKLDS LNQDGLDDIT RLIRAI