Gene PICST_31482 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31482 
Symbol 
ID4838471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp912427 
End bp913887 
Gene Length1461 bp 
Protein Length486 aa 
Translation table12 
GC content41% 
IMG OID640389786 
Productpredicted protein 
Protein accessionXP_001384129 
Protein GI150865068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.2265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.161044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTGT TTATCCAAAG CAATCCATTA TGGGCTGTCT TCGTCTTTTT GGCGTTGCAA 
TTGTCGTCTG CTGCTGCTTT TTCACACAGA GCTTTCGATA TGAATCTTTA CTCTAACGGA
GCCTGTCTTT ATGTTAATGG TTCTAGCAAA GATACATCTC TTCGTTTCGA GTTCTCTGGT
GTCAGACAAC CTACTGCTCT TGTCGTTTTC CAATACGACG ACATCATTGC TTTCAGAAAC
TTGCCCAACT TGAACTATTT CCTCAACCAT TCGTATATAT TGGACAAGCC AATCTTGGGT
TTTGATGATG AGACTGACCA ATTAGGTTTC CTGTTGATTT CTCGTAAAGA CTTCGATATT
CCAGAGTCCA TTTTGGCAAC CCTCGTCCTC TCTGACAAGG AGGTCGATTA TCCAATCGAG
AATGATGGAG TCTACTGCAT CTACATGCCT TTGTACAAGT ACAACGAGAC GCTTGTCTTG
CCCCAGGCCC ATTACAATGC CAGAGTTACC GTGAAGAACG AAACTATGCC TGTCAACATC
TACCACGATA TCTCGACCCA CATCAGTTTG TCGATCGTGT TTGGCCTTGG TTTGGTTCTC
ATGAGCTTAT TCCACCCTGC TATAGGTCAA GGAAAGCTCA CGCAATTGCC TATCGTAACT
CGTTATATCA TTTACTTCTT CATTGTCAAC TTTGTTTACA ACTCTTTGTA CTTCATCTTG
GAACTCATCT GCACTTACTT CCCTAACGAT GCATTGTACG ACTTTACTGA AAACTTTTAC
TTCAGATTGC AAACTGGTGT TATCGACAAG TTCCAACTCT ATATTTTGAT TCTTGTCTAC
TTGGGCTACG GGTATGCCGG TGCTGACGTT AAAATCAACA CCAAGCTTAC CACAACATTT
TGGTTAGCTA ACACCGTTAG TCGCATTGTC ACCGATTACA TTTACACCGA CGTCAATACC
CTCATAGACA TCACCTTGCT CGAAGAAAAG TACCTGATCT TGTACAACTC TGTGATCCTT
CCTGGCGGGT ACAAGAAGAG TTTGATTAGA TCAATGGATC CAAGAGGAAA AGGCATCGTC
TCAACATTCT CCTTGATACA GGTTGTTTCC GAATCATTGT TCTACATTGC TATCACATAT
TATGCCTTCA AGATCAACAA GTACTGGAAG ACGAAGGGAG AAACCCAAAT GAAGAAGTAC
ATATACAAAT CAATAATCTT CCACTTGTTC ATCTACAAGT TTGCCCTCAA GTATGTTGCA
TTGCAAATTT CTACCATCAC ATTCGGCGGC TTCTTCGATG TCGGTGAATT GTTGAAAGTT
CTCGGTAACT TGATTGAAAT CTATGAAATG AAGGTTATCA GTTTGAACGT AATTGAAGTG
TTTATTTTGT GGTTAATCTG GGGTGTCAAT AAGCCATTGA AGCCCGACGG AACGAAGAAG
AAGGAAAAGG AAAAGAAGTA G
 
Protein sequence
MRVFIQSNPL WAVFVFLALQ LSSAAAFSHR AFDMNLYSNG ACLYVNGSSK DTSLRFEFSG 
VRQPTALVVF QYDDIIAFRN LPNLNYFLNH SYILDKPILG FDDETDQLGF SLISRKDFDI
PESILATLVL SDKEVDYPIE NDGVYCIYMP LYKYNETLVL PQAHYNARVT VKNETMPVNI
YHDISTHISL SIVFGLGLVL MSLFHPAIGQ GKLTQLPIVT RYIIYFFIVN FVYNSLYFIL
ELICTYFPND ALYDFTENFY FRLQTGVIDK FQLYILILVY LGYGYAGADV KINTKLTTTF
WLANTVSRIV TDYIYTDVNT LIDITLLEEK YSILYNSVIL PGGYKKSLIR SMDPRGKGIV
STFSLIQVVS ESLFYIAITY YAFKINKYWK TKGETQMKKY IYKSIIFHLF IYKFALKYVA
LQISTITFGG FFDVGELLKV LGNLIEIYEM KVISLNVIEV FILWLIWGVN KPLKPDGTKK
KEKEKK