Gene PICST_54387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_54387 
Symbol 
ID4837224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2280889 
End bp2282145 
Gene Length1257 bp 
Protein Length418 aa 
Translation table12 
GC content40% 
IMG OID640388539 
ProductPutative trehalase N2227-like protein 
Protein accessionXP_001383193 
Protein GI150864402 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.331119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCAC GTTCAAATGT GTTCCAAACT TTGGTTCAGC GTTTAGGACC ACTGTTTTCA 
AAATTGATTA CAAAATTCAC GAATTTTCCA GCAATGTCTT CCAATTCATC TACTTCTGCT
AATAATACTT TAAAAGATAG TTCCAAGTTA GAATTGCTCA CTGCCCTTAG GTCGCTCGAA
TCATATGCTA TAAATACTAA AAAGACCAAC GACAGAAGGC GTAAACTATT CAAGTTAATG
ACATGGAGAC AACAGAAATT GTGTGAAGAC GTCGGCTATT TGCAGAAGTT GAAGAGAATC
GATGTTTCGG TTCAGATGAA CCAGTTCTTT CTCAAGGCTG TATCTAAGCA CTCTTTCGAA
ACATTTGGCT TGTCTTTTCA GGACTATCAC CTACTCAAGG ATCATGACAG TCCGACCCAG
ACTTCGTCTT CTAACTATCG TGTTATCGAG TCCCTTGGCC ATTTTACTCG TGACTGGACA
GCAGAAGGAG AAGTTGAGAT AAAACCAGTA TGGGACTATG TTAGAACTCA GGTAGATAAG
TTGGTCAAGC CTCAGGATAG AGCTAAAACA TGTGTTGTAG TTCCAGGCTC AGGTTTAGGT
CGTATAGCCC ATGAATTAGC GTCTTATGGT TCTGAAACAG AAAGATTTGG TGCTGTTCAC
GCTATAGAAT ACTCGGGACT TATGCATATC TGTAATAGAT TCATGTATTC TTCGCCTGAA
AATAGTTCTC AATCTAAGAA CTATGAAATC TATCCATATG TCCATAGCTG TTCCAACTTT
TACGATTCTC AATCCCAGTT TAAGTCTCTG CACTTTTCTA CGATGAACCA GCCAAAGAAT
TTGCACTTGA ATCATGAAGA TTTCCGCTAC TTTAGTTTAC AAAATAACTA CGAGAATATT
GTCGTGGTTT CAGTCTTCTT CATGGACACG GCAGAGAATT TAGTAGACTA CATGGATGCC
ATCCAAAGTC TCACTGTTCC AAGCAAGAAG AACGGAGTCA AGAACGGCTA CTGGATCAAC
GTAGGACCCT TGAAGTACGG AAGTGCAGCT CAAGTAGAAT TGAATGCTGA CGAGTTTGCC
TTGCTTAGAA AGGGCATGGG CTGGAAAGAC GTCGATAACG TGAAAACTGT ACAAGAACCA
AACAAATATG GCGAGAATGG TTTGGTTGGA TATATTACCC ATCGCGAAAG TATGTGGCAG
GGGTACTATG GCTTGAACAT GTATACCAGT GTCCGAAGCG AAAACACTTG TAAATAG
 
Protein sequence
MISRSNVFQT LVQRLGPSFS KLITKFTNFP AMSSNSSTSA NNTLKDSSKL ELLTALRSLE 
SYAINTKKTN DRRRKLFKLM TWRQQKLCED VGYLQKLKRI DVSVQMNQFF LKAVSKHSFE
TFGLSFQDYH LLKDHDSPTQ TSSSNYRVIE SLGHFTRDWT AEGEVEIKPV WDYVRTQVDK
LVKPQDRAKT CVVVPGSGLG RIAHELASYG SETERFGAVH AIEYSGLMHI CNRFMYSSPE
NSSQSKNYEI YPYVHSCSNF YDSQSQFKSS HFSTMNQPKN LHLNHEDFRY FSLQNNYENI
VVVSVFFMDT AENLVDYMDA IQSLTVPSKK NGVKNGYWIN VGPLKYGSAA QVELNADEFA
LLRKGMGWKD VDNVKTVQEP NKYGENGLVG YITHRESMWQ GYYGLNMYTS VRSENTCK