Gene PICST_55665 details


Gene Information

Locus tag: PICST_55665
Symbol: SHY1
ID: 4837150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1827902
End bp: 1828981
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 12
GC content: 43%
IMG OID: 640388465
Product: mitochondrial protein involved in respiration
Protein accession: XP_001382576
Protein GI: 150863927
COG category: [S] Function unknown
COG ID: [COG3346] Uncharacterized conserved protein
TIGRFAM ID:
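
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon (the gene sequence in this record ends in TAA); the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1827902
end_bp = 1828981

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is read in consecutive triplets.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1080 bp) and Protein Length (359 aa).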


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.585569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCCAT TGAGGTTTTC CAGGAACTTT CATCTCAACA AAACGACTAT CAGAACCGTC 
AAGACATCCA CCGTGGACTG GAAACCGATT ATATCTACGA AGGGAAACTT GGCAACAATT
GAGTATCAAT CTAAGATGCC CTTGTTGAGA AAATTCTTCC TCGGCTTGAT GATAGCTATG
CCTGTTATTT CGTTTGTATT AGGCTGTTGG CAAGTTAAGA GACTTCAGTG GAAGACAGCT
TTGATATCCA AATGTGAGAA CGCTTTGGCG CAACCACCCA TTGAAGAAAT TCCGGCCGAG
CTCGATCCAG ATGCTATTGT AGACTTTGAG TACCGTAGAT TCAAATGTAA GGGACATTTT
GACTACGATC AAGAGATATT CTTGGGTCCC AGAATCAGAG ATGGCCAGTT AGGATATTTG
GTTATCACTC CGTTCGTCAG AACTTCTGGC GGAAAGCCTA TTTTGGTTGA AAGAGGCTGG
ATTCACAAAG ATAAGGTAGT TCCAGAAACT AGAAAACATG GCTATTTGTC TCATTTGGCA
TTTCCTCAGG GTGAAATCGA AATCGAAGCC TTGTTCAGAG TGATGCCAGT TAAGTCGTAC
TTACAATTTG ACCACCAAGA TGGAGCCAGA CTCTTCAATG TTCATGATGT GCCGGAAATG
GCCAAGCAGT CTGGCGCTTT ACCTATTTAT TGTCAGATGA TATATGATCT TAGAGACCAT
GTGGACTGGA AGGGCCCCGA TGATGCCAAA AAACCTGCTA GCAAAAGTTC GTGGTTGAAG
TCGCTTGCTT TTGCTCAGAA GCAAGAGCCA CAGGACGATG CCCATTTCAT CTCATCTCAG
GCTGAATTCG ATCACACTTT GGAATACCAA GATTTTGAAT TCGTCAAGCA GGGTGTACCT
ATTGCACCCA CACCCAAGTT GAAGTTCAGC AATAACCACT TGCAGTACCT TGTGACATGG
TTTGGACTTT CAATTTGCAG CGCTGGACTT TTGATTTACA GTTTTATGAA GAAGGGAAGA
TACCTGAGTG CTGAAAAAGT GATTGCTGAG AAGAGAAGAC AGATGGGAAG AACATTCTAA
 
Protein sequence
MVPLRFSRNF HLNKTTIRTV KTSTVDWKPI ISTKGNLATI EYQSKMPLLR KFFLGLMIAM 
PVISFVLGCW QVKRLQWKTA LISKCENALA QPPIEEIPAE LDPDAIVDFE YRRFKCKGHF
DYDQEIFLGP RIRDGQLGYL VITPFVRTSG GKPILVERGW IHKDKVVPET RKHGYLSHLA
FPQGEIEIEA LFRVMPVKSY LQFDHQDGAR LFNVHDVPEM AKQSGALPIY CQMIYDLRDH
VDWKGPDDAK KPASKSSWLK SLAFAQKQEP QDDAHFISSQ AEFDHTLEYQ DFEFVKQGVP
IAPTPKLKFS NNHLQYLVTW FGLSICSAGL LIYSFMKKGR YSSAEKVIAE KRRQMGRTF
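
The record lists translation table 12 (the NCBI Alternative Yeast Nuclear Code), which differs from the standard code in reading CTG as serine rather than leucine. The sketch below is a minimal, dependency-free illustration of how the gene sequence maps to the protein sequence under that table. The function names `translate` and `gc_percent` are hypothetical helpers written for this example, not part of any database tool, and the input is only the last 60-base line of the gene sequence above.

```python
# Build the standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD_TABLE = dict(zip(CODONS, STANDARD_AA))

# Translation table 12 differs from the standard code at one codon:
# CTG is read as serine (S) instead of leucine (L).
TABLE_12 = dict(STANDARD_TABLE)
TABLE_12["CTG"] = "S"


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(dna.split()).upper()


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = table[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence in this record (it starts in frame,
# because every preceding line holds exactly 60 bases).
last_line = "TACCTGAGTG CTGAAAAAGT GATTGCTGAG AAGAGAAGAC AGATGGGAAG AACATTCTAA"

print(translate(last_line, TABLE_12))        # YSSAEKVIAEKRRQMGRTF
print(translate(last_line, STANDARD_TABLE))  # YLSAEKVIAEKRRQMGRTF
```

With table 12 the output matches the end of the protein sequence listed above (`YSSAEKVIAE KRRQMGRTF`), while the standard code would give a leucine at the second position. Passing the full gene sequence to `translate` and `gc_percent` in the same way is a quick consistency check on the Protein Length and GC content fields.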