Gene PICST_30665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30665 
Symbol 
ID4838142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp630277 
End bp631377 
Gene Length1101 bp 
Protein Length366 aa 
Translation table12 
GC content38% 
IMG OID640389457 
Productpredicted protein 
Protein accessionXP_001383754 
Protein GI150864782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.522798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.228547 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAACT CCGTCTTGAC GAATGATGAA GTGCTATTTA AGACTAGAGA TCCTTTAAAC 
AATGCCAGAG CCATAGGAGT CAAGTATGGG TATGGTGATT CTGGGTCTAA TTTCCACGAT
AAGGGATTAG CAATTTTACG TAAGGATACG GTGACGGGTG AAATTTCGTT CAATCCCATT
AATTATGCAC ATAAGCTTAC ACTGAAGGTG TTTCGATACA TTCGTATCAA GATTCTTAGT
GAGATCGATG GAGACTTCAT GCTTACTGGA CAGTCTTTGT TTGAAAACGA TTTCAGTAGC
AGCAAAGATC AAATTATCAA CGACATCGAA AAAGCCCGAT TTTTTCTCTT TGAGGAAGAT
TTGTTCTACC AGTTGACACG AGAAGCAAAG ACGTTGATCA ACTACAATGT TTCAATTATC
TCTAACAAGA TCATTATAGA GATCAACAAT GAAATAATCG AGATTGAGTC TGTAGTTTAT
GATGAGAACA ACGATGACGA GTTGAATAAC TACTACCAGA ACATCAACGC TTACTCTTCT
ATCAATAATG GCAAGTGTCA ATTGATACTA AAGTTTTTCA AGATCATGTT GTGCTGTTAC
TATAAATACA ACTTGAAGTT GAAGCAAAAG ATTCCCACGT CGTTGACGAA GTGGAAACAG
CTGAATTCAC ATCCATTGGT ACTACGTCCA CTTCTCGGCA ATATCAGACA CGAGTTGAAC
TTGAAAAACA TGCAGAGCAT TATCGACCGA TTGATAGCCA AGTTCAAGGA AAGTCTAGAG
TGTAAATTGC AGGTAGACAA GTTTGCCAAT TTGGAACACA GATCAGAAAA CCCGTTCCAG
AAATCGATCG AAAGACCAGT ATCGAAGTTT AATATCGTGT TGAAGAGTAA ACGTAGCGCC
TACTTGAAGA TTGACTTGGA GTTGACCACC AACGAAATCT TTGTGAATCT CATCATCAAC
ATGAACGTCA TCAAGTTCAA GTGCGAAGAT GATTTTAAGA ACAACTTTAA TGGAGTTAAT
GTGCTACAGA TCAACTTCAA CGATTTCCAT GAGATTGAGG AATGCTTGGA TTGGACTTTG
TTGAACTTTG TCAATGGATA A
 
Protein sequence
MINSVLTNDE VLFKTRDPLN NARAIGVKYG YGDSGSNFHD KGLAILRKDT VTGEISFNPI 
NYAHKLTSKV FRYIRIKILS EIDGDFMLTG QSLFENDFSS SKDQIINDIE KARFFLFEED
LFYQLTREAK TLINYNVSII SNKIIIEINN EIIEIESVVY DENNDDELNN YYQNINAYSS
INNGKCQLIL KFFKIMLCCY YKYNLKLKQK IPTSLTKWKQ SNSHPLVLRP LLGNIRHELN
LKNMQSIIDR LIAKFKESLE CKLQVDKFAN LEHRSENPFQ KSIERPVSKF NIVLKSKRSA
YLKIDLELTT NEIFVNLIIN MNVIKFKCED DFKNNFNGVN VLQINFNDFH EIEECLDWTL
LNFVNG