Gene PICST_40652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40652 
Symbol 
ID4836772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp604215 
End bp605507 
Gene Length1293 bp 
Protein Length430 aa 
Translation table12 
GC content45% 
IMG OID640388087 
Productpredicted protein 
Protein accessionXP_001382878 
Protein GI150864161 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.470816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAT CAAGTCTTTG CAAGCAGGCT TTGACGGCTT CGTCTATGAA TTACAAGCTG 
GTACAGGCAA CAGCCATCAG AAGCTTCCAC GAGTCACAGA TCCATTTCAA CAAAGAACAA
GCACCACAAG GATCGCCTTT AAAGGTCTTT TTCGATACCT TCAAGAACGA AGTCAAGAAA
TCGAACGAGT TGAAGGAAAA TATCAAGGCT TTACAGGATG AGTCTGGAAG AATGGCTGAA
TCTGAAGCTT TCAAGAAGGC CAGAGAAGCT TATGAGACAG CACAGAAAGG TAGTAATGCT
GCTGGTAAAG TGCTCAAGCT GACTGCAGAC GTTGTAGGCG GTGCTGCTGT AAAAGCATGG
GATTCTCCTG TTGGTAAGGG AGTCAGAACT ACAGTACGTG TAAGTGCTGA AGTAGCAGAC
AAAGCCTTTG AGCCTGTAAG ACAGACACAA GTCTACAAGG ATGTTTCTGA AGTTATTGAT
GATGGTTCGT CGACTTCATA TGGTGGATTT TTGACAAAGG AGCAGAGACA GAGATTGAGA
GAGAAGGAAT TGGAGGAAAG AGCCAGAAAG GGAGTCAAGG GTCCTGTTCG TGAGAACGAA
GAGGCTGGCG GAGAGTTAGT AGCCACTGAA CATAAGGCTT CAGGTCCATC TGTTGGTGAA
AGATGGGAAG AGTACAAACT TAAAACACCT GTGGGCCGTT TCTTTACGTA CTTGCAAGAG
AAATGGCAGG ACTCTGAGAA TGGCTTGATT TCACTTATAA GAACCATCAT TGAAAAAGTA
ACAGGGTTCT TTGCCGAAAC TGAACAAGCC AAGGTGGTTA AGCAATTTAG AATGATGGAT
CCTTCTTTCC GTTTAACCGA CTTCCAGAAG ACATTGACCA ACTACATTGT GCCCGAGATC
TTAGATGCCT ACATCAAGAA CGACGAAACC GTGTTGAAGC AATGGTTCTC CGAAGCTCCC
TTCAACGTCT GGCAAGCCAA CAATAAGCAG TTCATCCAAC AGGGCTTGTT CCTGGACGGT
CGTATCTTAG ATATCCGTGG TGTTGAAGTA GTCACATGCA AGCAGTTGCA ACCTAACGAT
ACTCCTGTCA TTGTTGTCAG TTGTCGTGCC CAAGAGGTTC ATTTGTACCG TAAGGCTAAG
ACAGGTGACA TTGCTGCAGG TACCGAGGAT CATATCCAGT TGAGCACATA CGCTATGGTT
CTTACAAGAG TTCCAGAAGA ATTCGACAAC GCCACTACGG AAGGATGGAA GATCATAGAG
TTCGCTCGTG GTGGTTCCAG ACCTTTCCAT TGA
 
Protein sequence
MMKSSLCKQA LTASSMNYKS VQATAIRSFH ESQIHFNKEQ APQGSPLKVF FDTFKNEVKK 
SNELKENIKA LQDESGRMAE SEAFKKAREA YETAQKGSNA AGKVLKSTAD VVGGAAVKAW
DSPVGKGVRT TVRVSAEVAD KAFEPVRQTQ VYKDVSEVID DGSSTSYGGF LTKEQRQRLR
EKELEERARK GVKGPVRENE EAGGELVATE HKASGPSVGE RWEEYKLKTP VGRFFTYLQE
KWQDSENGLI SLIRTIIEKV TGFFAETEQA KVVKQFRMMD PSFRLTDFQK TLTNYIVPEI
LDAYIKNDET VLKQWFSEAP FNVWQANNKQ FIQQGLFSDG RILDIRGVEV VTCKQLQPND
TPVIVVSCRA QEVHLYRKAK TGDIAAGTED HIQLSTYAMV LTRVPEEFDN ATTEGWKIIE
FARGGSRPFH